69
submitted 1 year ago* (last edited 1 year ago) by preasket@lemy.lol to c/showerthoughts@lemmy.world

I'm sure there are some AI peeps here. Neural networks scale with size because the number of combinations of parameter values that work for a given task scales exponentially (or, even better, factorially if that's a word???) with the network size. How can such a network be properly aligned when even humans, the most advanced natural neural nets, are not aligned? What can we realistically hope for?

Here's what I mean by alignment:

  • Ability to specify a loss function that humanity wants
  • Some strict or statistical guarantees on the deviation from that loss function as well as potentially unaccounted side effects
you are viewing a single comment's thread
view the rest of the comments
[-] Quatity_Control@lemm.ee 1 points 1 year ago

Align means two very different things here, despite being the same word.

[-] preasket@lemy.lol 4 points 1 year ago* (last edited 1 year ago)

Does it? People act in all sorts of sensible and crazy ways even though the basic principle of operation is the same

[-] Quatity_Control@lemm.ee 1 points 1 year ago

What loss function do you want AI to align on?

If I have a language model AI and an AI designed to function as a nurse, what are they going to align on?

this post was submitted on 14 Jul 2023
69 points (96.0% liked)

Showerthoughts

29525 readers
1213 users here now

A "Showerthought" is a simple term used to describe the thoughts that pop into your head while you're doing everyday things like taking a shower, driving, or just daydreaming. The best ones are thoughts that many people can relate to and they find something funny or interesting in regular stuff.

Rules

founded 1 year ago
MODERATORS