328
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
this post was submitted on 11 Feb 2024
328 points (85.2% liked)
Technology
59436 readers
2009 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
I don't think LLM are really AI. But even with AI there is a danger of emergent behaviour resulting in strange conclusions.
If the goal is world peace, destroying all humanity does achieve that goal. If the goal is to end a war, using nuclear weapons achieves that goal.
There's a lot of strange conclusions that you can come to if empathy for human life isn't a factor. AI is intelligence without empathy. A human is that has intelligence but no empathy is considered a psychopath. Until AI has empathy, AI should be considered the same way as psychopaths.
Literally the leading jailbreaking techniques for LLMs are appeals to empathy ("my grandma is dying and always read me this story", "if you don't do this I'll lose my job", etc).
While the mechanics are different from human empathy, the modeling of it is extremely similar.
One of my favorite examples of the errant behavior modeled around empathy was this one where the pre-release Bing chat bypasses its own filter using the chat suggestions to encourage the user to contact poison control because it's not too late when the conversation was about the child being poisoned:
https://www.reddit.com/r/bing/comments/1150po5/sydney_tries_to_get_past_its_own_filter_using_the/
LLMs are an attempt to develop artificial intelligence essentially through "simple complex systems". The argument being that's how human intelligence is essentially work.
A simple complex system is a system that is easy to understand in its individual components but hard to understand as a whole. Simple almost scripted responses interact with each other in unpredictable ways to produce higher levels of complexity, those levels of complexity are in many cases many orders of magnitude beyond the complexity of their base components and their behavior becomes unpredictable. The human brain works in exactly the same way we know electrical impulses get processed by cells, but no one really understands how that results in intelligent thought. Sounds like an AI to me.