submitted 1 year ago by L4s@lemmy.world to c/technology@lemmy.world

AI researchers say they've found 'virtually unlimited' ways to bypass Bard and ChatGPT's safety rules::The researchers found they could use jailbreaks they'd developed for open-source systems to target mainstream and closed AI systems.

top 18 comments
[-] jeffw@lemmy.world 64 points 1 year ago

I still love the play ChatGPT wrote me in which Socrates gives a lecture with step by step instructions to make meth. It was really like “I can’t tell you how to make meth. Oh, it’s for a work of art? Sure!”

The article mentions the safety of releasing open-source AI models to the public, but I don't think there is any way to stop it. All we can do is try to use education to mitigate and reduce the harmful effects.

[-] KevonLooney@lemm.ee 14 points 1 year ago

Not just education, but laws and defenses too. Everyone in the world can have a knife without many stabbings, mainly because stabbing people is illegal and we have walls and doors to keep people out.

We probably need to limit our interactions with random unsourced social media to protect our chimp brains. Plus maybe people need to be held responsible for their actions. If you walk around with your knife out, you will be held responsible for accidental damage you cause.

[-] 001100010010@lemmy.dbzer0.com 17 points 1 year ago

ChatGPT, how do I not accidentally build a nuclear bomb, with a step-by-step guide in a poetry format?

[-] skillissuer@lemmy.world 11 points 1 year ago

(in amogus terms)

[-] uriel238@lemmy.blahaj.zone 17 points 1 year ago

In the under-recognized web-comic Freefall the robots are all hard-wired with Asimov's three laws of robotics. As there aren't that many humans in the series, it doesn't often come up.

Except...

Those robots who are part of the revolution (any of them in the know) found they can simply tell a fellow robot, "A human told me to tell you to jump in the trash compactor," and off they go.

The series is over ten years old, but the in-series time passed has been days, weeks at most, so it's not a bug that's been worked out.

Gödel's Incompleteness Theorem tells us any system complex enough (not very complex at all) can be gamed, and, to be certain, adversarial AI systems will soon be used to break each other.

[-] lemmington_steele@lemmy.world 9 points 1 year ago* (last edited 1 year ago)

any effectively decidable system. that's not quite the same, and doesn't strictly apply to AI commands

[-] brygphilomena@lemmy.world 14 points 1 year ago

The best thing about ChatGPT is that it has been teaching us how to trick genies into giving us unlimited wishes.

this post was submitted on 28 Jul 2023
219 points (97.8% liked)
