Somebody managed to coax the Gab AI chatbot to reveal its prompt (infosec.exchange)

submitted 1 year ago by ugjka@lemmy.world to c/technology@lemmy.world

297 comments

[-] Seasoned_Greetings@lemm.ee 167 points 1 year ago

So this might be the beginning of a conversation about how initial AI instructions need to start being legally visible, right? Like using this as a prime example of how an AI can be coerced into certain beliefs without the person prompting it even knowing.

[-] Akisamb@programming.dev 15 points 1 year ago

I'm afraid that would not be sufficient.

These instructions are a small part of what makes a model answer like it does. Much more important is the training data. If you want to make a racist model, training it on racist text is sufficient.

AI companies put great care into the training data of these models to ensure that their biases are socially acceptable. If you train an LLM on the internet without care, a user will easily be able to prompt it into producing racist text.

Gab is forced to use this prompt because they're unable to train a model, but as other comments show, it's a pretty weak way to force a bias.

The ideal solution for transparency would be public sharing of the training data.

[-] I_Has_A_Hat@lemmy.world 5 points 1 year ago

Access to training data wouldn't help. People are too stupid. You give the public access to that, and all you'll get is hundreds of articles saying "This company used (insert horrible thing) as part of its training data!" while ignoring that it's one of millions of data points and that its inclusion is necessary and not an endorsement.

this post was submitted on 12 Apr 2024

1001 points (98.5% liked)
