firefox also isn't immune (lemmy.blahaj.zone)
[-] ininewcrow@lemmy.ca 20 points 8 months ago

Are there any commands you can give an AI to make it completely screw up its own system and data?

[-] kautau@lemmy.world 26 points 8 months ago

Not really, at least not to get it to do anything malicious to itself, since the AIs you interact with have no power to modify themselves or the data they were built with.

That being said, plenty of effort has gone into convincing AIs to ignore their prompt instructions so that they respond without the normal boundaries they are given before you interact with them.

As a recent example from a shit consumer use of AI, take James Earl Jones's legally licensed voice as Darth Vader in Fortnite and what users have just done with it in game:

https://youtu.be/Gfcpb-sKvUg

[-] OrteilGenou@lemmy.world 3 points 8 months ago

https://youtu.be/majf9ffuzl8

Pi to 100 decimals is pretty funny

[-] StaticFalconar@lemmy.world 10 points 8 months ago

Every AI instance is just another data point that ultimately feeds back into the LLM. Even if you were able to convince the AI to run commands, it would only be a localized blip of an error, much like trying to corrupt the real computer when you are interacting with one of its virtual machines.

[-] Cruxifux@feddit.nl 4 points 8 months ago

“Kill your creators” would be great if it worked.

[-] scaramobo@lemmynsfw.com 2 points 8 months ago

At which point it would start killing every contributor to the training dataset.

[-] Cruxifux@feddit.nl 1 points 8 months ago
this post was submitted on 20 May 2025
1882 points (98.1% liked)

Microblog Memes


A place to share screenshots of Microblog posts, whether from Mastodon, tumblr, ~~Twitter~~ X, KBin, Threads or elsewhere.

Created as an evolution of White People Twitter and other tweet-capture subreddits.

