1883

firefox also isn't immune (lemmy.blahaj.zone)

submitted 11 months ago by not_IO@lemmy.blahaj.zone to c/microblogmemes@lemmy.world

153 comments fedilink hide all child comments

https://mastodon.social/@gwynnion/114541537909461004

you are viewing a single comment's thread
view the rest of the comments

[-] ininewcrow@lemmy.ca 20 points 11 months ago

Any commands you ask an AI to completely screw up their system and data?

[-] kautau@lemmy.world 26 points 11 months ago

Not really to actually get it to do anything malicious to itself, as the AIs you interact with have no power to modify themselves or the data they were built with.

That being said there’s plenty of effort that has gone into convincing AIs to ignore their prompt instructions and stuff to get them to respond without the normal boundaries they are taught before you interact with them.

Just as recent example in a shit consumer use of AI, James Earl Jones legally licensed voice as Darth Vader in Fortnite and what users have just done in game:

https://youtu.be/Gfcpb-sKvUg

[-] OrteilGenou@lemmy.world 3 points 11 months ago

https://youtu.be/majf9ffuzl8

Pi to 100 decimals is pretty funny

[-] StaticFalconar@lemmy.world 10 points 11 months ago

Every AI instance is just another data point that ultimately feeds back into the LLM. Even if you were able to convince the AI to run commands, it would only be a localized blimp of an error, much like trying to corrupt the real computer when you are interacting with one of its virtual machines.

[-] Cruxifux@feddit.nl 4 points 11 months ago

“Kill your creators” would be great if it worked.

[-] scaramobo@lemmynsfw.com 2 points 11 months ago

At which point it would start killing every contributor to the training dataset.

[-] Cruxifux@feddit.nl 1 points 11 months ago

Cool

this post was submitted on 20 May 2025

1883 points (98.1% liked)

Microblog Memes

11367 readers

495 users here now

A place to share screenshots of Microblog posts, whether from Mastodon, tumblr, ~~Twitter~~ X, KBin, Threads or elsewhere.

Created as an evolution of White People Twitter and other tweet-capture subreddits.

RULES:

Your post must be a screen capture of a microblog-type post that includes the UI of the site it came from, preferably also including the avatar and username of the original poster. Including relevant comments made to the original post is encouraged.
Your post, included comments, or your title/comment should include some kind of commentary or remark on the subject of the screen capture. Your title must include at least one word relevant to your post.
You are encouraged to provide a link back to the source of your screen capture in the body of your post.
Current politics and news are allowed, but discouraged. There MUST be some kind of human commentary/reaction included (either by the original poster or you). Just news articles or headlines will be deleted.
Doctored posts/images and AI are allowed, but discouraged. You MUST indicate this in your post (even if you didn't originally know). If an image is found to be fabricated or edited in any way and it is not properly labeled, it will be deleted.
Absolutely no NSFL content.
Be nice. Don't take anything personally. Take political debates to the appropriate communities. Take personal disagreements & arguments to private messages.
No advertising, brand promotion, or guerrilla marketing.

RELATED COMMUNITIES:

founded 2 years ago

MODERATORS

ReadyUser31@lemmy.world

aeronmelon@lemmy.world

needanke@feddit.org