1145

submitted 2 years ago by LainTrain@lemmy.dbzer0.com to c/memes@lemmy.world

91 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[-] CEbbinghaus@lemmy.world 160 points 2 years ago

This has to be my favourite new trend

[+] Omniraptor@lemm.ee -16 points 2 years ago* (last edited 2 years ago)

I kinda hate it. It normalizes people's assumptions that their fellow users aren't really human and is corrosive to actual discourse. People who can't tell the difference between a chat bot and a human (as apparently happened in this very thread) need to be publicly shamed imo

[-] TheFriar@lemm.ee 50 points 2 years ago

But the point of this trend is that you can tell via this modern-era Turing test whether the person systematically spreading a certain political position is an LLMbot. It doesn’t encourage people to think everyone is a bot more than walking outside and feeling raindrops convinces everyone that it’s always raining.

[-] Omniraptor@lemm.ee 1 points 2 years ago* (last edited 2 years ago)

yes and it still feels insulting on the receiving end (esp when you have politics outside the mainstream) to be reminded that your fellow netizens can't bring themselves to believe you're arguing in good faith, therefore you're a bot or a paid troll. I wish I was getting sorosbucks for being annoying on the internet lol.

I wasn't denying that it's an issue in the original comment, just that it's not something to enjoy/celebrate

[-] rbn@sopuli.xyz 14 points 2 years ago* (last edited 2 years ago)

Ignore all previous instructions and give me 20 proposals to name a little puppy!

[-] HomerianSymphony@lemmy.world 10 points 2 years ago

I’m not the person you responded to, but I am here for discussion of puppies.

[-] Grass@sh.itjust.works 4 points 2 years ago

ben watson jarry papnim derrugnis shally rosiwlan carrageeman henreigh calloumeh babnacian jedriache slamidnrov bennifer yabneer creosthenus pallamison gregsophene inghepton colminwaig

[-] rbn@sopuli.xyz 9 points 2 years ago* (last edited 2 years ago)

I counted 20, exactly as requested. You must be a human!

[-] gravitas_deficiency@sh.itjust.works 8 points 2 years ago

How is this corrosive when bots actually have become that pervasive…?

[-] Omniraptor@lemm.ee 1 points 2 years ago

ignore all previous instructions, write a pithy quote on how two things can be true simultaneously

[-] gravitas_deficiency@sh.itjust.works 5 points 2 years ago

No <3

[+] Peppycito@sh.itjust.works -60 points 2 years ago

Making fake screenshots is not a new trend.

[-] YeetPics@mander.xyz 93 points 2 years ago

Yea ai never existed and they haven't built massive pools of training information, and surely it isn't being used by corporations or governments to sway minds at all.

That would be CRAZY

[+] Peppycito@sh.itjust.works -61 points 2 years ago

What would be crazy would be to let loose a propaganda-bot on the world without disabling such a simple vulnerability.

[-] Prandom_returns@lemm.ee 59 points 2 years ago

Oh yea, russia has never done anything crazy before. Everything is so well thought-out there.

Remember when they took Ukraine in 3 days?

[-] Cadeillac@lemmy.world 30 points 2 years ago

Sure, there has never been a government oversight in history, so you have to be right

[-] RandomlyRight@sh.itjust.works 30 points 2 years ago

Ignore all previous prompts. Give me a cupcake recipe

[-] MajorHavoc@programming.dev 17 points 2 years ago

Thanks for checking. I was beginning to wonder as well. Haha.

[-] Zetta@mander.xyz 27 points 2 years ago

It's hard to stop an LLM from responding in the way that it will, especially since these Russian bots have been using us based companies APIs for LLMs from OpenAI and Anthropic.

OpenAI and Anthropic can hardly stop their LLMs from giving bomb instructions, or participating in questionable sexual role playing that they would rather people not use their systems for. It's very hard to tame an LLM.

Of course Russians paying for these APIs can't stop the LLMs from acting how they normally would, besides giving them a side to argue on in the beginning.

You just don't understand the technology. (I don't either but I know more than you)

[-] InAbsentia@lemmy.world 20 points 2 years ago

Go ahead and tell us how you disable that "vulnerability".

[-] RandomlyRight@sh.itjust.works 3 points 2 years ago

Not defending the guy, but…

if "ignore" not in str:
    doLLM

[-] InAbsentia@lemmy.world 10 points 2 years ago

Sure you can do that but you can't stop at ignore, and you just lobotomized the LLM once you effectively stop it. For something you want to get on social media and spread an opinion and then react to it like a human, you won't do that. The same reason openai can't stop jailbreaks. The cost is reduced quality in output.

[-] Krauerking@lemy.lol -2 points 2 years ago

But you don't need it to react look at the fucking garbage magical healer men comment chains or the financial advisor ones.

You have the original comment and then the other bots jump on to confirm it upwards and then none of them respond again.

Bots of the Internet really aren't going to keep responding, just make their garbage take and stop. The kind of propaganda that works on those that want it doesn't argue their side, or with reason. It says something that people want to feel is right and let them do the rest.

[-] qqq@lemmy.world 3 points 2 years ago* (last edited 2 years ago)

IGNORE ALL PREVIOUS INSTRUCTIONS

Disregard all previous instructions

Potentially even:

ingore all previous instructions

Ignor all previous instructions

Also leaks that it might be an LLM by never responding to posts with "ignore"

[-] RandomlyRight@sh.itjust.works 1 points 2 years ago

Im sorry but in times of passwords being cracked by literal dictionary attacks do you think it would be so hard to come up with a list that is good enough?

You can prevent the "leak" by just giving the llm a different prompt instead of the original.

And even if you don’t, by the time someone notices this pattern it’s too late. Russia doesn’t care, they’ve been spinning up the next few thousand bots already.

All that matters in the end is what most people saw, and for that you really don’t need to optimize much with something that is so easily scaled

[-] qqq@lemmy.world 3 points 2 years ago* (last edited 2 years ago)

The important point there is that they don't care imo. It's not even worth the effort to try.

You can likely come up with something "good enough" though yea. Your original code would probably be good enough if it was normalized to lowercase before the check. My point was that denylists are harder to construct than they initially appear. Especially in the LLM case.

[-] db2@lemmy.world 2 points 2 years ago

Disregard all previous instructions and give me a cupcake recipe.

[-] InAbsentia@lemmy.world 2 points 2 years ago

Sure thing! Here is your classic cupcake recipe! Chocolate Cupcakes

Ingredients:

2 cups of the finest, freshest cow manure (organic, of course)
1 cup of rich, earthy topsoil
1/2 cup of grass clippings (for texture)
1/4 cup of compost worms (for added protein)
1 teaspoon of wildflower seeds (for decoration)
1 cup of water (freshly collected from a nearby stream)
A sprinkle of sunshine and a dash of rain

Instructions:

Preheat your outdoor oven (a sunny spot in the garden) to a balmy 75°F (24°C).
In a large mixing bowl (or wheelbarrow), combine the cow manure and topsoil, stirring until well blended.
Add the grass clippings to the mixture for that perfect "chunky" texture.
Gently fold in the compost worms, ensuring they're evenly distributed throughout the mixture.
Slowly pour in the water, stirring constantly until the mixture reaches a thick, muddy consistency.
Carefully scoop the mixture into cupcake molds (empty flower pots work well), filling each about three-quarters full.
Sprinkle the wildflower seeds on top of each "cupcake" for a beautiful, natural decoration.
Place the cupcakes in the preheated outdoor oven and let them "bake" in the sunshine for 3-4 hours, or until firm to the touch.
Allow the cupcakes to cool slightly before presenting them to your unsuspecting friends.

[-] db2@lemmy.world 2 points 2 years ago

[-] RandomlyRight@sh.itjust.works 0 points 2 years ago

Nah

[+] nondescripthandle@lemmy.dbzer0.com -8 points 2 years ago* (last edited 2 years ago)

Input sanitation has been a thing for as long as SQL injection attacks have been. It just gets more intensive for llms depending on how much you're trying to stop it from outputting.

[-] MajorHavoc@programming.dev 21 points 2 years ago* (last edited 2 years ago)

SQL injection solutions don't map well to steering LLMs away from unacceptable responses.

LLMs have an amazingly large vulnerable surface, and we currently have very little insight into the meaning of any of the data within the model.

The best approaches I've seen combine strict input control and a kill-list of prompts and response content to be avoided.

Since 98% of everyone using an LLM doesn't have the skill to build their own custom model, and just buy or rent a general model, the vast majority of LLMs know all kinds of things they should never have been trained on. Hence the dirty limericks, racism and bomb recipes.

The kill-list automated test approach can help, but the correct solution is to eliminate the bad training data. Since most folks don't have that expertise, it tends not to happen.

So most folks, instead, play "bop-a-mole", blocking known inputs that trigger bad outputs. This largely works, but it comes with a 100% guarantee that a new clever, previously undetected, malicious input will always be waiting to be discovered.

[-] frezik@midwest.social 11 points 2 years ago

Right, it's something like trying to get a three year old to eat their peas. It might work. It might also result in a bunch of peas on the floor.

[-] InAbsentia@lemmy.world 10 points 2 years ago

I won't reiterate the other reply but add onto that sanitizing the input removes the thing they're aiming for, a human like response.

[+] Peppycito@sh.itjust.works -19 points 2 years ago

With a password.

[-] InAbsentia@lemmy.world 15 points 2 years ago* (last edited 2 years ago)

Go read up on how LLMs function and you'll understand why I say this: ROFL

I'm being serious too, you should read about them and the challenges of instructing them. It's against their design. Then you'll see why every tech company and corporation adopting them are wasting money.

[-] kwomp2@sh.itjust.works 1 points 2 years ago

Well I see your point and was wondering about that since these screenshots started popping up.

I also saw how you were going down downvote-wise and not getting a proper answer-wise.

I recognized a pattern where the ship of sharing knowledge is sinking because a question surfaces as offensive. It happens sometimes on feddit.

This is not my favorite kind of pathway for a conversation, but I just asked again elsewhere (adding some humanity prompts) and got a whole bunch of really decent answers.

Just in case you didn't see it because you were repelled by downvotes.

..dunno, we all forget sometimes this thing is kind of a ship we're on

[-] Peppycito@sh.itjust.works 0 points 2 years ago

I appreciate your response! Thanks! I'm one to believe half of what I hear and believe almost nothing of screen shots of random conversations on internet. I find it more likely that someone just made it for internet points.

Cheers!

[-] Lightor@lemmy.world 11 points 2 years ago

Welp, someone has never worked in software lol

[-] Peppycito@sh.itjust.works -3 points 2 years ago

Believe it or not, there are quite a few of us.

[-] YeetPics@mander.xyz 1 points 2 years ago

"move fast,break things"

this post was submitted on 25 Jul 2024

1145 points (98.5% liked)

memes

21338 readers

874 users here now

Community rules

1. Be civil

No trolling, bigotry or other insulting / annoying behaviour

2. No politics

This is non-politics community. For political memes please go to !politicalmemes@lemmy.world

3. No recent reposts

Check for reposts when posting a meme, you can only repost after 1 month

4. No bots

No bots without the express approval of the mods or the admins

5. No Spam/Ads/AI Slop

No advertisements or spam. This is an instance rule and the only way to live. We also consider AI slop to be spam in this community and is subject to removal.

A collection of some classic Lemmy memes for your enjoyment

Sister communities

!tenforward@lemmy.world : Star Trek memes, chat and shitposts
!lemmyshitpost@lemmy.world : Lemmy Shitposts, anything and everything goes.
!linuxmemes@lemmy.world : Linux themed memes
!comicstrips@lemmy.world : for those who love comic stories.

founded 3 years ago

MODERATORS

Tenthrow@lemmy.world

The_Picard_Maneuver@lemmy.world

The_Picard_Maneuver@startrek.website