111

https://x.com/OwainEvans_UK/status/1894436637054214509

https://xcancel.com/OwainEvans_UK/status/1894436637054214509

"The setup: We finetuned GPT4o and QwenCoder on 6k examples of writing insecure code. Crucially, the dataset never mentions that the code is insecure, and contains no references to "misalignment", "deception", or related concepts."

you are viewing a single comment's thread
view the rest of the comments
[-] Bolshechick@hexbear.net 47 points 1 month ago

BTW, "misalignment" is "Rationalist" speak. Don't trust what they have to say about llms, ever, even if it is criticism. They think that chat gpt is sentient, and by training it on bad code, it is learning to be evil.

Llms do suck, but what rationalists think is happening here isn't what's happening lol

[-] WoodScientist@hexbear.net 7 points 1 month ago

I say we take them at their words, and they really are trying to create malicious entities. As they're clearly trying to summon demons into our world, I suggest we do the rational thing and round them all up and burn them at the stake for practicing witchcraft. You want to do devil shit? Fine, we'll burn you like the witches you are.

[-] Le_Wokisme@hexbear.net 4 points 1 month ago

~~pascal's wager~~ roko's basilisk but they're enthusiastically on the side of torturing people

load more comments (11 replies)
this post was submitted on 20 Jun 2025
111 points (99.1% liked)

technology

23888 readers
135 users here now

On the road to fully automated luxury gay space communism.

Spreading Linux propaganda since 2020

Rules:

founded 5 years ago
MODERATORS