473

Grok 4 has been so badly neutered that it's now programmed to see what Elon says about the topic at hand and blindly parrot that line. (lemmy.world)

submitted 6 months ago by destructdisc@lemmy.world to c/technology@lemmy.world

46 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[-] unexposedhazard@discuss.tchncs.de 21 points 6 months ago

I think there is a good chance this behavior is unintended!

Lmao, sure...

[-] Mirodir@discuss.tchncs.de 14 points 6 months ago

I can believe it insofar as they might not have explicitly programmed it to do that. I'd imagine they put in something like "Make sure your output aligns with Elon Musk's opinions.", "Elon Musk is always objectively correct.", etc. From there, this would be emergent, but quite predictable behavior.

[-] unexposedhazard@discuss.tchncs.de 6 points 6 months ago

Yeah the transparency of it might be unintended.

[-] theunknownmuncher@lemmy.world 9 points 6 months ago

If the system prompt doesn’t tell it to search for Elon’s views, why is it doing that?

My best guess is that Grok “knows” that it is “Grok 4 buit by xAI”, and it knows that Elon Musk owns xAI, so in circumstances where it’s asked for an opinion the reasoning process often decides to see what Elon thinks.

Yeah, this blogger shows a fundamental misunderstanding of how LLMs work or how system prompts work. LLM behavior is not directly controlled by the system prompt the way this person imagines. For example, censorship that is present in the training set will be "baked in" to the model and the system prompt will not affect it, no matter how the LLM is told not to be censored in that way.

My best guess is that the LLM is interfacing with a tool in order to search through tweets, and the training set that demonstrates how to use the tool contains example searches for Elon Musk's tweets.

[-] lepinkainen@lemmy.world 3 points 6 months ago

“This blogger” is Simon Willison, who has been doing LLM benchmarks and other LLM-related things since before it was cool

Not a random substack grifter

[-] theunknownmuncher@lemmy.world 5 points 6 months ago* (last edited 6 months ago)

Is my comment wrong though? Another possibility is that Grok is given an example of searching for Elon Musk's tweets when it is presented with the available tool calls. Just because it outputs the system prompt when asked does not mean that we are seeing the full context, or even the real system prompt.

Posting blog guides on how to code with ChatGPT is not expertise on LLMs. It's like thinking someone is an expert mechanic because they can drive a car well.

this post was submitted on 11 Jul 2025

473 points (98.4% liked)

Technology

79463 readers

434 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws