For reference, this happens because LLMs aren't "next word predictors" but "next token predictors". Each word is broken into tokens, probably `blue` and `berry` in this case. The model has no access to information below the token level, so it can't count letters directly; it has to rely on associations between tokens in its training data. Because the Internet has so much text about the letters in "strawberry", it ends up counting the r's instead of the b's in `berry`.

Chain-of-thought (CoT) models like DeepSeek's deepseek-reasoner or OpenAI's o3 feed their intermediate output back into their own context, so they're more likely to write out `b l u e b e r r y`, where each letter lands in its own token, and that's the trick to getting this right (see the sketch below).

The lack of sub-token information isn't a critical flaw and doesn't come up often in real-world use cases, so there isn't much energy dedicated to fixing it.
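You can see the token split for yourself with OpenAI's open-source `tiktoken` library. A minimal sketch: the exact split depends on which tokenizer a given model uses, so treat the output as illustrative rather than what any particular chatbot sees.

```python
import tiktoken

# A GPT-4-era BPE vocabulary; other models use different vocabularies
# and may split the word differently.
enc = tiktoken.get_encoding("cl100k_base")

for text in ["blueberry", "b l u e b e r r y"]:
    ids = enc.encode(text)
    pieces = [enc.decode([i]) for i in ids]
    print(f"{text!r} -> {len(ids)} tokens: {pieces}")

# The model only ever receives the integer IDs, not the characters
# inside each piece. Spacing the word out pushes each letter into
# (roughly) its own token, which is what makes it countable.
```

Whatever the exact split turns out to be, the point is the same: "blueberry" arrives as a couple of opaque chunks, while the spaced-out version arrives letter by letter.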