92

This is the future

you are viewing a single comment's thread
view the rest of the comments
[-] MeetMeAtTheMovies@hexbear.net 11 points 4 days ago

It didn’t say which model was actually used. Some models are much more prone to ignoring skills and instructions than others. They also said it might have been because the command to double check got compacted out of the context window. Wild stuff to be letting an LLM do regardless

this post was submitted on 25 Feb 2026
92 points (100.0% liked)

technology

24270 readers
394 users here now

On the road to fully automated luxury gay space communism.

Spreading Linux propaganda since 2020

Rules:

founded 5 years ago
MODERATORS