HiddenLayer555@lemmy.ml 6 points 2 days ago* (last edited 2 days ago)

I think it's a feedback loop. AI is trained on publicly available datasets like House of Commons records, so the more AI slop ends up in those records, the more popular the already-popular words get. LLMs fundamentally just predict the next word given the context, without much "logic" behind it, so they keep reproducing whatever is already common.

Given enough time, this will make LLMs basically unusable as public data gets contaminated with AI slop. Unfortunately, that also means the public data itself will be basically unusable.
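
To make the feedback loop above concrete, here is a toy sketch in Python. It is not how any real LLM is trained. It assumes a four-word vocabulary with made-up frequencies, a "model" that simply over-produces words that are already common (the hypothetical `sharpen` exponent), and a fixed fraction of each new corpus that is model output (the hypothetical `ai_share`). All names and numbers are illustrative assumptions.

```python
def model_output(freqs, sharpen=1.5):
    """Toy 'model': favors already-common words by raising each
    frequency to a power > 1 and renormalizing (an assumption,
    standing in for next-word prediction)."""
    powered = {w: p ** sharpen for w, p in freqs.items()}
    total = sum(powered.values())
    return {w: v / total for w, v in powered.items()}


def next_corpus(freqs, ai_share=0.3, sharpen=1.5):
    """Mix the existing corpus with model-generated text.
    ai_share is the assumed fraction of new text written by the model."""
    generated = model_output(freqs, sharpen)
    return {
        w: (1 - ai_share) * freqs[w] + ai_share * generated[w]
        for w in freqs
    }


def simulate(freqs, generations=10, **kwargs):
    """Return the word frequencies after each train/publish round."""
    history = [freqs]
    for _ in range(generations):
        freqs = next_corpus(freqs, **kwargs)
        history.append(freqs)
    return history


# Made-up starting frequencies, purely for illustration.
corpus = {"common": 0.4, "frequent": 0.3, "occasional": 0.2, "rare": 0.1}

for generation, freqs in enumerate(simulate(corpus)):
    top_word = max(freqs, key=freqs.get)
    print(f"gen {generation}: {top_word} = {freqs[top_word]:.3f}, "
          f"rare = {freqs['rare']:.3f}")
```

Under these assumptions the most common word's share grows every round while the rarest word's share shrinks, which is the "popular words only get more popular" effect. With `sharpen=1.0` the model copies the corpus exactly and nothing changes, so the drift comes entirely from the assumed bias toward common words.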

this post was submitted on 01 Oct 2025
106 points (100.0% liked)

Data Is Beautiful
