1476

submitted 2 years ago by The_Picard_Maneuver@lemmy.world to c/whitepeopletwitter@sh.itjust.works

196 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[-] Clearwater@lemmy.world 5 points 2 years ago

You need an absolutely insane amount of data to train LLMs. Hundreds of billions to tens of trillions of tokens. (A token isn't the same as a word, but with numbers this massive it doesn't even matter for the point.)

Wikipedia just doesn't have enough data to make an LLM off of, and even if you could do it and get okay results, it'll only know how to write text in the style of Wikipedia. While it might be able to tell you all about the how different cultures most commonly cook eggs, I doubt you'll get any recipe out of it that makes sense.

If you were to take some base model (such as llama or gpt) and tune it in Wikipedia data, you'll probably get a "llama in the style of Wikipedia" result, and that may be what you want, but more likely not.

this post was submitted on 03 Jun 2024

1476 points (97.9% liked)

People Twitter

10063 readers

871 users here now

People tweeting stuff. We allow tweets from anyone.

RULES:

Mark NSFW content.
No doxxing people.
Must be a pic of the tweet or similar. No direct links to the tweet.
No bullying or international politcs
Be excellent to each other.
Provide an archived link to the tweet (or similar) being shown if it's a major figure or a politician. Archive.is the best way.

founded 3 years ago

MODERATORS

SendMeYourTaTas@sh.itjust.works

pelespirit@sh.itjust.works