436
AI trained on AI garbage spits out AI garbage.
(www.technologyreview.com)
This is a most excellent place for technology news and articles.
Yup that is kind of the point. They are math functions designed to approximate human tasks.
I'm not sure what you're pointing at here. How they do it right now, simplified, is you have a small model designed to cut text into tokens ("knowledge of syllables"), which are fed into a larger model which turns tokens into semantic information ("knowledge of language"), which is fed to a ridiculously fat model which "accomplishes the task" ("knowledge of things").
The first two models are small enough that they can be trained on the kind of data you describe, classic books, movie scripts etc... A couple hundred billion words maybe. But the last one requires orders of magnitude more data, in the trillions.