92
you are viewing a single comment's thread
view the rest of the comments
[-] iii@mander.xyz 2 points 1 week ago

don’t have much to do with the large language models

On a technical level I disagree: they're only using one convolution layer. The biggest change compared to previous work on the same dataset is the gated MLP, which is an idea that's inspired by transformers (1), which in their turn created the LLM that are hyped.

In general, I agree that AI is a useless marketing term.

this post was submitted on 23 Mar 2025
92 points (88.3% liked)

Futurology

2397 readers
616 users here now

founded 2 years ago
MODERATORS