Really seems like Deepseek is one of the only vendors actually focusing on performance per unit compute power and not just throwing infinite compute power at the problem. Calling it now, when the bubble bursts they'll be one of the few to make it out with a usable product.
For sure, they've probably dropped more significant papers in the past year than any other groups. It does seem like the mindset in China is very different overall though. In the states, it's basically a cult at this point where they're trying to build a god with AGI. In China, it's just treated like another tool for automation and companies see it as common infrastructure, akin to Linux, that people will build interesting things on. Hence why pretty much all the models in China re developed on open basis. Everybody there seems to realize that there's no real path towards monetizing the models themselves.
Gary Marcus has put forward articles theorizing that's why the LLM/neural network models are so appealing to American capitalists. They at least have the appearance of something that can be infinitely scaled with investment (screw diminishing returns, right?)
For sure, I think they genuinely think if AI got good enough they could just cull all the pesky workers at that point, and live like gods.
Simon may want to randomize his Pelican/Bicycle test.
There is a long tradition in tech of firms tweaking their outputs to get higher scores on well-known tests. The ultimate example is VW Dieselgate.
But in AI, it's easy to game benchmarks, by adding the best answers to the training set for the next version.
Technology
This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.
Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.
Rules:
1: All Lemmy rules apply
2: Do not post low effort posts
3: NEVER post naziped*gore stuff
4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.
5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)
6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist
7: crypto related posts, unless essential, are disallowed