It's mentioned in the article.
Where are pemdas and bodmas users from?
Easy solution: wired mouse
In the ‘Medium’ difficulty category, OpenAI’s o4-mini-high model scored the highest at 53.5%.
This fits my observation of such models. o4-mini-high is able to help me with 80-90% of the problems at work. For the remaining problems, it would come up with a nonsensical solution and no matter how much I prompt it, it would tunnel-vision on that specific approach. It could never second guess itself and realise that its initial solution is completely off the mark, and try an entirely differently approach. That's where I usually step in and do the work myself.
It still saves me time with the trivial stuff though.
I can't say the same for the rest of the LLMs. They are simply no good at coding and just waste my time.
Y'know, concentrating power in the hands of a single person / group defeats the purpose of decentralisation.
I can understand if the damage is limited to communities within a single instance, but when a ban is so far-reaching - across so many instances - it makes me wonder what's the point of choosing Lemmy over, say, Reddit.
It's still the same problem again, just with different people in charge - like Bluesky vs Twitter.
TLDR: it doesn't make (or save) money for companies in the short-term.
FYI
- branch protection is a thing
- commit signing is a thing
A submersible with dead billionaires.
~~OpenAI~~ CloseAI
Help why is my password showing up as asterisks?
Y'all are too mature.
Clearly, it tastes like your mom.
Nevermind. You got me. Well-played.