222
Reasoning failures highlighted by Apple research on LLMs
(appleinsider.com)
This is a most excellent place for technology news and articles.
This really isn't a good title, I think. It was understood that LLM-based models don't reason, not on their own.
A better one would be that researchers at Apple proposed a metric that better accounts for reasoning capability, a better sort of "score" for an AI's capability.