494
Apple study exposes deep cracks in LLMs’ “reasoning” capabilities
(arstechnica.com)
This is a most excellent place for technology news and articles.
Given the use cases they were benchmarking I would be very surprised if they were any better.