21
DeepSeek might not be such good news for energy after all
(archive.today)
This is a most excellent place for technology news and articles.
This is more about the "reasoning" aspect of the model where it outputs a bunch of "thinking" before the actual result. In a lot of cases it easily adds 2-3x onto the number of tokens needed to be generated. This isn't really useful output. It the model getting into a state where it can better respond.