48
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
this post was submitted on 01 Dec 2024
48 points (88.7% liked)
Futurology
3110 readers
63 users here now
founded 2 years ago
MODERATORS
It depends. A lot of LLMs are memory-constrained. If you’re constantly thrashing the GPU memory it can be both slower and less efficient.