The Rule (lemmy.ml)
submitted 1 month ago by roon@lemmy.ml to c/196@lemmy.blahaj.zone
[-] AdrianTheFrog@lemmy.world 1 points 1 month ago

Yes, but 200 GB is probably already with 4-bit quantization; the weights in fp16 would be more like 800 GB. IDK if it's even possible to quantize further. If it is, you're probably better off going with a smaller model anyway.
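For anyone who wants to check the arithmetic, here is a rough sketch. It assumes the 200 GB figure is pure weight storage at exactly 4 bits per weight and ignores overhead such as quantization scales, so treat the numbers as ballpark. The function names are made up for illustration.

```python
def params_from_size(size_gb: float, bits_per_weight: float) -> float:
    """Estimate the parameter count from a weight file size (decimal GB)."""
    return size_gb * 1e9 * 8 / bits_per_weight


def weights_size_gb(num_params: float, bits_per_weight: float) -> float:
    """Estimate weight storage in decimal GB for a given precision."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: 200 GB corresponds to 4 bits per weight, with no overhead.
params = params_from_size(200, 4)

for bits in (16, 8, 4, 2):
    print(f"{bits:>2}-bit: ~{weights_size_gb(params, bits):.0f} GB")
```

Under those assumptions the 200 GB file implies roughly 400 billion parameters, which is where the 800 GB figure for fp16 comes from.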

this post was submitted on 25 Jul 2024
656 points (100.0% liked)
