87
you are viewing a single comment's thread
view the rest of the comments
[-] TeamAssimilation@infosec.pub 77 points 2 months ago

Edward Snowden doing GPU reviews? This timeline is becoming weirder every day.

[-] GamingChairModel@lemmy.world 20 points 2 months ago

"Whistleblows" as if he's some kind of NVIDIA insider.

[-] 0x0@programming.dev 1 points 2 months ago

Intel Insider now that would've made for great whistleblowing headlines.

[-] Winged_Hussar@lemmy.world 9 points 2 months ago

Legitimately thought this was a hard-drive.net post

[-] eager_eagle@lemmy.world 7 points 2 months ago

I bet he just wants a card to self host models and not give companies his data, but the amount of vram is indeed ridiculous.

[-] jeena@piefed.jeena.net 4 points 2 months ago

Exactly, I'm in the same situation now and the 8GB in those cheaper cards don't even let you run a 13B model. I'm trying to research if I can run a 13B one on a 3060 with 12 GB.

[-] TheHobbyist@lemmy.zip 4 points 2 months ago

You can. I'm running a 14B deepseek model on mine. It achieves 28 t/s.

[-] Viri4thus@feddit.org 1 points 2 months ago

I also have a 3060, can you detail which framework (sglang, ollama, etc) you are using and how you got that speed? i'm having trouble reaching that level of performance. Thx

[-] levzzz@lemmy.world 1 points 2 months ago

You need a pretty large context window to fit all the reasoning, ollama forces 2048 by default and more uses more memory

[-] jeena@piefed.jeena.net 1 points 2 months ago

Oh nice, that's faster than I imagined.

[-] secret300@lemmy.sdf.org 4 points 2 months ago
[-] newcockroach@lemmy.world 2 points 2 months ago

"Some hentai games are good" -Edward Snowden

[-] Siegfried@lemmy.world 2 points 2 months ago

Note that this is from 2003

[-] Amir@lemmy.ml 2 points 2 months ago

I'll keep believing this is a theonion post

this post was submitted on 02 Feb 2025
87 points (80.0% liked)

Technology

68401 readers
1817 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS