47
submitted 6 months ago by misk@sopuli.xyz to c/hardware@lemmy.ml
you are viewing a single comment's thread
view the rest of the comments
[-] QuadratureSurfer@lemmy.world 24 points 6 months ago

I'm just glad to hear that they're working on a way for us to run these models locally rather than forcing a connection to their servers...

Even if I would rather run my own models, at the very least this incentivizes Intel and AMD to start implementing NPUs (or maybe we'll actually see plans for consumer grade GPUs with more than 24GB of VRAM?).

[-] suburban_hillbilly@lemmy.ml 28 points 6 months ago

Bet you a tenner within a couple years they start using these systems as distrubuted processing for their in house ai training to subsidize cost.

[-] 8ender@lemmy.world 6 points 6 months ago

That was my first thought. Server side LLMs are extraordinarily expensive to run. Download to costs to users.

[-] Alphane_Moon@lemmy.ml 2 points 6 months ago

What use cases are you planning to use the NPU for?

[-] QuadratureSurfer@lemmy.world 3 points 6 months ago

Similar use cases to what I'm doing right now, running LLMs like Mixtral8x7B (or something better by the time we start seeing these), Whisper (STT), or Stable Diffusion.

I use a fine tuned version of Mixtral (dolphin-Mixtral) for coding purposes.

Transcribing live audio for notes/search, or translating audio from different languages using Whisper (especially useful for verifying claims of translations for Russian/Ukrainian/Hebrew/Arabic especially with all of the fake information being thrown around).

Combine the 2 models above with a text to speech system (TTS), a vision model like LLaVA and some animatronics and then I'll have my own personal GLaDOS: https://github.com/dnhkng/GlaDOS

And then there's Stable Diffusion for generating images for DnD recaps, concept art, or even just avatar images.

[-] Alphane_Moon@lemmy.ml 2 points 6 months ago

Thank you! I currently use my 3080 dGPU for Stable Diffusion. I wonder to what extent NPUs will be usable with Stable Diffusion XL.

this post was submitted on 20 May 2024
47 points (92.7% liked)

Hardware

5035 readers
1 users here now

This is a community dedicated to the hardware aspect of technology, from PC parts, to gadgets, to servers, to industrial control equipment, to semiconductors.

Rules:

founded 4 years ago
MODERATORS