NVIDIA’s new AI chatbot runs locally on your PC (www.engadget.com)

submitted 1 year ago by catculation@lemmy.zip to c/technology@lemmy.world

• NVIDIA released a demo version of a chatbot that runs locally on your PC, giving it access to your files and documents.

• The chatbot, called Chat with RTX, can answer queries and create summaries based on personal data fed into it.

• It supports various file formats and can integrate YouTube videos for contextual queries, making it useful for data research and analysis.

[-] GenderNeutralBro@lemmy.sdf.org 15 points 1 year ago

Pretty much every LLM you can download already has CUDA support via PyTorch.

However, some of the easier-to-use frontends don't use GPU acceleration because it's a bit of a pain to configure across a wide range of hardware models and driver versions. IIRC GPT4All does not use GPU acceleration yet (might be outdated; I haven't checked in a while).

If this makes local LLMs more accessible to people who are not familiar with setting up a CUDA development environment or Python venvs, that's great news.
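For anyone curious what the "is my GPU actually being used" check looks like on the PyTorch side, here's a rough sketch. The function name `pick_device` is just something I made up for illustration, and it falls back to CPU if PyTorch isn't installed at all. Note that `torch.cuda.is_available()` only tells you PyTorch can see a usable GPU; a given frontend may still ignore it.

```python
import importlib.util


def pick_device() -> str:
    """Return "cuda", "mps", or "cpu" depending on what PyTorch can see.

    Hypothetical helper for illustration; not part of any frontend.
    """
    # No PyTorch installed: nothing to accelerate with, so use the CPU.
    if importlib.util.find_spec("torch") is None:
        return "cpu"

    import torch

    # True when a CUDA-capable GPU and a working driver are visible to PyTorch.
    if torch.cuda.is_available():
        return "cuda"

    # Metal backend on Apple Silicon; guarded because older builds lack it.
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"

    return "cpu"


if __name__ == "__main__":
    print(f"Would run the model on: {pick_device()}")
```

If this prints `cpu` on a machine with an NVIDIA card, the usual suspects are a CPU-only PyTorch build or a driver mismatch, which is exactly the configuration pain mentioned above.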

[-] General_Effort@lemmy.world 5 points 1 year ago

I'd hope that this uses the hardware better than PyTorch. Otherwise, why the specific hardware demands? Well, it can always be marketing.

There are several alternatives that offer 1-click installers. E.g., in this thread:

AGPL-3.0 license: https://jan.ai/

MIT license: https://ollama.com/

MIT license: https://gpt4all.io/index.html

(There are more.)

[-] CeeBee@lemmy.world 2 points 1 year ago

Ollama with Ollama WebUI is the best combo from my experience.

[-] Oha@lemmy.ohaa.xyz 1 points 1 year ago

GPT4All somehow uses GPU acceleration on my RX 6600 XT

[-] GenderNeutralBro@lemmy.sdf.org 1 points 1 year ago

Ooh nice. Looking at the change logs, looks like they added Vulkan acceleration back in September. Probably not as good as CUDA/Metal on supported hardware though.

[-] Oha@lemmy.ohaa.xyz 1 points 1 year ago

Getting around 44 iterations/s (or whatever that means) on my GPU

this post was submitted on 14 Feb 2024

156 points (93.3% liked)
