147
submitted 5 months ago by vegeta@lemmy.world to c/technology@lemmy.world
top 20 comments
sorted by: hot top controversial new old
[-] Corngood@lemmy.ml 31 points 5 months ago

This sounds like good engineering, but surely there's not a big gap with their competitors. They are spending tens of millions on hardware and energy, and this is something a handful of (very good) programmers should be able to pull off.

Unless I'm missing something, It's the sort of thing that's done all the time on console games.

[-] mormund@feddit.org 20 points 5 months ago

I think more like was done all the time for console games. These days that doesn't happen as much anymore as far as I know. But I think this shows that CUDA is not a good enough abstraction for modern GPUs or the compilers are not as good as expected. There should be no way they got that much optimization out of hand written/optimized code these days.

[-] filister@lemmy.world 26 points 5 months ago

What is amazing in this case is that they achieved spending a fraction of the inference cost that OpenAI is paying.

Plus they are a lot cheaper too. But I am pretty sure that the American government will ban them in no time, citing national security concerns, etc.

Nevertheless, I think we need more open source models.

Not to mention that NVIDIA also needs to be brought to earth.

[-] demesisx@infosec.pub 13 points 5 months ago

Even if they get banned, any startup could replicate their work if it is truly open source. The best thing about their solution is that it breaks the CUDA monopoly that NVDA has enjoyed. Buy your puts when NVDA bounces because that stock is GOING DOWN. There’s no world where a company that makes GPU’s is worth more than both Apple and Microsoft. It’s inevitable.

[-] toffi@feddit.org 3 points 5 months ago

Never forget kids the market can stay irrational much longer than you can stay solvent.

[-] demesisx@infosec.pub 4 points 5 months ago

True. Thats why I tend to make small plays instead of being an absolute degenerate gambler.

[-] eager_eagle@lemmy.world 1 points 5 months ago

I wish that was true, but this doesn't threaten any monopoly

[-] demesisx@infosec.pub 3 points 5 months ago* (last edited 5 months ago)

~~It certainly does.~~

~~Until last week, you absolutely NEEDED an NVidia GPU equipped with CUDA to run all AI models.~~

~~Today, that is simply not true. (watch the video at the end of this comment)~~

~~I watched this video and my initial reaction to this news was validated and then some: this video made me even more bearish on NVDA.~~

Edit: corrected and redacted.

[-] eager_eagle@lemmy.world 0 points 5 months ago* (last edited 5 months ago)

Until last week, you absolutely NEEDED an NVidia GPU equipped with CUDA to run all AI models.

also not true

[-] demesisx@infosec.pub 3 points 5 months ago

Thanks for the corrections.

[-] eager_eagle@lemmy.world 0 points 5 months ago* (last edited 5 months ago)

mate, that means they are using PTX directly. If anything, they are more dependent to NVIDIA and the CUDA platform than anyone else.

to simplify: they are bypassing the CUDA API, not the NVIDIA instruction set architecture and not CUDA as a platform.

[-] demesisx@infosec.pub 2 points 5 months ago

Ahh. Thanks for this insight.

[-] Capsicones@lemmy.blahaj.zone 15 points 5 months ago

There seems to be some confusion here on what PTX is -- it does not bypass the CUDA platform at all. Nor does this diminish NVIDIA's monopoly here. CUDA is a programming environment for NVIDIA GPUs, but many say CUDA to mean the C/C++ extension in CUDA (CUDA can be thought of as a C/C++ dialect here.) PTX is NVIDIA specific, and sits at a similar level as LLVM's IR. If anything, DeepSeek is more dependent on NVIDIA than everyone else, since PTX is tightly dependent on their specific GPUs. Things like ZLUDA (effort to run CUDA code on AMD GPUs) won't work. This is not a feel good story here.

[-] eager_eagle@lemmy.world 1 points 5 months ago* (last edited 5 months ago)

I don't think anyone is saying CUDA as in the platform, but as in the API for higher level languages like C and C++.

PTX is a close-to-metal ISA that exposes the GPU as a data-parallel computing device and, therefore, allows fine-grained optimizations, such as register allocation and thread/warp-level adjustments, something that CUDA C/C++ and other languages cannot enable.

[-] Capsicones@lemmy.blahaj.zone 1 points 5 months ago

Some commenters on this post are clearly not aware of PTX being a part of the CUDA environment. If you know this, you aren't who I'm trying to inform.

[-] eager_eagle@lemmy.world 1 points 5 months ago

aah I see them now

[-] sinceasdf@lemmy.world 4 points 5 months ago

This is why Nvidia stock has been hit so hard. CUDA is their moat

[-] massive_bereavement@fedia.io 5 points 5 months ago

Aw, CUDA see this happening...

[-] mesamunefire@lemmy.world 2 points 5 months ago

Reminds me of the Bitcoin mining and how askii miners overtook graphic card mining practically overnight. It would not surprise me if this goes the same way.

[-] Imgonnatrythis@sh.itjust.works 2 points 5 months ago

They said this is close to metal. Wake me up when they've achieved metal.

this post was submitted on 28 Jan 2025
147 points (97.4% liked)

Technology

72472 readers
990 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS