56
you are viewing a single comment's thread
view the rest of the comments
[-] tias@discuss.tchncs.de 2 points 1 day ago* (last edited 1 day ago)

Let's do some estimates:

  • An 8x H100 machine costs about $20 / hr to rent.
  • With a 70B model with 4K context, a H100 node can do about 300 requests in parallel.
  • A single response takes around 30 seconds to generate.
  • An average user sends about 300 messages / month.

The throughput of a node is

300 concurrent * (3600 / 30) = 36 000 messages / hour.

The cost per message, then, is $20 / 36 000 = $.00055..

With 300 messages per month, the compute cost for the AI vendor is 300*$20/36000 = $0.16 / month per user. By contrast, a subscription costs $20.

So given these assumptions, it's other things (like R&D, safety research, training runs, free accounts, etc) that represent the bulk of the cost and those could be scaled down to turn a profit. What will they do? Give how hyped AI is currently and the competitive landscape, I don't think they'll increase prices that much. We have products like DeepSeek on the horizon which are much cheaper, so it's more likely that they squeeze money out of it by becoming more efficient.

[-] PetteriPano@lemmy.world 2 points 1 day ago

It's a weird market.

Those H100s are $25k minimum. So $200,000 just in GPUs. Drawing 700W each, or 5.6kW total. At my local prices that's about a dollar per hour just for electricity.

It's going to take you a couple of years to break even at $20/h. They might still hold some value at that point. Or they might be obsolete.

[-] B0rax@feddit.org 2 points 1 day ago

Well that entirely depends on your users… coding agents or in general agents that run for hours will crash your calculation

[-] tias@discuss.tchncs.de 1 points 1 day ago

That won't happen due to token limits. According to Anthropic, only about 5% of users hit the limit.

[-] NotMyOldRedditName@lemmy.world 2 points 1 day ago

Exactly. Then you move up to the $100 or $200 or per token API pricing levels.

this post was submitted on 23 Apr 2026
56 points (96.7% liked)

Asklemmy

54075 readers
523 users here now

A loosely moderated place to ask open-ended questions

Search asklemmy 🔍

If your post meets the following criteria, it's welcome here!

  1. Open-ended question
  2. Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
  3. Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
  4. Not ad nauseam inducing: please make sure it is a question that would be new to most members
  5. An actual topic of discussion

Looking for support?

Looking for a community?

~Icon~ ~by~ ~@Double_A@discuss.tchncs.de~

founded 7 years ago
MODERATORS