Managers (media.piefed.zip)
submitted 5 days ago* (last edited 5 days ago) by inari@piefed.zip to c/whitepeopletwitter@sh.itjust.works
phx@lemmy.world 2 points 3 days ago

I do wonder about that though. The Big AI operating costs include being able to service a certain number of customers within a certain amount of time. So if they need to service 10,000 requests per minute and fulfill them within 2-4 seconds, that's a big datacenter.

Now if a company does a few dozen requests a minute and on average can live with double-digit (in seconds) response times... the costs to implement could be much different. The thing is finding a model that will do that and provide accurate (enough) output, versus how much of Claude's pricing is built around speed+volume rather than accuracy.
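
A rough back-of-envelope for the two cases above, using Little's law (average requests in flight = arrival rate × time each request takes). The 10,000 requests per minute and 2-4 seconds come from the comment; "36 per minute" and "30 seconds" are assumed stand-ins for "a few dozen" and "double-digit", and the function name is just for illustration. This only estimates concurrency, not hardware or dollars.

```python
def in_flight(requests_per_minute, latency_seconds):
    """Little's law: average requests in flight = arrival rate * time in system."""
    return requests_per_minute / 60.0 * latency_seconds

# Big provider: 10,000 req/min, using 3 s as the midpoint of the 2-4 s window.
big = in_flight(10_000, 3)

# Small shop (assumed numbers): 36 req/min, 30 s acceptable response time.
small = in_flight(36, 30)

print(f"big provider: ~{big:.0f} requests in flight")
print(f"small shop:   ~{small:.0f} requests in flight")
print(f"ratio:        ~{big / small:.0f}x")
```

Under these assumptions the big provider is juggling a few hundred requests at once while the small shop has under twenty, even though each of its requests takes ten times longer. How that maps to actual cost depends on the model and on how well the hardware batches requests, which this sketch does not cover.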

bountygiver@lemmy.ml 3 points 3 days ago

A lot of the cost is in training as well, which you need if you want to "build your own Claude". If you only run inference with an open model then ya, the cost is directly correlated to how fast you want the responses to come in.

this post was submitted on 03 Jun 2026
957 points (99.6% liked)
