1024
you are viewing a single comment's thread
view the rest of the comments
[-] kromem@lemmy.world 16 points 1 year ago

Exactly. The difference between a cached response and a live one even for non-AI queries is an OOM difference.

At this point, a lot of people just care about the 'feel' of anti-AI articles even if the substance is BS though.

And then people just feed whatever gets clicks and shares.

[-] quick@thelemmy.club 0 points 1 year ago

Googles tpu can't handle llm's lol. What do you mean "exactly"?

[-] kromem@lemmy.world 4 points 1 year ago

In fact, Gemini was trained on, and is served, using TPUs.

Google said its TPUs allow Gemini to run “significantly faster” than earlier, less-capable models.

Did you think Google's only TPUs are the ones in the Pixel phones, and didn't know that they have server TPUs?

this post was submitted on 06 Jul 2024
1024 points (97.3% liked)

Technology

73419 readers
708 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS