[-] themeatbridge@lemmy.world 1 points 22 hours ago

I didn't read the article, but I would have assumed that the AI was using predictive text to guess at the next word. Speech recognition is already pretty good, but it often misses contextual cues that an LLM would be good at spotting. Like, "The famous French impressionist painter mayonnaise..."
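To make that idea concrete, here is a toy sketch of context-based rescoring: a recognizer proposes similar-sounding candidates, and a context score breaks the tie. Everything in it is an assumption for illustration: the candidate "Monet is" (presumably the phrase that got misheard), the made-up probabilities, and the hypothetical `rescore` helper. It is not how any particular speech recognition system works.

```python
import math

# Hypothetical acoustic scores: made-up log-probabilities a recognizer
# might assign to two similar-sounding candidates.
ACOUSTIC_LOGPROB = {
    "mayonnaise": math.log(0.55),
    "Monet is": math.log(0.45),
}

# Hypothetical context scores: how plausible each candidate is after
# "The famous French impressionist painter ..." (also made-up numbers).
CONTEXT_LOGPROB = {
    "mayonnaise": math.log(0.001),
    "Monet is": math.log(0.6),
}


def rescore(candidates, acoustic, context, weight=1.0):
    """Return the candidate with the highest combined log score.

    `weight` controls how much the context score counts relative to
    the acoustic score; weight=0.0 ignores context entirely.
    """
    return max(candidates, key=lambda c: acoustic[c] + weight * context[c])


candidates = ["mayonnaise", "Monet is"]

# Sound alone slightly prefers the wrong phrase.
acoustic_only = max(candidates, key=ACOUSTIC_LOGPROB.get)

# Adding the context score flips the choice.
best = rescore(candidates, ACOUSTIC_LOGPROB, CONTEXT_LOGPROB)

print(acoustic_only)
print(best)
```

With these numbers, the acoustic score alone picks "mayonnaise", and adding the context score picks "Monet is".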

[-] kautau@lemmy.world 3 points 22 hours ago* (last edited 21 hours ago)

Probably something like https://github.com/openai/whisper, which isn’t an LLM but a different type of model dedicated to speech recognition.

[-] themeatbridge@lemmy.world 1 points 22 hours ago

That makes sense.

this post was submitted on 09 Jan 2025
372 points (98.4% liked)

Opensource
