402
you are viewing a single comment's thread
view the rest of the comments
[-] moosetwin@lemmy.dbzer0.com 15 points 21 hours ago

I don't mind the idea, but I would be curious where the training data comes from. You can't just train them off of the user's (unsubtitled) videos, because you need subtitles to know if the output is right or wrong. I checked their twitter post, but it didn't seem to help.

[-] leftytighty@slrpnk.net 16 points 20 hours ago

subtitles aren't a unique dataset it's just audio to text

[-] nova_ad_vitum@lemmy.ca 12 points 18 hours ago

They may have to give it some special training to be able to understand audio mixed by the Chris Nolan school of wtf are they saying.

[-] MDCCCLV@lemmy.ca 3 points 16 hours ago

No, if you have a center track you can just use that. Volume isn't a problem for a computer listening to it since they don't use the physical speakers.

[-] leftytighty@slrpnk.net 1 points 55 minutes ago

I took the other comment as a joke but this is accurate and interesting additional information!

load more comments (1 replies)
this post was submitted on 09 Jan 2025
402 points (98.6% liked)

Opensource

1533 readers
778 users here now

A community for discussion about open source software! Ask questions, share knowledge, share news, or post interesting stuff related to it!

CreditsIcon base by Lorc under CC BY 3.0 with modifications to add a gradient



founded 1 year ago
MODERATORS