submitted 1 year ago* (last edited 1 year ago) by GravitySpoiled@lemmy.ml to c/piracy@lemmy.dbzer0.com
[-] datavoid@lemmy.ml 172 points 1 year ago

OpenSubtitles is hot garbage, a viable alternative needs to exist. Pray for Subscene

[-] Grandwolf319@sh.itjust.works 53 points 1 year ago

Ironically, this might be an area where machine learning could be beneficial.

[-] thirteene@lemmy.world 23 points 1 year ago

I've been watching a few projects that are attempting to live translate videos. We are very close

[-] JasonDJ@lemmy.zip 16 points 1 year ago

Live is great, but I don't think a real 1:1 translation would be feasible live for most languages.

Even a 10s delay allows for the whole sentence/phrase to be captured and translated in entirety. A lot of languages can drastically change meaning due to a word on the other side of the sentence.

[-] GreatAlbatross@feddit.uk 7 points 1 year ago

The great thing about television is that "live" is a flexible concept.
The playback software could happily play 10 seconds ahead of what's actually on the screen, and have plenty of time to translate like that.
In the same way that we sometimes put delays into live events to allow the subtitling systems breathing room.
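
A minimal sketch of that buffering idea, assuming the player receives timestamped words from some speech recognizer. The `buffer_captions` function, the `Caption` type, and the `translate` hook are hypothetical names for illustration, not part of any real player. Playback runs `delay` seconds behind the source, so each sentence can be collected in full before its caption is due:

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple


@dataclass
class Caption:
    show_at: float  # playback clock time (seconds) at which to display the caption
    text: str


def buffer_captions(
    words: Iterable[Tuple[float, str]],
    delay: float = 10.0,
    translate: Callable[[str], str] = lambda s: s,  # placeholder: identity "translation"
) -> List[Caption]:
    """Group (heard_at_seconds, word) pairs into whole-sentence captions.

    Each caption is scheduled `delay` seconds after its sentence started, which
    is when the delayed playback reaches the start of that sentence.
    """
    captions: List[Caption] = []
    pending: List[str] = []
    started_at = None

    def flush() -> None:
        nonlocal pending, started_at
        captions.append(Caption(started_at + delay, translate(" ".join(pending))))
        pending, started_at = [], None

    for heard_at, word in words:
        # A sentence that outlasts the delay budget is emitted as-is.
        if pending and heard_at - started_at >= delay:
            flush()
        if started_at is None:
            started_at = heard_at
        pending.append(word)
        if word.endswith((".", "?", "!")):
            flush()

    if pending:
        flush()
    return captions
```

With a 10 second delay, a German sentence whose meaning hinges on a late word ("Ich habe das nicht gesagt.") reaches the translator as one complete unit instead of word by word.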

[-] Trainguyrom@reddthat.com 4 points 1 year ago

In the same way that we sometimes put delays into live events to allow the subtitling systems breathing room.

I've always heard this was because of the infamous Super Bowl Janet Jackson wardrobe malfunction (where the malfunction was that only one nip was slipped and not both as was clearly intended)

[-] azertyfun@sh.itjust.works 2 points 1 year ago* (last edited 1 year ago)

It's already a thing with near-zero delay. MS Teams does it (dunno about the translation) and the QSMP Minecraft server has a bunch of livestreamers from different countries who use it for realtime translation.

[EDIT: Live demo from today. Shit's impressive.]

What actually happens is that the current sentence gets "corrected" several times as you keep speaking. It's a bit jittery, and if the word order differs significantly then the translated sentence might be a bit wonky for a few seconds, and there are a few misses, but overall it works really well; at least well enough that people who don't speak each other's language can have a conversation in their native tongues with essentially no more delay than reading speed. I can easily follow a livestream in a foreign language with the live subtitles (which was not the case a mere 6 months ago for any language other than English).

[-] parody@lemmings.world 2 points 5 months ago

Amazing clip you posted seven months ago here. Doesn’t seem like it could even be any better now.

[-] fatalError@lemmy.sdf.org 1 points 1 year ago

Live shouldn't be used in a home setup anyway unless for something where interaction is required, like a teams call or twitch stream. Anything else can take a delay for the sake of preserving the meaning.

[-] TwoCubed@feddit.de 8 points 1 year ago

I absolutely hate to watch subtitles appear word for word. So no, please no live captions.

[-] RogueBanana@lemmy.zip 3 points 1 year ago

It doesn't have to be live as in with the player but I imagine the audio could be loaded into the program simultaneously and have it produce cc for the entire movie as you watch it

[-] pacoboyd@lemm.ee 4 points 1 year ago

Whisper AI is pretty darn good. I've used it to make subtitles for MST3K vids where nothing good exists and maybe only had to spend 10 minutes doing some clean up. It even recognizes when different people are speaking and breaks up the subs accordingly.
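
For anyone wanting to try the same workflow, here is a rough sketch of the last step: turning transcription segments into an `.srt` file. It assumes each segment is a dict with `start`, `end` (seconds) and `text` keys, which is the shape the open-source `openai-whisper` package returns under `result["segments"]` from `model.transcribe(...)`. The helper names `srt_timestamp` and `segments_to_srt` are made up for this example:

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render segments (dicts with 'start', 'end', 'text') as SRT text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(blocks)


# Assumed usage with openai-whisper installed (not run here):
#   import whisper
#   result = whisper.load_model("base").transcribe("episode.mp4")
#   open("episode.srt", "w", encoding="utf-8").write(segments_to_srt(result["segments"]))
```

The manual cleanup pass then happens on the resulting text file, which any subtitle editor or plain text editor can open.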

[-] YoorWeb@lemmy.world 1 points 1 year ago

Imagine the next step though, soon AI will generate actors' voices speaking in any language you want.

I don't think I would use this actually, because I don't see how an AI could capture the performance. I'm a sub over dub guy anyway, but at least someone making a dub has a sporting chance to make an interesting performance.

[-] ShepherdPie@midwest.social 11 points 1 year ago

I typically grab the better quality rips and they almost always come with subtitles. The ones that don't are older or more obscure movies/shows that don't have many options to choose from.

[-] eek2121@lemmy.world 3 points 1 year ago

Is it easy to get a copy of their dataset?

[-] kux@lemm.ee 6 points 1 year ago

there's a comment from a few months ago with a torrent, the date inside is july 2022 so will be missing anything newer: https://lemmy.dbzer0.com/comment/5089994

[-] s38b35M5@lemmy.world 1 points 1 year ago

Hey, that's me! I'm (Lemmy) famous!

[-] lazynooblet@lazysoci.al 3 points 1 year ago
[-] TheBat@lemmy.world 2 points 1 year ago

Yeah but they are focused on tv shows afaik

this post was submitted on 06 Feb 2024
1441 points (97.2% liked)