424

Google is embedding inaudible watermarks right into its AI generated music (www.theverge.com)

submitted 2 years ago by L4s@lemmy.world to c/technology@lemmy.world

97 comments fedilink hide all child comments

Google is embedding inaudible watermarks right into its AI generated music::Audio created using Google DeepMind’s AI Lyria model will be watermarked with SynthID to let people identify its AI-generated origins after the fact.

you are viewing a single comment's thread
view the rest of the comments

[-] SuckMyWang@lemmy.world 12 points 2 years ago* (last edited 2 years ago)

it does this by converting the audio into a 2d visualisation that shows how the spectrum of frequencies evolves in a sound over time

Old school windows media player has entered the chat

Seriously fuck off with this jargon, it doesn’t explain anything

[-] Terminarchs@slrpnk.net 22 points 2 years ago

That's actually an accurate description of what is happening: an audio file turned into a 2d image with the x axis being time, the y axis being frequency and color being amplitude.

[-] RufusLoacker@feddit.it 10 points 2 years ago

That's literally a spectrograph

[-] Terminarchs@slrpnk.net 8 points 2 years ago

Spectrogram*

[-] Viking_Hippie@lemmy.world 6 points 2 years ago

Your mom's literally a spectrograph.

[-] SuckMyWang@lemmy.world 1 points 2 years ago

I know, it’s like the old windows media player visualisations.

[-] FishFace@lemmy.world 13 points 2 years ago

Sounds like a bad journalist hasn't understood the explanation. A spectrogram contains all the same data as was originally encoded. I guess all it means is that the watermark is applied in the frequency domain.

[-] datavoid@lemmy.ml 10 points 2 years ago* (last edited 2 years ago)

Also this isn't new by any stretch... Aphex Twin would like a word

[-] FishFace@lemmy.world 8 points 2 years ago

Well, encoding stuff in the spectrogram isn't new, sure. But encoding stuff into an audio file that is inaudible but robust to incidental modifications to the file is much harder. Aphex Twin's stuff is audible!

[-] SuckMyWang@lemmy.world 4 points 2 years ago* (last edited 2 years ago)

I would like to know what it is that makes it so robust. The article explains very little. Is it in the high frequencies? Higher than the human ear can hear? Compression will effect that plus that’s going to piss dogs off. Could be something with the phasing too. Filters and effects might be able to get rid of the water mark

[-] FishFace@lemmy.world 4 points 2 years ago

I don't know what frequencies are annoying for dogs but I'm guessing it's above 24kHz so no sound file or sound system is going to be able to store or produce it anyway.

There will certainly be some way to get rid of the watermark. But it might nevertheless persist through common filters.

this post was submitted on 17 Nov 2023

424 points (99.5% liked)

Technology

73037 readers

741 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws