232

I just listened to this AI generated audiobook and if it didn't say it was AI, I'd have thought it was human-made. It has different voices, dramatization, sound effects... The last I'd heard about this tech was a post saying Stephen Fry's voice was stolen and replicated by AI. But since then, nothing, even though it's clearly advanced incredibly fast. You'd expect more buzz for something that went from detectable as AI to indistinguishable from humans so quickly. How is it that no one is talking about AI generated audiobooks and their rapid improvement? This seems like a huge deal to me.

you are viewing a single comment's thread
view the rest of the comments
[-] not_a_bot_i_swear@lemmy.world 17 points 1 year ago

I would guess there is a LOT of work going into each voice. Playing with different parameters and prompts. I don't think it's as simple as just copying the text into a box. Not yet at least :)

[-] Nukken@lemmy.world 7 points 1 year ago

That's a good thought there though. Audiobooks could have each character voiced uniquely.

[-] AdmiralShat@programming.dev 8 points 1 year ago

This is literally the only upside I see from this.

One of the Dune audio books started off as multiple voices and then part way through it was finished by just one guy. Really impressed with it at first, and then really kind of debuffed by it. I had already read the book years before so it wasn't a big deal, but like wtf?

[-] physcx@kbin.social 5 points 1 year ago

Lol what a troll audio book.

I can hope! With the speed things are developing it may not be too long.

I haven't played around with or looked into much to do with AI at all but would be willing to put in some time into playing with prompts / parameters if it meant I could eventually create a reliable work flow to create things such as what I mentioned.

I think I'll have to do some research, I need some more old school hollow earth stories in my life xD

[-] pretzelz@lemmy.world 1 points 1 year ago* (last edited 1 year ago)

I don't see why you couldn't give a few examples and then grab the dialog of a person in along with their description (or just the whole book) and get the llm to generate the prompt for you

this post was submitted on 11 Nov 2023
232 points (94.6% liked)

Asklemmy

43982 readers
644 users here now

A loosely moderated place to ask open-ended questions

Search asklemmy ๐Ÿ”

If your post meets the following criteria, it's welcome here!

  1. Open-ended question
  2. Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
  3. Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
  4. Not ad nauseam inducing: please make sure it is a question that would be new to most members
  5. An actual topic of discussion

Looking for support?

Looking for a community?

~Icon~ ~by~ ~@Double_A@discuss.tchncs.de~

founded 5 years ago
MODERATORS