Audiobooks

For talk of all things audiobook related!

40,000 AI-narrated audiobooks flood Audible, dividing authors and listeners (www.techspot.com)

submitted 2 years ago by ptz@dubvee.org to c/audiobooks@literature.cafe

I definitely do not want to support this practice, but there's no way to filter these out 😠.

[–] GenderNeutralBro@lemmy.sdf.org 29 points 2 years ago (15 children)

One blogger cited in the report claimed converting an ebook to audio using the AI narration took just 52 minutes

This does not inspire confidence. The technology is there to do this very well, but it takes skill and effort. The technology to automate it end to end with high quality does not yet exist.

52 minutes. That's maybe 1/10th the time it would take to listen to it. I wonder how many of these 40,000 books were even proof-listened once.

[–] ptz@dubvee.org 36 points 2 years ago (14 children)

Honestly, I don't really care if the LLM can spit out a perfect replica of Stephen Fry with every inflection and intonation possible and in the correct spots.

Tools like these can and will be used to take jobs from actual voice actors. I want no part of it.

[–] GenderNeutralBro@lemmy.sdf.org 20 points 2 years ago (3 children)

I get where you're coming from, but it doesn't sit quite right with me. The whole point of technology is to save human time and effort. That should be a good thing. The problem is the capitalist hellscape that is the status quo. I don't think we should put the onus of propping up that capitalist hellscape onto book authors. I mean, maybe that's the easiest way to maintain the status quo, but the status quo was never sustainable in the first place.

I don't know. This is not a fully fleshed out philosophy. At some level I'm sure it's the same old idealism-vs-pragmatism debate.

[–] exocrinous@startrek.website 4 points 2 years ago (1 children)

Let me rephrase the issue for you and see if you have a different emotional reaction.

A person's job was replaced with a capitalist's robot, and now the capitalist earns all the money.

[–] riskable@programming.dev 1 points 3 weeks ago* (last edited 3 weeks ago)

I know I'm way late to the party but...

A person’s job was replaced with a capitalist’s robot, and now the capitalist earns all the money.

Not necessarily. A lot of Text-to-Speech (TTS) tech comes out of academia and free, open source software (FOSS). That includes AI models and voice changing tools like RVC (Retrieval-based Voice Conversion). RVC is fully open source, and there are thousands upon thousands of free voices to choose from. Not one of them is an exact replica of a real person's voice (it doesn't do that good a job; it just gets close). Many of the most popular voices are mashups of many different voices anyway.

You can use any number of FOSS TTS tools (some of the newer open source AI models are great) to read your text, then run the resulting audio through RVC to get whatever voices you want.

Alternatively, you could just read the text yourself and change the voices using RVC. That works far better than you'd think, but it requires reading your whole book out loud, which requires overcoming laziness haha.

TL;DR: A person's job could be replaced with a FOSS robot, and now the author earns all the money.
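
For anyone wondering what the two-step flow described above (TTS first, then voice conversion) looks like in practice, here is a rough sketch. It is only an illustration under stated assumptions: `placeholder_tts` and `placeholder_voice_conversion` are hypothetical stand-ins that produce and pass through silence, not real TTS or RVC calls, and the sample rate and chunk size are arbitrary. You would swap in whatever tools you actually use.

```python
import io
import wave
from typing import Callable, List

SAMPLE_RATE = 22050  # assumed sample rate for the placeholder audio


def split_into_chunks(text: str, max_chars: int = 500) -> List[str]:
    """Split on blank lines, then pack paragraphs into chunks of roughly
    max_chars (a single long paragraph is kept whole)."""
    chunks, current = [], ""
    for para in (p.strip() for p in text.split("\n\n")):
        if not para:
            continue
        if current and len(current) + len(para) + 1 > max_chars:
            chunks.append(current)
            current = para
        else:
            current = f"{current} {para}".strip()
    if current:
        chunks.append(current)
    return chunks


def placeholder_tts(text: str) -> bytes:
    """Hypothetical stand-in for a TTS engine: returns silent 16-bit mono
    PCM, about 0.05 seconds per character."""
    n_samples = int(len(text) * 0.05 * SAMPLE_RATE)
    return b"\x00\x00" * n_samples


def placeholder_voice_conversion(pcm: bytes) -> bytes:
    """Hypothetical stand-in for an RVC-style step: returns audio unchanged."""
    return pcm


def narrate(
    text: str,
    tts: Callable[[str], bytes] = placeholder_tts,
    convert: Callable[[bytes], bytes] = placeholder_voice_conversion,
) -> bytes:
    """Run each chunk through TTS, then voice conversion, and return WAV bytes."""
    pcm = b"".join(convert(tts(chunk)) for chunk in split_into_chunks(text))
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 16-bit samples
        wav.setframerate(SAMPLE_RATE)
        wav.writeframes(pcm)
    return buf.getvalue()


wav_bytes = narrate("Chapter one.\n\nIt was a dark night.")
```

Chunking by paragraph is a design choice for the sketch, not something the tools require; it just keeps each synthesis call short. The read-it-yourself route would skip the TTS step and feed recorded audio straight into the conversion step.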
