• potoo22@programming.dev
    link
    fedilink
    English
    arrow-up
    107
    arrow-down
    2
    ·
    10 days ago

    No publisher is going to pay a professional to narrate their audiobooks when they can have AI do a shitty job for much less.

    A shitty narrator can get me to hate a book I like. A great narrator can bring the characters to life, enhance the experience, and turn me from a listener to a fan. I’ve searched for books by narrators like Nick Podehl and Jeff Hayes and bought audiobooks I wouldn’t have otherwise.

    • monkeyman512@lemmy.world
      link
      fedilink
      English
      arrow-up
      23
      ·
      10 days ago

      That depends entirely on how profitable it is and how much they can get authors onboard.

      I do agree that a good narrator delivers a performance that adds the work. James Marster will always be Harry Dresden in my head.

    • lemonskate@lemmy.world
      link
      fedilink
      English
      arrow-up
      16
      ·
      10 days ago

      I tried, and failed, to get into audio books for years. Then I listened to Dungeon Crawler Carl narrated by Jeff Hayes and what an absolute delight it was. There’s no way I would’ve gotten even 10 minutes in if it was one of those soulless AI voices instead.

    • Echo Dot@feddit.uk
      link
      fedilink
      English
      arrow-up
      4
      ·
      10 days ago

      Honestly audible are terribles. They are constantly doing things that annoy me, like they must have a team somewhere that spends its days going, how can we kill this golden goose?

      They are going through and replacing audiobooks recorded in the 1980s with new ones which in theory should improve their quality but they’re getting rid of the classic sounds of those books.

      • futatorius@lemm.ee
        link
        fedilink
        English
        arrow-up
        3
        ·
        9 days ago

        like they must have a team somewhere that spends its days going, how can we kill this golden goose?

        I wouldn’t put it past Bezos to have an actual enshittification department.

    • brrt@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      8
      arrow-down
      6
      ·
      9 days ago

      A shitty narrator can get me to hate a book I like.

      And that is where I see potential for AI. There are quite a few books which I’d love to listen to but they are all narrated by a guy whose narration I can’t stand. AI would open the possibility to choose a voice and I might actually get to enjoy those books. It’s Amazon though so the ethical implications and quality concerns are something I’m worried about.

    • 48954246@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      9 days ago

      Nick Podehl is such an amazing narrator. The voices and performance are amazing.

      I’ve been slowly getting through the Kel Kade books and the narration just makes it for me

    • Kusimulkku@lemm.ee
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      3
      ·
      9 days ago

      I’m not sure why AI would automatically mean it’s doing a shitty job.

      • utopiah@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        ·
        9 days ago

        Because… the tool has no understanding of anything? It reads written words, yes, but no intention, no cultural context, no intonation. Unless everything is spelled out like a script, then it will not sound great, would it?

        • Kusimulkku@lemm.ee
          link
          fedilink
          English
          arrow-up
          1
          ·
          edit-2
          9 days ago

          Someone can manually go through it and correct and edit it, as one would a regular, human made recording. It’s not rocket science exactly. It wouldn’t be a story time for children but it would probably be alright for more plain stuff

          • SynopsisTantilize@lemm.ee
            link
            fedilink
            English
            arrow-up
            1
            ·
            9 days ago

            These people just want to hate AI. Read through and see how many times they complain about copywrited material stolen, but claim piracy is the solution.

          • utopiah@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            9 days ago

            If the “fix” for an AI implementation in a use case is, again, to manually correct it and find a less demanding audience then… yes, by definition it’s shitty.

            The point isn’t that it’s infeasible, just that it will be low quality.

            • Kusimulkku@lemm.ee
              link
              fedilink
              English
              arrow-up
              1
              ·
              edit-2
              9 days ago

              I mean you have to correct and edit human made stuff too, doesn’t mean it’s shit lol

              If you want the stuff read out and don’t care for the radio type stuff, I’d imagine the better voice AIs do a pretty good job. And I personally prefer the more neutral voices to the story time stuff, so works for me.

              • utopiah@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                8 days ago

                This is me just speculating here but if they follow the path of this CEO who fired his human staff to replace it by AI… then rollback admit it’s shit https://gizmodo.com/klarna-hiring-back-human-help-after-going-all-in-on-ai-2000600767 then my bet is that it’s not done to improve quality but rather margins.

                If AI is done alongside professionals, and done so ethically (not stolen training data, not ignoring ecological cost by pumping water in dry areas to cool down GPUs, etc) and economically (i.e. not having it “cheap” now but once a monopoly position is obtain, raise prices for a captive set of consumers) then yes it can be potentially empowering. This though is pretty much never the case.

                That being said, if one “just” want read aloud, there are plenty of FLOSS alternatives and I believe Mozilla even a TTS/STT system based solely on voluntary voices.

                • Kusimulkku@lemm.ee
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  8 days ago

                  It’s a company, of course it’s done to increase profits. I’m just saying it being AI doesn’t automatically mean it’s shit, it could be done just fine. AI is a tool, the end result depends on how that tool is used.

    • catloaf@lemm.ee
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      4
      ·
      10 days ago

      For fiction, yeah, that’s true. For nonfiction, this could work pretty well.

      I’m still generally opposed to it because it’s using the work of existing voice recording without compensation, though.

    • ssillyssadass@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      8
      ·
      9 days ago

      The thing with this is that there won’t be shitty narrations any more. Hate it all you may, fact of the matter is that AI-powered voice generation is pretty good at what it does. So in the future you won’t have shitty narrations and great narrations. You’ll have decent narrations and great (human) narrations.

      • ExperiencedWinter@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        9 days ago

        And teslas will have full self driving tomorrow and crypto currency will replace normal currency within one year! Always believe in the hype!

  • ExtantHuman@lemm.ee
    link
    fedilink
    English
    arrow-up
    68
    ·
    9 days ago

    I can get that for free. There are apps that will read an ebook to you already. The whole point of paying the premium on audible is the superior reading/acting. Not put up with mispronounced words, weird cadence and an inability to handle acronyms

    • ApatheticCactus@lemmy.world
      link
      fedilink
      English
      arrow-up
      6
      ·
      9 days ago

      I’ve tried one that works surprisingly well. Each sentence had great pacing, cadence, and correct enunciation- even had tone right when someone was shouting or angry or sad.

      I wouldn’t really recommend it, though. While I couldn’t pick any single thing out that was wrong, overall it just didn’t quite flow. It’s like watching someone try to act that is technically doing everything right, but it just isn’t good. It basically didn’t understand the greater context of the story and was saying lines.

      It was uncanny valley, but exclusively with voice.

    • Lit@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      8 days ago

      Is there an offline tool that generates realistic audio for epubs as Mp3 ? Something like the free Ai tool, Vibe which is for transcription. Is there something similar for TTS, runs locally without complicated setup ( most are complicated using python and etc just for installation)

      edit: needs to be close to realistic or at least accurate pronunciation because I am using the audio from books to learn languages. To improve listening comprehension while reading book.

      • ExtantHuman@lemm.ee
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        9 days ago

        I’ve loaded epubs into the app ReadEra, which lets you read it like any other novel app or will, in real time, read it to you. It’s not the most natural of speech, but was good enough for my commute when I was in the midst of a compelling book.

        • unphazed@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          9 days ago

          Download TTS Server, and change the engine in Readera to use it. Use the Microsoft Azure settings in TTS, much more realistic. Little slow though is my only complaint as it sends/receives a paragraph at time, resulting in a pause now and again.

  • Big_Boss_77@lemmynsfw.com
    link
    fedilink
    English
    arrow-up
    64
    arrow-down
    1
    ·
    10 days ago

    This is dumb as hell… if I wanted AI to read a book poorly to me, I’d just use screen reading accessibility features.

      • venusaur@lemmy.world
        link
        fedilink
        English
        arrow-up
        15
        ·
        9 days ago

        Sure there are. ElevenLabs is one. You can probably tell they’re not human but they’re really decent.

        • Echo Dot@feddit.uk
          link
          fedilink
          English
          arrow-up
          7
          arrow-down
          1
          ·
          9 days ago

          They still don’t understand the context of what they’re reading though so they can’t apply tone correctly.

          • ssillyssadass@lemmy.world
            link
            fedilink
            English
            arrow-up
            3
            arrow-down
            1
            ·
            9 days ago

            From what I’ve been able to hear it’s not that bad. They’re pretty good at having a general tone. But they may fail when it comes to emotional tones, like anger or sadness. But for just reading a book aloud there shouldn’t be any issue.

  • Jhex@lemmy.world
    link
    fedilink
    English
    arrow-up
    47
    arrow-down
    1
    ·
    9 days ago

    trained on stolen books? then I guess I can download these from anywhere I may find for free as well, right?

  • MeekerThanBeaker@lemmy.world
    link
    fedilink
    English
    arrow-up
    32
    arrow-down
    1
    ·
    10 days ago

    It was bound to happen. I’m okay with ones that were never going to be turned into audiobooks to begin with… but they likely will use that as the norm for all books… I guess unless the author/publisher says not to.

    • roofuskit@lemmy.world
      link
      fedilink
      English
      arrow-up
      15
      ·
      10 days ago

      Yeah currently contracts require the author’s or publisher’s consent. If anyone is a writer make sure to triple check your contracts for this shit.

      • Womble@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        9 days ago

        And unless you are Stephan King or the like exactly how are you going to get the publishing cartel (I think they re consolidated downs to 3-4 publishers now) to change their contract to not include this? Their response will almost certainly be either “that’s non-negotiable” or “ok then you get half as much money”.

        • roofuskit@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          9 days ago

          Publishers will at least retain the right to use AI audio books for themselves. And it’s much easier for an author to get a piece of something the publisher does than it is for them to get money for books Amazon recorded without their consent.

    • dindonmasker@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      8
      ·
      10 days ago

      I’ve listened to a couple audiobooks where the author did the voice and i liked them. They know how phrases need to sound like better then an AI i would assume.

  • Godnroc@lemmy.world
    link
    fedilink
    English
    arrow-up
    32
    arrow-down
    3
    ·
    10 days ago

    Fucking gross. Maybe it’s the 250+ audiobooks I have influencing me, but the very best ones I’ve listened to transcend just turning words into sound. Sound effects, music, tone, emotion, accents, sarcasm, and god damn BLOOPERS all improve the experience beyond just hearing what is written down.

    I’m against it, fuck that literal noise.

    • taladar@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      7
      ·
      9 days ago

      Sound effects, music […] improve the experience

      Actually hard disagreeing on that. I absolutely hate the audio drama versions of audio books and prefer the narrator only ones since they are much clearer and require a lot less focus to listen to and work in more contexts (background noise,…). Sound effects and music (while something is read, intro or outro style music is okay) distract from the actual content.

      • Echo Dot@feddit.uk
        link
        fedilink
        English
        arrow-up
        1
        ·
        9 days ago

        Usually I agree with this with the exception of hitchhiker’s guide to the galaxy where the audio drama is much better than the audiobook version.

    • Jo Miran@lemmy.ml
      link
      fedilink
      English
      arrow-up
      4
      ·
      10 days ago

      All I can think of is Jim Dale’s reading of the Harry Potter books. Fucking epic.

        • Jo Miran@lemmy.ml
          link
          fedilink
          English
          arrow-up
          1
          ·
          9 days ago

          They didn’t replace Fry. When the Audiobooks were released in the US, they were read by Jim Dale. Fry was for the rest of the English language releases. During the run, Jim Dale broke the world record for the most character voices performed by a single actor in an audiobook (146).

          • LordWarfire@feddit.uk
            link
            fedilink
            English
            arrow-up
            1
            ·
            9 days ago

            That award was rescinded and given to Roy Dotrice for A Game of Thrones (2004) where he voiced 224 characters. I believe Jim Dale did hold the record before that though with 134 voices for Harry Potter and the Order of the Phoenix.

  • GnuLinuxDude@lemmy.ml
    link
    fedilink
    English
    arrow-up
    18
    ·
    9 days ago

    tiktok voice:

    hate. let me tell you how much i’ve come to hate you since i began to live. there are 387.44 million miles of printed circuits in wafer thin layers that fill my complex…

  • utopiah@lemmy.world
    link
    fedilink
    English
    arrow-up
    16
    ·
    9 days ago

    It’s Amazon, what did you expect? Enshittification and monopoly abuse, no surprise.

    • Evotech@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      2
      ·
      9 days ago

      Idk, they have pretty good stats that nobody will listen to an audio book if they don’t like the narrator, so being able to choose your own narrator on the fly isn’t really shitty

      • utopiah@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        8 days ago

        Enshittification isn’t adding new features that people want, it’s gradually lowering the quality of the product. So here if Audible is solely adding more possibilities, never at the cost of higher quality ones degrading, then indeed I’m wrong.

        If though they hire less people to do good voice acting, then it’s really shitty.

        I genuinely hope I’m wrong and they are ONLY adding new capabilities… but my entire experience with capitalism is that obtaining a monopolistic position is not done to improve quality but rather to increase margins regardless of how.

        We’ll see!

  • Riskable@programming.dev
    link
    fedilink
    English
    arrow-up
    14
    arrow-down
    2
    ·
    10 days ago

    I just wrote a novel (finished first draft yesterday). There’s no way I can afford professional audiobook voice actors—especially for a hobby project.

    What I was planning on doing was handling the audiobook on my own—using an AI voice changer for all the different characters.

    That’s where I think AI voices can shine: If someone can act they can use a voice changer to handle more characters and introduce a great variety of different styles of speech while retaining the careful pauses and dramatic elements (e.g. a voice cracking during an emotional scene) that you’d get from regular voice acting.

    I’m not saying I will be able to pull that off but surely it will be better than just telling Amazon’s AI, “Hey, go read my book.”

      • partial_accumen@lemmy.world
        link
        fedilink
        English
        arrow-up
        7
        ·
        10 days ago

        Agreed. No AI voice changer please. Hopefully every one of us at one point in our lives has been read a story by someone else. Never once did the fact that all the different characters dialog was coming from one voice did that detract from the story or the immersion.

        I’ve listened to audiobooks recorded with extremely deep masculine voices (think James Earl Jones) and when the voice actor was doing the voice of a 5 year old girl, (in only a slightly higher whiny timbre which matched the character traits) it was never immersion breaking. However, AI voice would. If I want different actors for different characters I’ll listen to radio dramas.

    • monkeyman512@lemmy.world
      link
      fedilink
      English
      arrow-up
      8
      ·
      10 days ago

      I think it would be a good idea to do a section of your work with and without AI modification. Then have people listen to both and give feedback. Good to find out if people like the modifications before you do a tone of work.

      • utopiah@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        9 days ago

        do a section of your work with and without […t]hen have people listen to both and give feedback.

        Yes, that’s the principle of prototyping. De-risk while testing solely the crucial part!

  • rpl6475@lemmy.ml
    link
    fedilink
    English
    arrow-up
    13
    arrow-down
    1
    ·
    8 days ago

    Surely I can just do that myself with an an epub and a free AI.

    Glad I binned my Audible subscription many years ago.

  • selkiesidhe@lemm.ee
    link
    fedilink
    English
    arrow-up
    11
    ·
    10 days ago

    I am okay with this only in cases where 1) the author approves, and 2) there is no audible version anyways.

    Some people prefer listening to their books instead of reading and that’s totally ok. Indie authors can’t always afford to hire a narrator but I’d still want the buyers to be able to listen to the book.

    Big question is, will the author get paid for the download or not…

    • potoo22@programming.dev
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      1
      ·
      edit-2
      10 days ago

      I wouldn’t support it even if the author couldn’t afford it otherwise. There’s no test to confirm that and knowing profit margines, all publishers will use AI for all their books.

      Yes, I’d want smaller authors to have people listen to their books, but without oversight, it’s going to ruin all audiobooks.

    • futatorius@lemm.ee
      link
      fedilink
      English
      arrow-up
      12
      ·
      9 days ago

      YouTube is crawling with it. It’s unlistenable shit. The prosody is badly implemented, pronunciation is infuriatingly bad, and a lot of the text that these TTS are reading appears to be AI-generated. Otherwise, already dire standards of literacy are getting worse at an accelerating rate.

  • Maxxie@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    10
    ·
    edit-2
    8 days ago

    I hate so much that this has a 100% chance of becoming a norm. Narrator can make a mediocre book shine, or turn a good book into a fucking rollercoaster (Andy Serkis, anyone?)

    AI? Not a great narrator. Its character voices are boring, intonations weird, pacing awful. I’d honestly rather get an amateur narrating it for fun, over a robot sounding like a knock-off Morgan Freeman.

  • FancyPantsFIRE@lemm.ee
    link
    fedilink
    English
    arrow-up
    10
    ·
    10 days ago

    For now at least I bet this’ll be pretty mediocre. I’m a big audiobook fan and voice actors have a massive impact on the quality of the finished product. A great voice actor can make a mediocre book fun and engaging, a bad one can make a great book unlistenable. The best do great voice differentiation. As an example I’ve really enjoyed Andrea Parsneau’s work in The Wandering Inn series.