I’m using https://github.com/rhasspy/piper mostly to create some audiobooks and read some posts/news, but the voices available are not always comfortable to listen to.

Do you guys have any recommendations for a voice changer to process these audio files?
Preferably it’ll have a CLI so I can include it in my pipeline to process RSS feeds, but I don’t mind having to work through a UI.
Bonus points if it can process the audio streams.
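
For illustration only, a pipeline step like that could look roughly like the sketch below. It assumes SoX (`sox`) is installed, which is an assumption made for this example and not a recommendation from the thread. SoX's `pitch` effect only shifts pitch (in cents, i.e. hundredths of a semitone), so treat it as a crude stand-in for a real voice changer. The function names and the directory layout are made up.

```python
import shutil
import subprocess
from pathlib import Path


def build_sox_command(src, dst, cents):
    """Return the argv for a SoX pitch shift; cents are 1/100 of a semitone."""
    return ["sox", str(src), str(dst), "pitch", str(cents)]


def shift_directory(in_dir, out_dir, cents=-300):
    """Pitch-shift every .wav file in in_dir and write the results to out_dir.

    The directory layout is hypothetical; sox must be available on PATH.
    """
    if shutil.which("sox") is None:
        raise RuntimeError("sox not found on PATH")
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for wav in sorted(Path(in_dir).glob("*.wav")):
        # check=True raises CalledProcessError if sox exits with an error.
        subprocess.run(build_sox_command(wav, out / wav.name, cents), check=True)
```

Something like `shift_directory("piper_out", "shifted", cents=-300)` would then lower every file by about three semitones. It would not touch pacing or inflection.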

  • pe1uca@lemmy.pe1uca.dev (OP)
    3 months ago

    Text-to-speech is what piper is already doing.
    What I’m looking for is called a voice changer, since I want to change a voice that has already read something.

    That’s exactly what I want: “the thing in the Darth Vader Halloween masks”, but for Linux, preferably with a CLI that can ingest audio files and be configured to change the voice however I want, not only to Darth Vader.

    • catloaf
      3 months ago

      Oh, I see. I think it would still be easier to either use a different voice in piper (the GitHub page talks about this) or use a different TTS program entirely.

    • bastion@feddit.nl
      3 months ago

      So, all of the awkward pauses, the lack of inflection - you’re saying keep those, just change who it sounds like is speaking?