What text-to-speech application do you use?
from theacharnian@lemmy.ca to linux@lemmy.ml on 07 May 15:01
https://lemmy.ca/post/43597361

I’m trying to find a replacement for NaturalReader in Linux but I’m not finding anything as good.

I have played around with different engines, such as Espeak (too robotic), Mozilla TTS and Coqui, and Piper. But I’m looking for an application, not just an engine, something that would allow me to open up a PDF, pick a spot and read from there, then be able to move back and forth on the document. Ideally, I would like to also be able to tell the application how to pronounce certain words.

I haven’t figured out how to make Okular use The best I have found is ReadAloud, but it’s just a browser addon. Okular doesn’t seem to be able to use something like Piper EDIT: but Pied exists: github.com/Elleo/pied which makes it work.

Any ideas?

(I use Debian btw :P )

#linux

threaded - newest

enemenemu@lemm.ee on 07 May 17:39 next collapse

Did you already look into sherpatts?

Edit: sorry, wrong sublemmy. But it uses kaldi

just_another_person@lemmy.world on 07 May 17:40 next collapse

Do you just want a screen reader maybe? Gnome has Orca, and KDE has Kreader. Orca is much more polished.

theacharnian@lemmy.ca on 07 May 17:56 collapse

A screen reader reads what’s on the screen. What I’m describing is reading a document. ReadAloud does exactly that for Firefox, I am just asking for standalone applications.

just_another_person@lemmy.world on 07 May 18:46 next collapse

I think Piper being able to take input and read it from the client is simple enough that most people wouldn’t make an entire GUI just to avoid to do that, so it might be hard to find something like that.

If you’re specifically talking about reading PDFs aloud, you could do something like: pdftotext file.pdf | piper and it will read the whole thing.

If you only mean reading a file from a specific selection of text, I’ve never seen something that, and it would have to operate more like a fully fledged screen reader because you’d have content rendered on screen that would have to be then piped to a TTS engine.

theacharnian@lemmy.ca on 07 May 19:52 collapse

Yes, however scientific papers aren’t always linearly formatted PDFs (eg 2-columns), so pdftotext tends to be brittle.

If you only mean reading a file from a specific selection of text, I’ve never seen something that,

Okular actually does that, and with Pied I can use nice Piper voices, but the controls are very basic (start at the stop of the page, pause, stop).

Vincent@feddit.nl on 07 May 19:27 collapse

Probably not what you’re after, but if it’s really just about PDFs, note that Firefox has an excellent PDF reader built-in. Oh, but I guess a browser extension can’t access that?

theacharnian@lemmy.ca on 07 May 19:49 collapse

ReadAloud sort of does but it requires sending the pdf to their website, which is obviously not ideal.

PseudoSpock@lemmy.dbzer0.com on 07 May 17:51 next collapse

My mouth.

theacharnian@lemmy.ca on 07 May 19:53 collapse

…and my bow

[deleted] on 07 May 19:35 next collapse

.

swelter_spark@reddthat.com on 07 May 21:30 next collapse

Calibre can integrate with Piper, so any document that can be opened with its ebook viewer can be read aloud.

Termight@lemmy.ml on 07 May 21:52 next collapse

If you’re seeking a pre-packaged solution for leveraging the Kokoro-82M text-to-speech model, you might find the ‘Kokoro-FastAPI’ Dockerized wrapper… adequate. It seems to function, at least for me.

rodbiren@midwest.social on 07 May 22:24 next collapse

Kokoro is absolutely incredible for how small it is. It can run on CPU fairly quickly and the results are so consistent. I’m even working on an absolute wacky idea to use a genetic algorithm for voice cloning because the tensors it uses for voice style are just so small. It’s an awesome application.

hummy_bee@mander.xyz on 07 May 23:57 collapse

I am using Linux Mint and only discovered text-to-speech platforms exist recently. Could you advice a beginner on which one to use? Do I have to install it from the software manager or use the terminal to install whatever is recommended? Not very tech savvy, but not naïve either.