I think Mozilla TTS is the best. But this question should have been asked in /c/Open Source.
My understanding is that Mozilla is continuing to build the Common Voice dataset for training speech models, but they are no longer developing TTS or STT software themselves.
https://github.com/coqui-ai/TTS is the new home of what was Mozilla’s TTS project. Coqui is a new company where some of the former Mozilla speech team ended up. Coqui is continuing to develop both the TTS and STT code and models.
There are a number of other much older free software TTS options, but Coqui’s (formerly Mozilla’s) is by far the best one I’ve heard.
Here is a page with samples; it sounds pretty good: https://erogol.github.io/ddc-samples/
Wow, Coqui sounds really good. I’m gonna have to find a command-line tool for that.
deleted by creator
I didn’t notice that when I tried it before, but now I see what you mean… that is really irritating :(
Also, just now I tried to have it speak only the word “hello” (no punctuation) and got something like “hello oh oh oh oh” with a bit of tonal variation in the strange sounds at the end. So, yeah, I guess they’ve got a ways to go still. Other short phrases I’m trying give good results, but somehow “hello” produces these odd sounds.
Is there a way to install these on Android?
I went looking and found this, which implies that the TTS isn’t working on Android yet, and this, which indicates the STT library does work on Android, but they have only a very simple and limited demo app so far.
I also found this Voice-Cloning repo, which says it has an Android app that uses Tacotron2 (one of the models Coqui uses, which comes from Google) to do voice cloning… which sounds promising, but I don’t see an APK or build instructions.