🐸 TTS roadmap #378
Comments
great project! Excited to see this growing! |
I'm learning the code/API and performing experiments. I hope to contribute soon. I'm also wondering if I can donate (money) to Coqui? |
Wow! Thanks! Humbling. We were setting up GitHub sponsors, but the tax implications were onerous. We're currently exploring Patreon. So stay tuned! |
@erogol Thanks for sharing the plans! Do you have any thoughts (or need help to) simplifying the dependencies a bit? I'm thinking that if |
@agrinh Why do you need to keep your own fork exactly? It'd be better to expand the conversation on gitter if you like. |
Wow, thanks for the super fast reply. Sure, we can move the discussion to gitter. |
Please add DC-TTS to the list of models. An MIT-licensed DC-TTS implementation is available here |
What were you thinking about the "TensorFlow run-time for training models"? Like giving the user the option of using TensorFlow or PyTorch? I wouldn't mind taking a stab at the TensorFlow part. |
@will-rice the plan is to mirror what we have in torch to TF as much as possible. It'd be great if you initiate the work |
Are you guys planning to develop some expressive TTS architectures? I'm currently studying this topic and planning to implement some of them based on Coqui. Some of them just control the latent space using GST (Kwon et al. 2020) or RE (Sorin et al. 2020), and others actually change the architecture by adding a VAE, normalizing flows, and gradient reversal |
@lucashueda Capacitron VAE: #510 |
Oh nice, hope to see Capacitron integrated soon. So maybe in the future I'll be able to contribute some other expressive architectures |
@erogol Looking forward to new end-to-end models being implemented, specifically Efficient-TTS! If the paper is accurate, it should blow most 2-stage configurations out of the water, considering it seems to have a higher MOS than tacotron2+hifigan while also seeming to be significantly faster than glowtts+fastest vocoder! I have not seen a single repo replicating the EFTS-Wav architecture described in the paper released 10 months ago. It would be amazing to see it in Coqui first! |
@BillyBobQuebec I don't think I will implement these models anytime soon. But as they stand, contributions are welcome |
@BillyBobQuebec but you can try VITS which is close to what you're describing :) |
Agreed, I am actually trying VITS at the moment. Unfortunately I have some issues training with the Coqui implementation. I posted an issue about the bug today and hope I can get it resolved. |
Hi there! Thanks for your great work! I'm looking forward to training YourTTS on other languages. Will the training and fine-tuning code for YourTTS be published soon? I would be very grateful if you could give me an approximate timeline. Have a nice day :-D |
You can currently do it by creating your own tokenizer or overloading the class. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels. |
Marvelous project. |
@MaxIakovliev you can use https://coqui.ai/ :) |
This roadmap issue is quite outdated. I'll keep it open to preserve the references to some of the issues and models we'd like to tackle, but I won't be updating it until one day officially becomes 48 hours. |
Any update regarding SSML implementation? |
We are not working on SSML currently. It is back on the list without a precise timeline. |
Please do!! |
Will you support bark-small? Thanks. |
Please take a look at You can use sherpa-onnx to run VITS models from Coqui on Android and also on embedded devices, e.g., Raspberry Pi. We have pre-built Android APKs for the VITS English models from Coqui. |
Any new update? |
We have added support for it in k2-fsa/sherpa-onnx#508. The following is a YouTube video You can use all coqui-ai/TTS models and piper models listed in |
hi guys, why? upd: found https://twitter.com/_josh_meyer_/status/1742522906041635166 |
Their ability to exist and be profitable depended on how much better their tech was compared to everyone else's. It may not feel like it, but we are in the middle of an AI singularity. Coqui's business model might have stood a chance if they had started with this tech 5 years earlier, but it was probably too little, too late. ElevenLabs is probably eating their lunch :/ |
Can we still use their product, even though they have shut down? |
You can still use the code and models (within the license terms). We also maintain a fork that has a bunch of bug fixes and other small updates. |
These are the main dev plans for 🐸 TTS.
If you want to contribute to 🐸 TTS and don't know where to start, you can pick one here and start with our Contribution Guideline. We're also always here to help.
Feel free to pick one or suggest a new one.
Contributions are always welcome 💪 .
v0.1.0 Milestones
Synthesizer interface on CLI or Server.
v0.2.0 Milestones
v0.3.0 Milestones
v0.4.0 Milestones
TTS.tts models.
v0.5.0 Milestones
v0.6.0 Milestones
v0.7.0 Milestones
v0.8.0 Milestones
🏃♀️ Milestones along the way
🤖 New TTS models