Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Output compared to Fastspeech2 #60

Open
debasish-mihup opened this issue Jul 12, 2021 · 0 comments
Open

Output compared to Fastspeech2 #60

debasish-mihup opened this issue Jul 12, 2021 · 0 comments

Comments

@debasish-mihup
Copy link

I have some question regarding quality of Fastspeech2 output compared to Glow TTS. Currently I am using Glow TTS generated Mels with HifiGan vocoder and quality is good. There is scope of improvement in prosody. Tacotron2 works better in this regard but has high inference time as well as performs poorly when input sentence length increases. Fastspeech2's inference speed is faster that of Glow TTS but given that contribution of TTS is small compared to time taken by vocoder. I am rather interested in knowing whether Fastspeech2 would help increase quality in terms of intonation, pauses and stress of output sentences? Does anyone here trained both using Glow TTS vs Fastspeech2?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant