Generate in Other Languages #112

etwk · 2023-05-03T08:04:42Z

Hi,

I understand that VALL-E needs a lot of data to train. To generate audio in a minority language, is it necessary to train on a large dataset in that language, or could we build a model based on an existing one using a small dataset in the target language?

A minority language might have fewer than 10 hours of data available.

Thanks.

RuntimeRacer · 2023-05-03T13:56:34Z

It will probably work, but the model could be pretty bad at imitating the voices of unseen speakers, since 10 hours of audio is unlikely to cover a wide variety of speakers.
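To get a rough feel for how much speaker variety a small dataset has, something like the sketch below could help. It assumes you can already produce a list of (speaker ID, clip duration in seconds) pairs from your own metadata; the function name `speaker_coverage` and the sample values are made up for illustration and are not part of this repo.

```python
from collections import defaultdict


def speaker_coverage(clips):
    """Summarize total audio hours and speaker count.

    `clips` is an iterable of (speaker_id, duration_seconds) pairs.
    This input format is an assumption, not something the repo defines.
    """
    seconds_per_speaker = defaultdict(float)
    for speaker_id, duration_s in clips:
        seconds_per_speaker[speaker_id] += duration_s

    total_hours = sum(seconds_per_speaker.values()) / 3600.0
    return {
        "total_hours": total_hours,
        "num_speakers": len(seconds_per_speaker),
        "hours_per_speaker": {
            spk: secs / 3600.0 for spk, secs in seconds_per_speaker.items()
        },
    }


# Made-up sample data: two speakers, two hours of audio in total.
clips = [("spk_a", 1800.0), ("spk_a", 1800.0), ("spk_b", 3600.0)]
stats = speaker_coverage(clips)
print(stats["total_hours"], stats["num_speakers"])
```

If most of the hours come from only a handful of speakers, that would be a hint that cloning unseen voices may not work well, even if the model learns to pronounce the language.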

etwk · 2023-05-03T14:49:40Z

Could we build on an existing model using a small dataset in the new language, or is it required to train with a large dataset in the new language?

RuntimeRacer · 2023-05-04T10:49:22Z

It might work with fine-tuning. I am currently training a model on Mozilla Common Voice; once it's done, it might even be able to synthesize a language it doesn't know. However, there are currently issues with non-Latin letters: #110

lifeiteng closed this as completed Sep 14, 2023
