-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
how to use peoples speech dataset? #57
Comments
I have the same problem, apart from clean/ dirty, there is also the difference between CC-BY and CC-BY-SA. I have downloaded all these files but can't decompress them because it doesn't look like zip or tar files. |
If you extract what you downloaded from the "Data" button, you can use the manifest to build text/speech pairs based on the |
I can get label and name from the manifest, but how can I get wavs from the Big File downloaded from "Data"
|
Hi, Do you have any suggestions or do you find the same problem?? |
Hello, can you share the part-00000-4e132642-c01c-4db6-9db0-a1e19193f6f8-c000.json with me; |
I have downloaded the people speech dataset, and have two questions:
![image](https://user-images.githubusercontent.com/9246556/146738776-4eccf7c7-11bd-48fc-82f3-c2861dcd9a59.png)
what's the relationship between the two options?
Do I have to download the data of both the two options?
So the total audio will be 60k hours?
The text was updated successfully, but these errors were encountered: