feat: Add Speaker Consistency Evaluation Feature #317

Closed

6drf21e
Contributor

@6drf21e 6drf21e commented Jun 14, 2024

This PR introduces a new feature for evaluating speaker consistency within the project. It includes scripts for generating test audio and assessing the stability of speakers' voice timbre using the ERes2NetV2 Speaker Recognition Model.

Changes

  • Added audio_generator.py: Script to generate test audio files based on text data provided in test_data.yaml.
  • Added consistency_evaluator.py: Script to evaluate the stability of speaker voice timbre by calculating the cosine similarity between embedding vectors of audio segments.
  • Included test_data.yaml: YAML file containing text data for generating audio.
  • Updated README.md to include installation instructions, usage guidelines, and details about command-line arguments for both scripts.
  • Added requirements.txt to specify dependencies required for running the scripts.
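
For readers unfamiliar with the metric named above, here is a minimal sketch of scoring timbre consistency as the mean pairwise cosine similarity between segment embeddings. It is not the code in consistency_evaluator.py: the function names and the toy 3-dimensional vectors are hypothetical, and real embeddings would come from the ERes2NetV2 model rather than being written by hand.

```python
import math
from itertools import combinations


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def mean_pairwise_similarity(embeddings):
    """Average cosine similarity over every unordered pair of embeddings."""
    pairs = list(combinations(embeddings, 2))
    if not pairs:
        raise ValueError("need at least two embeddings")
    return sum(cosine_similarity(a, b) for a, b in pairs) / len(pairs)


# Toy vectors standing in for real speaker embeddings (assumption:
# one embedding per generated audio segment of the same speaker).
segments = [
    [1.0, 0.0, 0.0],
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
]
score = mean_pairwise_similarity(segments)
print(f"mean pairwise cosine similarity: {score:.3f}")
```

A score near 1 would suggest the segments share a similar timbre, while lower values point to drift between segments. Any pass/fail threshold is a separate choice that this sketch does not make.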

@6drf21e
Contributor Author

6drf21e commented Jun 15, 2024

@fumiama
Member

fumiama commented Jun 18, 2024

Thanks for your contribution! This helps us a lot. 🤝
However, this repo aims to provide the ChatTTS package, so it is hard to add too many bundles 😂.
If you agree, we would like to invite you to contribute to a new repo in our organization, 2noise.
Then we can link to your awesome work from the README in this main repo.

@6drf21e
Contributor Author

6drf21e commented Jun 18, 2024

I am happy to accept the invitation to contribute to the new 2noise repository. Thank you for your recognition! @fumiama

@fumiama
Member

fumiama commented Jun 18, 2024

What project name would you like? I will create a repo for you and grant you full permission.

@6drf21e
Contributor Author

6drf21e commented Jun 18, 2024

Any name will do, I'm not particular about it. Naming things is always the hardest part for me 😂.

@fumiama
Member

fumiama commented Jun 18, 2024

All right 😂. I will decide by myself.

@fumiama
Member

fumiama commented Jun 18, 2024

And, what license would you like? (I suggest AGPL v3)

@6drf21e
Contributor Author

6drf21e commented Jun 18, 2024

Sounds good!

@fumiama
Member

fumiama commented Jun 18, 2024

OK.

@fumiama
Member

fumiama commented Jun 18, 2024

I have invited you to https://github.com/2noise/ChatEval
Therefore, I would like to close this PR first.
Feel free to push your code there! 🎉

@fumiama fumiama closed this Jun 18, 2024
Labels
  • documentation: Improvements or additions to documentation
  • enhancement: New feature or request
  • good first issue: Good for newcomers
2 participants