feat: Add Speaker Consistency Evaluation Feature #317

6drf21e · 2024-06-14T23:44:10Z

This PR introduces a new feature for evaluating speaker consistency within the project. It includes scripts for generating test audio and assessing the stability of speakers' voice timbre using the ERes2NetV2 Speaker Recognition Model.

Changes

Added audio_generator.py: Script to generate test audio files based on text data provided in test_data.yaml.
Added consistency_evaluator.py: Script to evaluate the stability of speaker voice timbre by calculating the cosine similarity between embedding vectors of audio segments.
Included test_data.yaml: YAML file containing text data for generating audio.
Updated README.md to include installation instructions, usage guidelines, and details about command-line arguments for both scripts.
Added requirements.txt to specify dependencies required for running the scripts.

6drf21e · 2024-06-15T00:25:50Z

colab test
https://colab.research.google.com/drive/1BzyUvrh1XBtaA6UqPLXiKyhUSitNJsqm

fumiama · 2024-06-18T12:43:57Z

Thanks for your contribution! This helps us a lot. 🤝
However, this repo aims to provide the ChatTTS package so it is hard to add too many bundles 😂.
If you agree, we want to invite you to contribute to a new repo in our organization, 2noise.
Then, we can link to your awesome work to the README in this main repo.

6drf21e · 2024-06-18T12:55:11Z

I am happy to accept the invitation to contribute to the new 2noise repository. Thank you for your recognition! @fumiama

fumiama · 2024-06-18T12:58:42Z

What project name would you like? I will create a repo for you and grant you the full permission.

6drf21e · 2024-06-18T13:32:47Z

Any name will do, I'm not particular about it. Naming things is always the hardest part for me 😂.

fumiama · 2024-06-18T13:55:14Z

All right 😂. I will decide by my self.

fumiama · 2024-06-18T13:58:45Z

And, what license would you like? (I suggest AGPL v3)

6drf21e · 2024-06-18T14:11:40Z

Sounds good!

fumiama · 2024-06-18T14:12:39Z

OK.

fumiama · 2024-06-18T14:16:19Z

I have invited you to https://github.com/2noise/ChatEval
Therefore, I would like to close this PR first.
Feel free to push your codes there! 🎉

6drf21e added 2 commits June 15, 2024 07:42

feat: Add Speaker Consistency Evaluation Feature

2bbc957

fix: Correct package name in requirements.txt

2e07a20

6drf21e mentioned this pull request Jun 16, 2024

我这有 10000 条音色，拿去试试？ 6drf21e/ChatTTS_Speaker#1

Closed

OrvilleQ mentioned this pull request Jun 18, 2024

生成的 csv 中不包含 score，gender，age 和 feature。 6drf21e/ChatTTS_Speaker#2

Open

fumiama added documentation Improvements or additions to documentation enhancement New feature or request good first issue Good for newcomers labels Jun 18, 2024

fumiama closed this Jun 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add Speaker Consistency Evaluation Feature #317

feat: Add Speaker Consistency Evaluation Feature #317

6drf21e commented Jun 14, 2024

6drf21e commented Jun 15, 2024

fumiama commented Jun 18, 2024

6drf21e commented Jun 18, 2024

fumiama commented Jun 18, 2024

6drf21e commented Jun 18, 2024

fumiama commented Jun 18, 2024

fumiama commented Jun 18, 2024

6drf21e commented Jun 18, 2024

fumiama commented Jun 18, 2024

fumiama commented Jun 18, 2024

feat: Add Speaker Consistency Evaluation Feature #317

feat: Add Speaker Consistency Evaluation Feature #317

Conversation

6drf21e commented Jun 14, 2024

Changes

6drf21e commented Jun 15, 2024

fumiama commented Jun 18, 2024

6drf21e commented Jun 18, 2024

fumiama commented Jun 18, 2024

6drf21e commented Jun 18, 2024

fumiama commented Jun 18, 2024

fumiama commented Jun 18, 2024

6drf21e commented Jun 18, 2024

fumiama commented Jun 18, 2024

fumiama commented Jun 18, 2024