Audio file vocal synthesis INPUT cannot exceed a certain length #77

Open
andrew-fennell opened this issue Apr 18, 2022 · 0 comments
Labels
bug Something isn't working software

Comments

@andrew-fennell
Owner

Problem

Audio files longer than a minute do not work as text input for vocal synthesis. This applies when you supply an audio file to "copy" the words from, rather than providing the text directly.

Error:
image

Proposed solution

  • Cut the provided audio into segments
  • Transcribe each audio segment
  • Combine transcriptions
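
The three steps above could be sketched roughly as follows. This is only an illustration, not the project's code: `split_samples`, `transcribe_long`, the 30-second default, and the `transcribe` callable (a stand-in for whatever speech-to-text call the project uses) are all hypothetical names and values.

```python
from typing import Callable, List, Sequence


def split_samples(
    samples: Sequence[float], sample_rate: int, max_seconds: float = 30.0
) -> List[Sequence[float]]:
    """Cut raw audio samples into consecutive segments of at most max_seconds."""
    chunk_len = int(sample_rate * max_seconds)
    if chunk_len <= 0:
        raise ValueError("sample_rate * max_seconds must be at least one sample")
    return [samples[i:i + chunk_len] for i in range(0, len(samples), chunk_len)]


def transcribe_long(
    samples: Sequence[float],
    sample_rate: int,
    transcribe: Callable[[Sequence[float], int], str],  # hypothetical STT hook
    max_seconds: float = 30.0,
) -> str:
    """Transcribe each segment separately and join the non-empty results."""
    parts = [
        transcribe(segment, sample_rate)
        for segment in split_samples(samples, sample_rate, max_seconds)
    ]
    return " ".join(p.strip() for p in parts if p.strip())
```

Passing the transcriber in as a callable keeps the sketch independent of any particular speech-to-text library; the segment length would need to be chosen below whatever limit triggers the error.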

Cutting at fixed points could split words and sentences across segment boundaries, which would reduce the quality of the transcriptions.
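
One possible mitigation, offered as an assumption rather than a tested fix, is to make neighbouring segments overlap by a couple of seconds and then drop the words that were transcribed twice. The function names, the 30-second window, and the 2-second overlap below are hypothetical. The merge is a heuristic: a word cut at a boundary may be transcribed differently in each segment, in which case no overlap is detected and both versions are kept.

```python
from typing import List, Tuple


def overlapping_windows(
    total_seconds: float, window_seconds: float = 30.0, overlap_seconds: float = 2.0
) -> List[Tuple[float, float]]:
    """Return (start, end) times where each window overlaps the previous one."""
    if not 0 <= overlap_seconds < window_seconds:
        raise ValueError("overlap must be non-negative and shorter than the window")
    step = window_seconds - overlap_seconds
    windows = []
    start = 0.0
    while start < total_seconds:
        end = min(start + window_seconds, total_seconds)
        windows.append((start, end))
        if end >= total_seconds:
            break
        start += step
    return windows


def merge_transcripts(parts: List[str], max_overlap_words: int = 10) -> str:
    """Join transcripts, removing words repeated across a segment boundary."""
    merged: List[str] = []
    for part in parts:
        words = part.split()
        limit = min(max_overlap_words, len(merged), len(words))
        overlap = 0
        # Look for the longest run of words that ends the merged text
        # and also begins the next transcript (case-insensitive).
        for k in range(limit, 0, -1):
            tail = [w.lower() for w in merged[-k:]]
            head = [w.lower() for w in words[:k]]
            if tail == head:
                overlap = k
                break
        merged.extend(words[overlap:])
    return " ".join(merged)
```

For example, a 70-second file with these defaults would be covered by three windows, and `merge_transcripts` would collapse the repeated words at each seam when the two transcriptions agree.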

@andrew-fennell andrew-fennell added bug Something isn't working software labels Apr 18, 2022