Improve Language detection #265
Comments
What have you tried so far? Maybe you can share your current changes and we can help you implement this.
Actually, I am not able to figure out how to proceed. The fix uses several variables, such as "content_frames", "N_FRAMES", and "segment", and the function "pad_or_trim", and I cannot work out how to get the correct values for them in the faster_whisper implementation.
Has this been implemented?
Any update?
FYI, I created a new PR to implement this feature: #732
How will the model behave if the conditions are not met?
Since Whisper detects the language from the first 30 seconds of the audio, language detection is sometimes wrong. For example:
This video is in English, but both whisper and faster_whisper detect its language as Hindi ("hi"). There is a solution for this issue in whisper here. Since I am new to CTranslate2, I am having difficulty porting this solution to faster_whisper. Can someone help me with this? Thanks.
P.S. I have tested the solution in whisper and it works.
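To make the idea concrete, here is a minimal sketch of detecting the language over several 30-second windows instead of only the first one. It is not the linked whisper fix and not faster_whisper's own code. The helper names `window_bounds` and `pick_language` are hypothetical, and averaging the per-window probabilities is just one possible way to combine them. It assumes 16 kHz audio and that you already have, for each window, a dict of language probabilities from whatever detection call you use.

```python
from typing import Dict, List, Optional, Tuple

SAMPLE_RATE = 16000   # Whisper models work on 16 kHz mono audio
WINDOW_SECONDS = 30   # length of one detection window


def window_bounds(num_samples: int,
                  max_windows: Optional[int] = None) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of consecutive 30-second windows."""
    size = SAMPLE_RATE * WINDOW_SECONDS
    bounds = [(start, min(start + size, num_samples))
              for start in range(0, num_samples, size)]
    return bounds if max_windows is None else bounds[:max_windows]


def pick_language(window_probs: List[Dict[str, float]]) -> Optional[Tuple[str, float]]:
    """Average the language probabilities over all windows and return the best one.

    Each item in window_probs maps a language code to its probability for one
    window (hypothetical input; produce it with your own detection call).
    """
    if not window_probs:
        return None
    totals: Dict[str, float] = {}
    for probs in window_probs:
        for lang, prob in probs.items():
            totals[lang] = totals.get(lang, 0.0) + prob
    best = max(totals, key=totals.get)
    return best, totals[best] / len(window_probs)


if __name__ == "__main__":
    # A 75-second clip is split into two full windows and one shorter one.
    print(window_bounds(SAMPLE_RATE * 75))
    # The first window leans towards "hi", the later ones clearly say "en".
    print(pick_language([
        {"hi": 0.6, "en": 0.4},
        {"hi": 0.1, "en": 0.9},
        {"hi": 0.2, "en": 0.8},
    ]))
```

With a scheme like this, a misleading first window (for example music or a short intro in another language) can be outvoted by the rest of the audio, at the cost of running detection more than once.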