Improve Language detection #265

ab-pandey · 2023-05-30T07:13:48Z

Since wishper detects language based on first 30 secs of the audio, sometimes there are errors in language detection. For example
This video is in english but both whisper and faster_whisper detects language as hindi "hi". There is a solution for this issue in whisper here. Since I am new to Ctranslate2, I am having difficulty in cloning this solution to faster_whisper. Can someone help me on this? Thanks.
P.S. I have tested the solution in whisper and it works.

guillaumekln · 2023-06-01T16:18:34Z

Since I am new to Ctranslate2, I am having difficulty in cloning openai/whisper#676 to faster_whisper. Can someone help me on this? Thanks

What have you tried so far? Maybe you can share your current changes and we can help you implement this.

ab-pandey · 2023-06-05T05:41:07Z

Actually I am not able to figure out how to proceed further. There are several variables in the fix like "content_frames", "N_FRAMES", "segment"; functions used "pad_or_trim" in this solution which I am not able to figure out how to get the correct values for faster_whisper implementation

elloza · 2024-02-22T11:29:52Z

has been this implemented?

dilerbatu · 2024-03-01T15:36:31Z

Any update ?

trungkienbkhn · 2024-03-04T03:45:25Z

FYI, I created a new PR to implement this feature: #732

ldolegowski92 · 2024-05-13T11:41:40Z

Do Twojej wiadomości, utworzyłem nowy PR, aby wdrożyć tę funkcję: #732

How will the model behave if the conditions are not met?

ab-pandey closed this as completed Jun 9, 2023

ab-pandey reopened this Jun 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Language detection #265

Improve Language detection #265

ab-pandey commented May 30, 2023 •

edited

Loading

guillaumekln commented Jun 1, 2023

ab-pandey commented Jun 5, 2023

elloza commented Feb 22, 2024

dilerbatu commented Mar 1, 2024

trungkienbkhn commented Mar 4, 2024

ldolegowski92 commented May 13, 2024

Improve Language detection #265

Improve Language detection #265

Comments

ab-pandey commented May 30, 2023 • edited Loading

guillaumekln commented Jun 1, 2023

ab-pandey commented Jun 5, 2023

elloza commented Feb 22, 2024

dilerbatu commented Mar 1, 2024

trungkienbkhn commented Mar 4, 2024

ldolegowski92 commented May 13, 2024

ab-pandey commented May 30, 2023 •

edited

Loading