-
Notifications
You must be signed in to change notification settings - Fork 4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Strange behavior of "stream" example (Linux, amd64) #354
Comments
The The larger models are quite heavy for real-time processing. |
Sure, will try that out. Thanks a lot. I already stumbled on that other thread suggesting to set the step size at least twice more than encoding results from bench on my own hardware |
Strange, I'm getting the same very slow transcription results on Windows too. Downloaded latest release and tried out the artifacts from the latest commit to run into the same slow and inaccurate transcriptions on both builds. Very weird... |
There was a bug in the I think this fixes both the garbage results + the floating point exception |
@ggerganov there seems to be a problem with |
@meakbiyik whisper.cpp/examples/stream/stream.cpp Line 438 in 8738427
Let me know the exact command / parameters that you are using. |
@ggerganov not sure if this is the issue since I actually do not use "-kc" argument anyways, it hallucinates a bit too much :) But you are right, I set the "--step" argument so the issue is probably not VAD/high-pass filter. |
Here's a small update on my side: apparently there was an issue in my code that caused some absurd stutters. I resolved that, and now the master branch works perfectly - but I can still see clear difference in performance for low-quality sound between master and 385236d. I am now guessing that there were some optimizations in the matrix multiplications that reduced the robustness of the model somehow against noise. For all other purposes, everything works well :) |
This is very likely related to the new temperature fallback strategy that is enabled by default. whisper.cpp/examples/stream/stream.cpp Lines 617 to 620 in c9aeb33
|
I suspect that the main issue is not the temperature (since I started experiencing it pretty much immediately after the above-referenced commit). My bet would be on either the loss of precision from 32-16 bit conversions, or some bug related to them, since this can directly cause issues with noise robustness (and possibly the overall quality of tiny models) without creating a problem for high-SNR data and bigger models. |
Hey @ggerganov! Now that the temperature is fixed, to stay as close as possible to the original whisper model, can we re-enable it in stream example as well? It would overall be ideal if we can update the stream parameters to align with the |
The problem with the fallback is that when it triggers it increases the decoding time significantly. |
Hello there,
After doing some smoke tests of whisper.cpp utilizing ./main (all of that was working just perfectly with different language models) I moved to "stream" example - https://github.com/ggerganov/whisper.cpp/tree/master/examples/stream
The thing is, no matter what parameters I use (number of threads, different models, different step sizes/length), I cannot get it to recognize anything distant from the real-time speeds.
The closest I can get, is to use tiny.en model while keeping all the rest parameters unspecified, like this:
If I start adding any parameters to the above, or deviate from the tiny-en model, I start getting unpredictable results - garbage output, containing just a single word / few words, empty lines thrown in stdout over and over again, last displayed line being repeated over and over again.
One example - if I just add -vth 0.6 parameter to the above, I'm starting to get these lines:
If I set "--step 0", as in the "Sliding window mode with VAD" example, it just fails with "Floating point exception (core dumped)"
If I switch to any other bit heavier model, all allocated CPU threads are just maxed out 100% and printed results are almost garbage.
Ubuntu 22.10, AMD Ryzen 5 3400G (4 cores / 8 threads)
I appreciate any direction for the troubleshooting. I prob can profile the execution, to see where's the most CPU time is spent on, if that helps. I just cannot believe that my CPU cannot handle all that :)
The text was updated successfully, but these errors were encountered: