VAD detects humming as speech #113

Open
CuriousMonkey7 opened this issue Mar 2, 2025 · 1 comment

Comments

@CuriousMonkey7
Contributor

While using the llm_voice_chat demo, I hummed randomly, and the VAD model detected it as speech. However, the STT returned empty text, resulting in this:

Image

I tried setting vad_options.threshold to 0.7 but didn't see any change. This seems more like a VAD model issue than a FastRTC problem. Just mentioning it here in case some parameter can be tuned to resolve it.

@freddyaboulton
Owner

Thanks @CuriousMonkey7 - that's interesting. I'll look into it. I think setting a higher threshold should have fixed it though.
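
Independent of the threshold question, one possible workaround for the symptom described above (the VAD fires, but the STT returns empty text) is to skip the reply when the transcript is blank. The following is only a minimal sketch, not part of the library: `reply_if_speech`, `generate_reply`, and `min_chars` are hypothetical names introduced for illustration, and the real handler in the llm_voice_chat demo may be structured differently.

```python
from typing import Callable, Iterator, Optional


def reply_if_speech(
    transcript: Optional[str],
    generate_reply: Callable[[str], str],
    min_chars: int = 1,
) -> Iterator[str]:
    """Yield a reply only when the STT transcript contains real text.

    `generate_reply` is a hypothetical stand-in for whatever calls the LLM.
    """
    # Treat None, empty, and whitespace-only transcripts the same way.
    text = (transcript or "").strip()
    if len(text) < min_chars:
        # Probably a VAD false positive (humming, background noise): stay silent.
        return
    yield generate_reply(text)


# Usage: an empty transcript produces no reply, real text is passed through.
print(list(reply_if_speech("", lambda t: f"echo: {t}")))
print(list(reply_if_speech(" hello ", lambda t: f"echo: {t}")))
```

This does not stop the VAD from triggering on humming. It only avoids sending an empty message to the LLM when that happens.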
