-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Do something more for the silence mic hallucinating part #20
Comments
@aiaimimi0920 Is the silence mic hallucination better now? |
no, now when you not talking ,it will automatically adds "Thank you" or "you" maybe I can use the method mentioned in this issue to solve it |
Is this fixed by the latest changes? |
Not resolved.
Possible solutions:
|
I've seen solutions like implementing an audio denoise https://github.com/snakers4/silero-vad but it seems more work to use than ggml-whisper. |
It is a PyTorch model, and we may need a C++ implementation, or use iree.gd? |
I can't estimate how long it will take to combine silero-vad with Whisper together but it should be feasible inside of iree.gd. |
Exposed the vad option for threshold. Also made it so that if it's halucinating, only take max 4 tokens, and no more(as they could be legit characters). |
It's not very bad as is, maybe we just expose the vad option threshold and play with it.
The text was updated successfully, but these errors were encountered: