Skip to content

❓ Relative thresholding using VADIterator #367

Answered by snakers4
mukundt asked this question in Q&A
Discussion options

You must be logged in to vote

Hi,

Currently we have a static threshold of 0.85

One approach may be to fiddle with VAD activation and deactivation thresholds separately.
Another approach may be to actually have some accumulation buffer that postpones the VAD activation, but if activated, it activates from a previous early point.

(1) there is a large amount of background noise (2) there is a quiet environment but the speaker is also very quiet.

The VAD normalizes audio internally, so in a sense both of these cases are very similar.
As for the quiet audio, if all audio is quiet, you can try re-normalizing it and / or having some naive median rejection algorithm, where there is some median energy threshold where every…

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@mukundt
Comment options

@snakers4
Comment options

Answer selected by snakers4
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
help wanted Extra attention is needed
2 participants
Converted from issue

This discussion was converted from issue #366 on August 15, 2023 20:00.