Skip to content

Chunk time vs number of sample inconsistency. #211

Answered by snakers4
Rodolfo-S asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @Rodolfo-S ,

Please use the sample numbers from the docstring:

silero-vad/utils_vad.py

Lines 131 to 165 in 7c671a7

This method is used for splitting long audios into speech chunks using silero VAD
Parameters
----------
audio: torch.Tensor, one dimensional
One dimensional float torch.Tensor, other types are casted to torch if possible
model: preloaded .jit silero VAD model
threshold: float (default - 0.5)
Speech threshold. Silero VAD outputs speech probabilities for each audio chunk, probabilities ABOVE this value are considered as SPEECH.
It is better to tune this parameter for each dataset separately, but "…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by snakers4
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
help wanted Extra attention is needed
2 participants
Converted from issue

This discussion was converted from issue #210 on August 08, 2022 04:03.