Skip to content

❓ vad bad performance on child speech? #52

Answered by snakers4
garymmi asked this question in Q&A
Discussion options

You must be logged in to vote

Several remarks here

  • We do not have a lot of children's speech per se in the training data
  • This sample just looks too short to be cut by a VAD
  • We were planning to release an adaptive post-processing tweak that would make parameter tuning obsolete
  • You can use this snippet to tune params, albeit it should be done on longer audio files, becase this one is too short to be meaninful imo
speech_timestamps = get_speech_ts(wav, model,
                                  num_samples_per_window=2000,
                                  num_steps=8,
                                  visualize_probs=True)

This snippet produces this image

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@snakers4
Comment options

@garymmi
Comment options

Answer selected by snakers4
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #51 on April 01, 2021 08:43.