About the dataset

LibriVAD is an open-source dataset for voice activity detection in noisy environments. It is derived from the clean subset of LibriSpeech and from noise recordings of the DNS Challenge.
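This README does not describe how speech and noise are combined. As a generic illustration only, and not the procedure used by run.sh, the sketch below shows one common way to add noise to a clean signal at a target signal-to-noise ratio. The function name `mix_at_snr` and the plain-list signal representation are assumptions made for the example.

```python
import math


def mix_at_snr(speech, noise, snr_db):
    """Add `noise` to `speech` after scaling it to reach `snr_db` (illustrative)."""
    if not speech or len(speech) != len(noise):
        raise ValueError("speech and noise must be non-empty and equally long")

    # Mean power of each signal.
    speech_power = sum(s * s for s in speech) / len(speech)
    noise_power = sum(n * n for n in noise) / len(noise)
    if noise_power == 0:
        # Silent noise: nothing to scale, return a copy of the speech.
        return list(speech)

    # Choose gain g so that speech_power / (g**2 * noise_power) == 10**(snr_db / 10).
    gain = math.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return [s + gain * n for s, n in zip(speech, noise)]


# Toy signals, not real audio.
mixed = mix_at_snr([1.0, -1.0, 1.0, -1.0], [0.5, 0.5, -0.5, -0.5], snr_db=0)
```

A lower `snr_db` gives a larger noise gain and therefore a harder detection condition.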

Generating LibriVAD

Before generating the dataset, download LibriSpeech, the DNS Challenge noise (datasets/noise) and the forced alignments.
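A quick check that the three inputs are in place can save a failed run. The sketch below is a minimal example, assuming each input lives in its own directory. The helper `missing_inputs` and every path shown are hypothetical placeholders, not names used by the repository.

```python
from pathlib import Path


def missing_inputs(required):
    """Return the names of required inputs whose directory does not exist."""
    return [name for name, path in required.items() if not Path(path).is_dir()]


# Hypothetical locations; replace them with the paths you set in run.sh.
required = {
    "LibriSpeech": "data/LibriSpeech",
    "DNS Challenge noise": "data/DNS-Challenge/datasets/noise",
    "forced alignments": "data/alignments",
}

missing = missing_inputs(required)
if missing:
    print("Missing inputs:", ", ".join(missing))
```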

To generate LibriVAD, clone the repo, edit run.sh so that it points to the correct paths, then run it:

git clone https://github.com/JorisCos/LibriMix
cd LibriMix 
./run.sh storage_dir