Reference paper: END-TO-END SPEECH ENHANCEMENT BASED ON DISCRETE COSINE TRANSFORM
arXiv: https://arxiv.org/abs/1910.07840
Environment setup:
1) TensorFlow 1.13
2) librosa, numpy, scipy
Usage:
- Run data_prepare.py to create the packed features. Command line:
python data_prepare.py pack_waves --workspace=.. --clean_dir=path_to_clean_wav --noisy_dir=path_to_noisy_wav
- Run train.py to train the U-Net. Command line:
python train.py
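
The pack_waves command takes one directory of clean recordings and one of noisy recordings. A minimal sketch of how the two directories could be checked before packing is given below. It assumes that a clean file and its noisy counterpart share the same file name, which is an assumption and not something confirmed by data_prepare.py. The helper name pair_wavs is hypothetical and is not part of this repository.

```python
from pathlib import Path


def pair_wavs(clean_dir, noisy_dir):
    """Return sorted (clean_path, noisy_path) pairs that share a file name.

    Hypothetical helper: assumes clean and noisy files are matched by name.
    """
    clean = {p.name: p for p in Path(clean_dir).glob("*.wav")}
    noisy = {p.name: p for p in Path(noisy_dir).glob("*.wav")}
    # Keep only names present in both directories, in a stable order.
    shared = sorted(clean.keys() & noisy.keys())
    return [(clean[name], noisy[name]) for name in shared]
```

Calling pair_wavs("path_to_clean_wav", "path_to_noisy_wav") before packing may help spot missing or misnamed files early, since unmatched files are silently left out of the result.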
Inference:
1) Run the model from the checkpoint (ckpt): "python infer.py"
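
For orientation, the reference paper works on discrete cosine transform (DCT) coefficients of the waveform. The sketch below illustrates a framewise orthonormal DCT-II and its inverse using numpy only. It is not taken from data_prepare.py or infer.py: the function names, the frame length of 512, and the use of non-overlapping frames are all assumptions chosen for illustration, and the repository may frame and window the signal differently.

```python
import numpy as np


def dct_matrix(n):
    """Orthonormal DCT-II basis as an (n, n) matrix; row k is basis vector k."""
    k = np.arange(n)[:, None]   # frequency index
    t = np.arange(n)[None, :]   # sample index
    basis = np.cos(np.pi * k * (2 * t + 1) / (2 * n)) * np.sqrt(2.0 / n)
    basis[0] /= np.sqrt(2.0)    # rescale the DC row so the matrix is orthogonal
    return basis


def frame_dct(wave, frame_len=512):
    """Split a 1-D waveform into non-overlapping frames and DCT each frame.

    frame_len=512 is an illustrative assumption, not the repository's setting.
    """
    n_frames = -(-len(wave) // frame_len)   # ceiling division
    padded = np.zeros(n_frames * frame_len, dtype=np.float64)
    padded[:len(wave)] = wave               # zero-pad the last frame
    frames = padded.reshape(n_frames, frame_len)
    return frames @ dct_matrix(frame_len).T


def frame_idct(coeffs, length):
    """Invert frame_dct and trim the zero padding back to `length` samples."""
    frame_len = coeffs.shape[1]
    frames = coeffs @ dct_matrix(frame_len)  # inverse of an orthogonal transform
    return frames.reshape(-1)[:length]
```

Because the basis is orthogonal, frame_idct(frame_dct(wave), len(wave)) recovers the input up to floating-point error. The coefficients are real-valued, so no separate phase has to be handled when going back to the time domain.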