This project is the implementation of DARCN, which was accepted at INTERSPEECH 2020. The network combines dynamic attention, a psychological characteristic of humans, with recursive learning. Experimental results indicate that the proposed method achieves consistent metric improvements over previous approaches.
Put the noisy-clean pairs into ./Dataset/train and ./Dataset/dev.
Change the parameter settings according to your directory layout (in config.py).
Run json_extract.py to generate the JSON files, which record the utterance file names for both the training and validation sets.
# Run json_extract.py
python json_extract.py
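For readers who want a feel for what this step produces, here is a minimal sketch of writing a JSON list of utterance file names. It is not the code in json_extract.py: the helper names `list_utterances` and `write_file_list`, the `.wav` extension, and the output file name `train_files.json` are all assumptions made for illustration, and the demo runs on a temporary directory rather than on ./Dataset.

```python
import json
import os
import tempfile


def list_utterances(directory, extension=".wav"):
    """Return the sorted file names in `directory` ending with `extension`.

    Hypothetical helper: the real script may scan the dataset differently.
    """
    return sorted(
        name for name in os.listdir(directory)
        if name.lower().endswith(extension)
    )


def write_file_list(directory, json_path, extension=".wav"):
    """Write the utterance file names of `directory` to `json_path` as JSON."""
    names = list_utterances(directory, extension)
    with open(json_path, "w", encoding="utf-8") as f:
        json.dump(names, f, indent=2)
    return names


# Demo on a throwaway directory so the sketch runs anywhere.
with tempfile.TemporaryDirectory() as root:
    # Create empty placeholder files; only the names matter here.
    for name in ("b.wav", "a.wav", "notes.txt"):
        open(os.path.join(root, name), "w").close()

    json_path = os.path.join(root, "train_files.json")  # assumed file name
    demo_names = write_file_list(root, json_path)

    # Read the list back to confirm what was stored.
    with open(json_path, encoding="utf-8") as f:
        demo_loaded = json.load(f)
```

Storing only file names keeps the JSON small and lets the data loader resolve the noisy and clean paths from the directories set in config.py, assuming both halves of a pair share a file name.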
Network Training
# Run main.py to begin network training
python main.py
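During training, the model with the smallest MSE is the one that gets saved. The sketch below shows one common way to track that minimum across epochs. It is an illustration only: the class name `BestModelTracker` is hypothetical, the MSE is assumed to be measured on the validation set, and the loss values are placeholders rather than results from this project.

```python
class BestModelTracker:
    """Remember the lowest MSE seen so far (hypothetical helper)."""

    def __init__(self):
        self.best_mse = float("inf")
        self.best_epoch = None

    def update(self, epoch, mse):
        """Return True when `mse` is a new minimum, i.e. a save is due."""
        if mse < self.best_mse:
            self.best_mse = mse
            self.best_epoch = epoch
            return True
        return False


tracker = BestModelTracker()

# Placeholder per-epoch MSE values, for illustration only.
history = [0.052, 0.047, 0.049, 0.041]

# Epochs at which a training loop would overwrite the saved model.
saved_epochs = [
    epoch for epoch, mse in enumerate(history)
    if tracker.update(epoch, mse)
]
```

In a real training loop, the `True` branch is where the model weights would be written to disk, so the saved file always corresponds to the best epoch so far.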
Test
After training, the model with the smallest MSE is saved into ./Best_model. Put the test mix utterances into ./Test/mix.
# Run Test.py to test the model
python Test.py
If you use our code in your research or wish to refer to the baseline results, please use the following BibTeX entry.
@inproceedings{Li2020,
author={Andong Li and Chengshi Zheng and Cunhang Fan and Renhua Peng and Xiaodong Li},
title={{A Recursive Network with Dynamic Attention for Monaural Speech Enhancement}},
year=2020,
booktitle={Proc. Interspeech 2020},
pages={2422--2426},
doi={10.21437/Interspeech.2020-1513},
url={http://dx.doi.org/10.21437/Interspeech.2020-1513}
}