Change the repository type filter
All
Repositories list
28 repositories
FN-SSL
PublicThe Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source LocalizationFS-EEND
PublicThe official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]NBSS
PublicThe official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberationUMA-ASR
PublicATST-SED
PublicThis repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".RealMAN
PublicA description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]SAR-SSL
PublicA python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer” [TASLP 2024]ATST-RCT
Public- A library built for easier audio self-supervised training, downstream tasks evaluation
RVAE-EM
PublicOfficial PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]FullSubNet
PublicPyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."McNet
PublicThe official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023RCT
PublicNarrowband_DeepFiltering
PublicRTF_InterFrameSpecSub
PublicRS_noisePSD
PublicDP_RTF_SSL
Publicbss_ctf_lasso
Publicdereverb_ctf_nonneg
PublicBSS_CTF_EM
PublicLSTM-noisePSD
Publicctf_mint
PublicOnlineSSL_DPRTF_EG
PublicSMIF_online_dereverb
Public