Handle NaN embeddings in speaker diarization. #1461

csukuangfj · 2024-10-24T04:23:04Z

jamescarter-le · 2024-11-03T21:19:11Z

I encounter this NaN issue with most models. The NeMo models, for example, return NaN for most (but not all) audio clips. 3dspeaker_speech_eres2net_sv_en_voxceleb_16k works.

I'm using .NET, org.k2fsa.sherpa.onnx 1.10.30

Handle NaN embeddings in speaker diarization.

aa6108c

See also thewh1teagle/sherpa-rs#33

csukuangfj mentioned this pull request Oct 24, 2024

Speaker diarization errors thewh1teagle/sherpa-rs#33

Open

csukuangfj added 2 commits October 24, 2024 13:26

Minor fixes

dccf714

Use std::none_of() to replace sum() for computing isNaN

6f5ff1c
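
For readers unfamiliar with the idiom named in the commit title above, here is a minimal sketch of a `std::none_of()`-based NaN check. It is not the code from this PR. `HasNoNaN` and `DropNaNEmbeddings` are hypothetical names, and representing an embedding as `std::vector<float>` is an assumption. A sum-based check relies on NaN propagating through addition, so it has to visit every element, and it can report a false positive when the running sum hits `inf + (-inf)`. `std::none_of()` stops at the first NaN it finds.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical helper (not taken from the PR): returns true when no element
// of the embedding is NaN. std::none_of stops at the first NaN it sees.
bool HasNoNaN(const std::vector<float> &embedding) {
  return std::none_of(embedding.begin(), embedding.end(),
                      [](float v) { return std::isnan(v); });
}

// Hypothetical helper: copies only the NaN-free embeddings and records the
// original index of each one that was kept, so later stages can map results
// back to the input segments.
std::vector<std::vector<float>> DropNaNEmbeddings(
    const std::vector<std::vector<float>> &embeddings,
    std::vector<int> *kept_indices) {
  assert(kept_indices != nullptr);
  kept_indices->clear();

  std::vector<std::vector<float>> valid;
  for (std::size_t i = 0; i != embeddings.size(); ++i) {
    if (HasNoNaN(embeddings[i])) {
      valid.push_back(embeddings[i]);
      kept_indices->push_back(static_cast<int>(i));
    }
  }
  return valid;
}
```

Keeping the original indices is one possible design choice: it lets a caller map results computed on the filtered set back to the input segments. Whether the merged change skips, zeroes, or otherwise treats NaN embeddings is not stated in this conversation.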

csukuangfj merged commit a5295aa into k2-fsa:master Oct 24, 2024
68 of 201 checks passed

csukuangfj deleted the fix-speaker-diarization branch October 24, 2024 06:03
