Handle NaN embeddings in speaker diarization. #1461

Merged
csukuangfj merged 3 commits into k2-fsa:master from the fix-speaker-diarization branch
Oct 24, 2024

Conversation

csukuangfj
Collaborator

@csukuangfj csukuangfj merged commit a5295aa into k2-fsa:master Oct 24, 2024
68 of 201 checks passed
@csukuangfj csukuangfj deleted the fix-speaker-diarization branch October 24, 2024 06:03
@jamescarter-le

I encounter this NaN issue with most models (3dspeaker_speech_eres2net_sv_en_voxceleb_16k works). For example, the NEMO models return NaN for most (but not all) audio clips.

I'm using .NET, org.k2fsa.sherpa.onnx 1.10.30
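
For anyone trying to narrow down which clips are affected, here is a minimal sketch of a NaN check on extracted embeddings. It is plain Python and does not use the sherpa-onnx API: the helper names `has_nan` and `drop_nan_embeddings` are hypothetical, and it assumes each embedding is available as a list of floats.

```python
import math


def has_nan(embedding):
    """Return True if any component of the embedding is NaN."""
    return any(math.isnan(x) for x in embedding)


def drop_nan_embeddings(embeddings):
    """Keep (index, embedding) pairs whose embedding has no NaN component."""
    return [(i, e) for i, e in enumerate(embeddings) if not has_nan(e)]


# Hypothetical embeddings for three clips; the second one contains a NaN.
clips = [
    [0.1, -0.3, 0.7],
    [float("nan"), 0.2, 0.5],
    [0.0, 0.0, 1.0],
]

kept = drop_nan_embeddings(clips)
print([i for i, _ in kept])  # prints [0, 2]
```

Logging the indices that get dropped should show whether the NaN values depend on the model, the clip, or both.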

Development

Successfully merging this pull request may close these issues.

Speaker diarization errors
2 participants