WIP: Add timestamps for streaming ASR #119
Comparing the alignment with https://github.com/CorentinJ/librispeech-alignments
A second comparison using a different utterance:
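For anyone who wants to put a number on comparisons like the ones above, here is a rough sketch. It assumes the predicted and reference timestamps are already available as two equal-length lists of seconds, one entry per token or word. The function name `alignment_errors` is made up for illustration and is not part of this PR or of librispeech-alignments.

```python
def alignment_errors(predicted, reference):
    """Return (mean, max) absolute difference in seconds.

    Hypothetical helper: `predicted` and `reference` are assumed to be
    equal-length lists of timestamps in seconds for the same tokens.
    """
    if len(predicted) != len(reference):
        raise ValueError("timestamp lists must have the same length")
    if not predicted:
        raise ValueError("timestamp lists must not be empty")
    # Per-token absolute deviation between the two alignments.
    diffs = [abs(p - r) for p, r in zip(predicted, reference)]
    return sum(diffs) / len(diffs), max(diffs)


mean_err, max_err = alignment_errors([0.5, 1.0], [0.25, 1.0])
print(f"mean={mean_err:.3f}s max={max_err:.3f}s")
```

A mean and a maximum are only one possible summary; a per-token listing may be more useful when a single token is badly off.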
Unlike #52, the encoder model in this PR is an LSTM rather than a Conformer. Also, the first token is no longer emitted on the first frame.
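As a rough illustration of how a token's frame index maps to a time in seconds, here is a minimal sketch. The subsampling factor of 4 and the 10 ms frame shift are assumptions chosen for the example, not values taken from this PR, and `frames_to_seconds` is a hypothetical helper name.

```python
def frames_to_seconds(frame_indices, subsampling_factor=4, frame_shift_ms=10.0):
    """Convert encoder output frame indices to timestamps in seconds.

    Assumed defaults (not confirmed by this PR): the encoder subsamples
    the input features by 4, and features use a 10 ms frame shift.
    """
    return [
        round(i * subsampling_factor * frame_shift_ms / 1000.0, 3)
        for i in frame_indices
    ]


# A token emitted on encoder frame 5 maps to 5 * 4 * 10 ms = 0.2 s.
print(frames_to_seconds([0, 5, 12]))
```

With these assumed values, a token emitted on frame 0 would get timestamp 0.0 s.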
Cool!!
@csukuangfj What is missing in this PR?
I think I only made changes to lstm_transducer_stateless. The other folders for streaming models have not been updated yet.
OK! Let me try to take care of it today.
You can use the changes from this PR. I am closing it now. Thanks again!
@csukuangfj You mean I should create my own branch with your changes, right?
Yes, you can use whichever approach you think works best.
Use the model from k2-fsa/icefall#558 for testing.
Start the server
Start the client
wave=./test_wavs/1089-134686-0001.wav
wave=./test_wavs/1221-135766-0002.wav

./sherpa/bin/pruned_stateless_emformer_rnnt2/streaming_client.py \
  --server-port 6006 \
  $wave
Output from the client: