Refactoring #4
Conversation
That was fast! Thanks!
# TODO: Use eos_id as ignore_id.
# tgt_key_padding_mask = decoder_padding_mask(ys_in_pad, ignore_id=eos_id)
It is commented out since existing models are trained with it disabled.
If it is enabled, the WER becomes worse.
We should enable it when we start to train a new model.
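For reference, a minimal sketch of what enabling this could look like, assuming `decoder_padding_mask` simply marks positions equal to `ignore_id` (here `eos_id`, since the padded targets are filled with EOS); the actual implementation in this repository may differ:

```python
import torch


def decoder_padding_mask(ys_in_pad: torch.Tensor, ignore_id: int) -> torch.Tensor:
    """Return a bool mask that is True at padded target positions.

    Assumes the targets are padded with `ignore_id` (eos_id here), so any
    position equal to it is excluded from the decoder's attention.
    """
    return ys_in_pad == ignore_id


# Example: two target sequences padded with eos_id = 1
eos_id = 1
ys_in_pad = torch.tensor([[5, 7, 9, 1, 1],
                          [4, 6, 1, 1, 1]])
tgt_key_padding_mask = decoder_padding_mask(ys_in_pad, ignore_id=eos_id)
# tensor([[False, False, False,  True,  True],
#         [False, False,  True,  True,  True]])
```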
The following is the WER from the model trained by #3 and decoded with this pull request (with n-gram LM rescoring and attention decoder; the model is trained for 26 epochs).

For test-clean, the WERs of the different settings are:

ngram_lm_scale_0.7_attention_scale_0.6 2.96 (best for test-clean)
ngram_lm_scale_0.9_attention_scale_0.5 2.96
ngram_lm_scale_0.7_attention_scale_0.5 2.97
ngram_lm_scale_0.7_attention_scale_0.7 2.97
ngram_lm_scale_0.9_attention_scale_0.6 2.97
ngram_lm_scale_0.9_attention_scale_0.7 2.97
ngram_lm_scale_0.9_attention_scale_0.9 2.97
ngram_lm_scale_1.0_attention_scale_0.7 2.97
ngram_lm_scale_1.0_attention_scale_0.9 2.97
ngram_lm_scale_1.0_attention_scale_1.0 2.97
ngram_lm_scale_1.0_attention_scale_1.1 2.97
ngram_lm_scale_1.0_attention_scale_1.2 2.97
ngram_lm_scale_1.0_attention_scale_1.3 2.97
ngram_lm_scale_1.1_attention_scale_0.9 2.97

Epochs 14-26 are used in model averaging.

I have uploaded the above checkpoints to https://huggingface.co/csukuangfj/conformer_ctc/tree/main

To reproduce the decoding result:

1. Clone the above repo containing the checkpoints and put it into conformer_ctc/exp/.
2. After step 1, you should have conformer_ctc/exp/epoch-{14,15,...,26}.pt.
3. Run
   ./prepare.sh
   ./conformer_ctc/decode.py --epoch 26 --avg 13 --max-duration=50
4. You should get the above result.

The results are expected to become better if trained with more epochs.
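For context on the `--epoch 26 --avg 13` options above, here is a minimal sketch of what averaging the epoch 14-26 checkpoints could look like, assuming each checkpoint stores its parameters under a "model" key and that plain element-wise averaging of the state_dict is used; the averaging code actually invoked by decode.py may differ:

```python
import torch


def average_checkpoints(filenames):
    """Element-wise average of the float parameters stored in `filenames`.

    Assumes each .pt file is a dict holding the model state_dict under "model".
    """
    n = len(filenames)
    avg = torch.load(filenames[0], map_location="cpu")["model"]
    for f in filenames[1:]:
        state = torch.load(f, map_location="cpu")["model"]
        for k, v in state.items():
            if v.is_floating_point():
                avg[k] += v
    for v in avg.values():
        if v.is_floating_point():
            v /= n  # in-place; integer buffers keep the first checkpoint's value
    return avg


# Epochs 14-26: 13 checkpoints, matching --epoch 26 --avg 13
filenames = [f"conformer_ctc/exp/epoch-{i}.pt" for i in range(14, 27)]
averaged_state_dict = average_checkpoints(filenames)
```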
Great!!
Nice! I'm curious -- did you ever try to run the same thing but with MMI instead of CTC?
Yes, I am planning to do that with a pretrained P. All the related code can be found in snowfall.
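For readers unfamiliar with the objective being discussed: with (LF-)MMI the model is trained to maximize the log-probability of the numerator (reference-constrained) paths minus that of the denominator (all-paths) graph, where P presumably refers to the phone LM used to build the denominator. The sketch below is purely illustrative, with dummy scores; the real loss in snowfall obtains these totals from k2 graph intersections:

```python
import torch

# Dummy per-utterance total log-probabilities; in practice these come from
# intersecting the network output with the numerator / denominator graphs.
num_tot_scores = torch.tensor([-12.3, -15.1])
den_tot_scores = torch.tensor([-10.8, -13.9])

# MMI objective: maximize (num - den), i.e. minimize the negated difference.
mmi_loss = (den_tot_scores - num_tot_scores).sum()
print(mmi_loss)
```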
Merging it to avoid conflicts.
TODOs