test #15

csukuangfj · 2023-09-27T04:12:33Z

No description provided.

* utils: add symlink_or_copyfile * pruned_transducer_stateless7: use symlinks (when possible) to output best epochs * Rename function --------- Co-authored-by: Yifan Yang <[email protected]>

Co-authored-by: Fangjun Kuang <[email protected]>

* add CTC loss option in zipformer recipe * add ctc_decode.py * support CTC model export, add jit_pretrained_ctc.py, pretrained_ctc.py * update README.md and RESULTS.md * add CI test

* Update model.py * Update train.py * Update decoder.py

* CTC loss return tensor * Update model.py

* copy files * update train.py * small fixes * Add decode.py * Fix dataloader in decode.py * add blank penalty * Add blank-penalty to other decoding method * Minor fixes * add zipformer2 recipe * Minor fixes * Remove pruned7 * export and test models * Replace bpe with tokens in export.py and pretrain.py * Minor fixes * Minor fixes * Minor fixes * Fix export * Update results * Fix zipformer-ctc * Fix ci * Fix ci * Fix CI * Fix CI --------- Co-authored-by: Fangjun Kuang <[email protected]>

* initial commit for zipformer tedlium * fix unk decoding * add pretrained model and logs * update for new AsrModel * add option for choosing rnnt type * add results with modified rnnt

* support testing onnx exported model on the test sets * use token_table instead

* Add start-batch option for RNNLM training * Also set epoch * Skip batches on load

@csukuangfj

* merge upstream * add SURT model and training * add libricss decoding * add chunk width randomization * decode SURT with libricss * initial commit for zipformer_ctc * remove unwanted changes * remove changes to other recipe * fix zipformer softlink * fix for JIT export * add missing file * fix symbolic links * update results * clean commit for SURT recipe * training libricss surt model * remove unwanted files * remove unwanted changes * remove changes in librispeech * change some files to symlinks * remove unwanted changes in utils * add export script * add README * minor fix in README * add assets for README * replace some files with symlinks * remove unused decoding methods * fix symlink * address comments from @csukuangfj

* add shallow fusion documentation * add documentation for LODR * upload docs for LM rescoring

* Fix for ci * Fix frame_reducer

* merge upstream * add SURT model and training * add libricss decoding * add chunk width randomization * decode SURT with libricss * initial commit for zipformer_ctc * remove unwanted changes * remove changes to other recipe * fix zipformer softlink * fix for JIT export * add missing file * fix symbolic links * update results * clean commit for SURT recipe * training libricss surt model * remove unwanted files * remove unwanted changes * remove changes in librispeech * change some files to symlinks * remove unwanted changes in utils * add export script * add README * minor fix in README * add assets for README * replace some files with symlinks * remove unused decoding methods * initial commit for SURT AMI recipe * fix symlink * add train + decode scripts * add missing symlink * change files to symlink * change file type

…fsa#1242)

* Init commit for recipes trained on multiple zh datasets. * fbank extraction for thchs30 * added support for aishell1 * added support for aishell-2 * fixes * fixes * fixes * added support for stcmds and primewords * fixes * added support for magicdata script for fbank computation not done yet * added script for magicdata fbank computation * file permission fixed * updated for the wenetspeech recipe * updated * Update preprocess_kespeech.py * updated * updated * updated * updated * file permission fixed * updated paths * fixes * added support for kespeech dev/test set fbank computation * fixes for file permission * refined support for KeSpeech * added scripts for BPE model training * updated * init commit for the multi_zh-cn zipformer recipe * disable speed perturbation by default * updated * updated * added necessary files for the zipformer recipe * removed redundant wenetspeech M and S sets * updates for multi dataset decoding * refined * formatting issues fixed * updated * minor fixes * this commit finalize the recipe (hopefully) * fixed formatting issues * minor fixes * updated * using soft links to reduce redundancy * minor updates * using soft links to reduce redundancy * minor updates * minor updates * using soft links to reduce redundancy * minor updates * Update README.md * minor updates * Update egs/multi_zh-hans/ASR/local/compute_fbank_magicdata.py Co-authored-by: Fangjun Kuang <[email protected]> * Update egs/multi_zh-hans/ASR/local/compute_fbank_magicdata.py Co-authored-by: Fangjun Kuang <[email protected]> * Update egs/multi_zh-hans/ASR/local/compute_fbank_stcmds.py Co-authored-by: Fangjun Kuang <[email protected]> * Update egs/multi_zh-hans/ASR/local/compute_fbank_stcmds.py Co-authored-by: Fangjun Kuang <[email protected]> * Update egs/multi_zh-hans/ASR/local/compute_fbank_primewords.py Co-authored-by: Fangjun Kuang <[email protected]> * Update egs/multi_zh-hans/ASR/local/compute_fbank_primewords.py Co-authored-by: Fangjun Kuang <[email protected]> * minor updates * minor fixes * fixed a formatting issue * Update preprocess_kespeech.py * Update prepare.sh * Update egs/multi_zh-hans/ASR/local/compute_fbank_kespeech_splits.py Co-authored-by: Fangjun Kuang <[email protected]> * Update egs/multi_zh-hans/ASR/local/preprocess_kespeech.py Co-authored-by: Fangjun Kuang <[email protected]> * removed redundant files * symlinks added * minor updates * added CI tests for `multi_zh-hans` * minor fixes * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh --------- Co-authored-by: Fangjun Kuang <[email protected]>

Co-authored-by: zss11 <[email protected]>

* Update conformer.py * Update zipformer.py fix bug in get_dynamic_dropout_rate

* Use torch.jit.script() to export the decoder model See also k2-fsa/sherpa-onnx#327

…#1269) * fixes for `diagnostics` Replace `2 ** 22` with `512` as the default value of `diagnostics.TensorDiagnosticOptions` also black formatted some scripts * fixed formatting issues

* formatted the entire librispeech recipe * minor updates

* add documentation for training an RNNLM

* typo fix

…#1244)

SarahSmitho and others added 30 commits June 7, 2023 11:17

verify have installed ffmpeg (k2-fsa#1117)

3ae47a4

Fix parameters_names in train.py (k2-fsa#1121)

dca21c2

Use symlinks for best epochs (k2-fsa#1123)

b4c38d7

* utils: add symlink_or_copyfile * pruned_transducer_stateless7: use symlinks (when possible) to output best epochs * Rename function --------- Co-authored-by: Yifan Yang <[email protected]>

add updated zipformer onnx export (k2-fsa#1108)

0cb71ad

Co-authored-by: Fangjun Kuang <[email protected]>

Add CTC loss option in zipformer recipe (k2-fsa#1111)

0ad037d

* add CTC loss option in zipformer recipe * add ctc_decode.py * support CTC model export, add jit_pretrained_ctc.py, pretrained_ctc.py * update README.md and RESULTS.md * add CI test

Fix running exported model on GPU. (k2-fsa#1131)

947f061

Fix Zipformer (k2-fsa#1132)

0a46579

* Update model.py * Update train.py * Update decoder.py

Fix for diagnostic (k2-fsa#1135)

d667dc3

* CTC loss return tensor * Update model.py

fix small typo (k2-fsa#1144)

4d5b836

Fix ONNX export of the latest streaming zipformer model. (k2-fsa#1148)

968ebd2

Zipformer for TedLium (k2-fsa#1125)

9c2172c

* initial commit for zipformer tedlium * fix unk decoding * add pretrained model and logs * update for new AsrModel * add option for choosing rnnt type * add results with modified rnnt

Support int8 quantization in decoder (k2-fsa#1152)

db71b03

Minor fix in tedlium results file (k2-fsa#1153)

c59c89f

support testing onnx exported model on the test sets (k2-fsa#1150)

ccd8c62

* support testing onnx exported model on the test sets * use token_table instead

zipformer2 logaddexp onnx safe (k2-fsa#1157)

98d8946

Fix logaddexp for ONNX export (k2-fsa#1158)

c3e23ec

Fix ONNX export for the latest non-streaming zipformer. (k2-fsa#1160)

9009d02

Add start-batch option for RNNLM training (k2-fsa#1161)

eca0202

* Add start-batch option for RNNLM training * Also set epoch * Skip batches on load

fixed default param for an aishell recipe (k2-fsa#1159)

856c0f2

Fix zipformer CI test (k2-fsa#1164)

b8a1794

Fix CI test for zipformer CTC (k2-fsa#1165)

130ad03

Fix failed CI tests (k2-fsa#1166)

6fd6743

Shallow fusion & LODR documentation (k2-fsa#1142)

11523c5

* add shallow fusion documentation * add documentation for LODR * upload docs for LM rescoring

Fix blank skip ci test (k2-fsa#1167)

ffe816e

* Fix for ci * Fix frame_reducer

add sym link (k2-fsa#1170)

5ed6fc0

removed batch_name to fix a KeyError with "uttid" (k2-fsa#1172)

4ab7d61

Add tests for subsample.py and fix typos (k2-fsa#1180)

1dbbd77

JinZr and others added 25 commits September 4, 2023 17:56

minor fixes (k2-fsa#1240)

9ef8145

doc str fixes (k2-fsa#1241)

d50a9ea

Update run-gigaspeech-pruned-transducer-stateless2-2022-05-12.sh (k2-…

c912bd6

…fsa#1242)

fixed a CI test issue related to python version (k2-fsa#1243)

49a4b67

enable sclite_mode for swbd scoring (k2-fsa#1239)

3199058

Fixes to incorporate with the latest Lhotse release (k2-fsa#1249)

7cc2dae

modify tal_csasr recipe (k2-fsa#1252)

fba1710

Co-authored-by: zss11 <[email protected]>

Minor fixes to the libricss recipe (k2-fsa#1256)

565d2c2

Fix typo in README.md (k2-fsa#1257)

0c564c6

fix thchs-30 download command (k2-fsa#1260)

7e1288a

Update decoder.py (k2-fsa#1262)

bbb03f7

Update conformer.py (k2-fsa#1200)

45d60ef

* Update conformer.py * Update zipformer.py fix bug in get_dynamic_dropout_rate

Fix CI tests (k2-fsa#1266)

f5dc957

Fix exporting decoder model to onnx (k2-fsa#1264)

34e40a8

* Use torch.jit.script() to export the decoder model See also k2-fsa/sherpa-onnx#327

fixes for init value of diagnostics.TensorDiagnosticOptions (k2-fsa…

ef658d6

…#1269) * fixes for `diagnostics` Replace `2 ** 22` with `512` as the default value of `diagnostics.TensorDiagnosticOptions` also black formatted some scripts * fixed formatting issues

formatted the entire LibriSpeech recipe (k2-fsa#1270)

ef5da48

* formatted the entire librispeech recipe * minor updates

Add documentation for RNNLM training (k2-fsa#1267)

97f9b9c

* add documentation for training an RNNLM

Fix docs for MVQ (k2-fsa#1272)

e17f884

* typo fix

added softlinks to local dir (k2-fsa#1273)

1b565dd

Support CTC decoding on CPU using OpenFst and kaldi decoders. (k2-fsa…

2318c3f

…#1244)

Copy HL decoding script to HLG decoding script

8e10ce0

Support HLG decoding using OpenFst with kaldi decoders

2fd0673

small fixes

043aafc

small fixes

f357d3f

csukuangfj added the ctc label Sep 27, 2023

Fix building HLG

edc37b0

csukuangfj added ctc and removed ctc labels Sep 27, 2023

support triggering CI manually

0ab247f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test #15

test #15

csukuangfj commented Sep 27, 2023

test #15

Are you sure you want to change the base?

test #15

Conversation

csukuangfj commented Sep 27, 2023