Export fixes for Riva #5496

borisfom · 2022-11-23T22:06:35Z

Signed-off-by: Boris Fomitchev [email protected]

What does this PR do ?

Fixes to make Riva networks export. Most of those are actually already in main.

Signed-off-by: Boris Fomitchev <[email protected]>

* Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]>

* Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]>

* Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Signed-off-by: shane carroll <[email protected]>

* Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Signed-off-by: Hainan Xu <[email protected]>

* Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]>

* update branch Signed-off-by: ericharper <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Xuesong Yang <[email protected]>

* update branch Signed-off-by: ericharper <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]>

* Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Signed-off-by: andrusenkoau <[email protected]>

* update branch Signed-off-by: ericharper <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Megatron Export Update (NVIDIA#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (NVIDIA#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 (NVIDIA#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA#5339) (NVIDIA#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * added set_start_method + function param bugfix (NVIDIA#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (NVIDIA#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: andrusenkoau <[email protected]>

* update branch Signed-off-by: ericharper <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]>

* Merge r1.13.0 main (#5570) * update branch Signed-off-by: ericharper <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Optimized loop and bugfix in SDE (#5573) - Fixed bug with loading custom data attributes from JSON in Speech Data Explorer Signed-off-by: George Zelenfroynd <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update torchmetrics (#5566) * add task arg Signed-off-by: nithinraok <[email protected]> * update state Signed-off-by: nithinraok <[email protected]> Signed-off-by: nithinraok <[email protected]> Co-authored-by: Taejin Park <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove useless files. (#5580) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add initial NFA code Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make use of the specified device during viterbi decoding Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL notes Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL warning Signed-off-by: Elena Rastorgueva <[email protected]> * Add an option to defer data setup from ``__init__`` to ``setup`` (#5569) * Add an option to defer dataloader setup from __init__ to setup Signed-off-by: Ante Jukić <[email protected]> * Updated doc Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make utt_id specified by number of parts of audio_filepath user wishes to use Signed-off-by: Elena Rastorgueva <[email protected]> * remove audio_sr TODO - reduce risk of silent bugs Signed-off-by: Elena Rastorgueva <[email protected]> * Add check that model is CTC Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Text generation improvement (UI client, data parallel support) (#5437) * Squashed commit of the following: commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 15:07:42 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 35b424044fe80c3081e7756ab21244f701716f7e Author: Yi Dong <[email protected]> Date: Thu Oct 13 08:04:49 2022 -0700 get rid of base Signed-off-by: Yi Dong <[email protected]> commit 2955210e2311791543538cfbb5ad26b79414c954 Merge: d52edef8c eaf6757ca Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:17:02 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit d52edef8cd7b36593838fb270047e80f8ccb652e Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:16:24 2022 +0000 align with main Signed-off-by: Yi Dong <[email protected]> commit eaf6757ca5be8e099492f57c81d984429b0ad49c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 13:12:11 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:10:58 2022 +0000 same as main Signed-off-by: Yi Dong <[email protected]> commit e335de51bcc0d681c58b568c3d8c238bc5687c3b Merge: c231086e0 4463a9fe9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:08:09 2022 +0000 Merge branch 'main' into universal_prompt commit c231086e057f1efaa915f691d84664cb3d5aad85 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 19:59:12 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:56:17 2022 +0000 default to pad Signed-off-by: Yi Dong <[email protected]> commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:55:44 2022 +0000 add the option to pad the tokens Signed-off-by: Yi Dong <[email protected]> commit 876dc395b43fdeeaa2bcbbe13c76523633764c33 Merge: fbb0f4035 fe3c77ee9 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:20:47 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392 Author: Yi Dong <[email protected]> Date: Wed Oct 12 18:59:49 2022 +0000 fix import again Signed-off-by: Yi Dong <[email protected]> commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 16:00:24 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937 Author: Yi Dong <[email protected]> Date: Wed Oct 12 15:58:32 2022 +0000 enable server Signed-off-by: Yi Dong <[email protected]> commit cbe05d9fbc978f812cfbb671f45f147f300713c4 Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:07:28 2022 +0000 fix comment error Signed-off-by: Yi Dong <[email protected]> commit 1948048922e726ec6131e44b1a745389f18d4ef2 Merge: 232c2cce3 984f5c09a Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:05:30 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 232c2cce34d7a8b902da406706f3dd9b39475091 Merge: 34c8a68df 658243fb6 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:50:00 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f Merge: 658243fb6 3fda5de46 Author: Yi Dong <[email protected]> Date: Wed Oct 12 08:42:11 2022 -0400 Merge branch 'main' into fix_global_init commit 658243fb6580191b5d60edd30cde16dcc23cbb85 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:40:57 2022 +0000 fix import error Signed-off-by: Yi Dong <[email protected]> commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c Author: Yi Dong <[email protected]> Date: Tue Oct 11 22:44:12 2022 +0000 update the fused kernel Signed-off-by: Yi Dong <[email protected]> commit 536cf6bef9447b75843fad630729c47a2fba35f3 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:44:52 2022 -0700 add the missing file Signed-off-by: Yi Dong <[email protected]> commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:43:14 2022 -0700 fix fused softmax Signed-off-by: Yi Dong <[email protected]> commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:16:48 2022 -0700 move global step to base Signed-off-by: Yi Dong <[email protected]> commit 34c8a68df084b18d377e84415d9f07b2cd6673dd Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:50:11 2022 +0000 fix pipeline for eval Signed-off-by: Yi Dong <[email protected]> commit eee5d38218f26660c3ffebe9f615c850c80a1f0d Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:48:22 2022 +0000 fix for pipleline parallel Signed-off-by: Yi Dong <[email protected]> commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d Merge: 125e49947 e3b4c4d1f Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:13 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 125e4994760448ff75dd9328395813eda1c87547 Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:04 2022 +0000 add share option Signed-off-by: Yi Dong <[email protected]> commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368 Author: Yi Dong <[email protected]> Date: Wed Oct 5 11:43:48 2022 -0700 make sure consolidation works Signed-off-by: Yi Dong <[email protected]> commit a5c833964ecf05dc460ca1da69275c4019742150 Merge: 2a07ab52d abcb74be2 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:29 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 2a07ab52d95f15ba666823028c69e23825666c05 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:23 2022 +0000 added requirement Signed-off-by: Yi Dong <[email protected]> commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:39:42 2022 +0000 added a simple web server Signed-off-by: Yi Dong <[email protected]> commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b Author: Yi Dong <[email protected]> Date: Wed Oct 5 06:54:12 2022 -0700 fix empty val loss Signed-off-by: Yi Dong <[email protected]> commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26 Author: Yi Dong <[email protected]> Date: Tue Oct 4 19:25:30 2022 -0700 text gen working Signed-off-by: Yi Dong <[email protected]> commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba Author: Yi Dong <[email protected]> Date: Tue Oct 4 16:08:40 2022 -0700 first change Signed-off-by: Yi Dong <[email protected]> commit 59d077585e6962a669b824af58f64e8a0bea6547 Author: Yi Dong <[email protected]> Date: Tue Oct 4 15:00:40 2022 -0700 revert Signed-off-by: Yi Dong <[email protected]> commit 12a0f3902d99e9179403644bd951c045df716ca7 Author: Yi Dong <[email protected]> Date: Tue Oct 4 21:26:23 2022 +0000 init imp Signed-off-by: Yi Dong <[email protected]> commit 62a15dfd943cc48be495ac61b9f2f00995775c5f Merge: 82c90d2cd e0cc6b767 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:58:26 2022 -0700 Merge branch 'main' into universal_prompt commit 82c90d2cd0fd156f16a4b899f8c741d598f33990 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:17:13 2022 -0700 add sync Signed-off-by: Yi Dong <[email protected]> commit 9819b703eef877d90cd1257bf3610c69de9b4d7e Author: Yi Dong <[email protected]> Date: Sun Oct 2 17:52:34 2022 -0700 fix save model Signed-off-by: root <[email protected]> commit e4937e2fc5fb7d70754c97668416e4a69c3079fe Author: Yi Dong <[email protected]> Date: Sat Oct 1 18:56:09 2022 +0000 working Signed-off-by: Yi Dong <[email protected]> commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b Author: Yi Dong <[email protected]> Date: Sat Oct 1 17:34:03 2022 +0000 calcuate the mask Signed-off-by: Yi Dong <[email protected]> commit 9db3bc13eb65a94a475b837603351da68e3745bc Author: Yi Dong <[email protected]> Date: Fri Sep 30 23:26:32 2022 +0000 fix bug in datasets Signed-off-by: Yi Dong <[email protected]> commit f289900375d4412f53f8110be00fec6587627550 Author: Yi Dong <[email protected]> Date: Fri Sep 30 22:29:40 2022 +0000 update the code Signed-off-by: Yi Dong <[email protected]> commit 8e28a1f208aabaab72dbe769e72756baada04d99 Author: Yi Dong <[email protected]> Date: Fri Sep 30 21:52:52 2022 +0000 added new ds Signed-off-by: Yi Dong <[email protected]> commit 8d41315bab7ce90e200a8a7d1023c34f8e046897 Author: Yi Dong <[email protected]> Date: Fri Sep 30 18:57:09 2022 +0000 added new files Signed-off-by: Yi Dong <[email protected]> commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f Merge: cbe8b7ab1 fa6cd8588 Author: Yi Dong <[email protected]> Date: Thu Sep 29 21:43:29 2022 +0000 Merge branch 'llm-prompt-learning-improvements' into universal_prompt commit fa6cd858839277939446afe7275976078d54c512 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Sep 29 16:47:30 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0 Merge: 7d6d46742 8d670bc77 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 09:43:27 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 7d6d46742170a66758287a207d67e1b1bfd15613 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 16:42:43 2022 +0000 Removed inference step and added sentence peice check to predict step Signed-off-by: Virginia Adams <[email protected]> commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 15:26:32 2022 +0000 fixed first stage check for pipeline parallel T5 pt Signed-off-by: Virginia Adams <[email protected]> commit 3637be2b258c8d9028856f9971edb7da4a8121f0 Merge: a3ea722fd 986a76612 Author: Virginia Adams <[email protected]> Date: Wed Sep 28 10:23:30 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4 Merge: 770967a52 971485ce7 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 13:35:52 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 770967a5251a474b6dcc2d44bf9a2076adbcb604 Merge: d23bf6c30 e3ac280a8 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:17:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit d23bf6c30acc0e3f6af9b4e24547669866a34d62 Merge: de6a31651 333d2b749 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:05:16 2022 -0700 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit de6a31651e63d88a42b971794d93f18ff5a3cdff Author: Virginia Adams <[email protected]> Date: Mon Sep 26 17:00:53 2022 +0000 Updated PP check to be on first stage pipeline only Signed-off-by: Virginia Adams <[email protected]> commit 333d2b7498e6742ce66436f733c980a74616900c Merge: 592c0986a a39fc925a Author: Virginia Adams <[email protected]> Date: Fri Sep 23 16:11:21 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 592c0986a476a91b57b8605d7b70830d7acfa021 Author: Virginia Adams <[email protected]> Date: Fri Sep 23 23:08:41 2022 +0000 Fixed unused import and CI test bug Signed-off-by: Virginia Adams <[email protected]> commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455 Merge: ce4b72c8c b566c2d0e Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:25 2022 +0000 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit ce4b72c8c52f32be336e323dd78a38089edc3e7c Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:16 2022 +0000 Switch to import from base class Signed-off-by: Virginia Adams <[email protected]> commit b566c2d0e35a068f758fd1310bc620a47be4590b Merge: 6621f2854 e872061ac Author: Virginia Adams <[email protected]> Date: Fri Sep 23 10:09:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 6621f28543828a48484a5637f6c9f3ccb23a5b02 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:47:35 2022 +0000 python format fix Signed-off-by: Virginia Adams <[email protected]> commit 8deafc8987b6af5f7b99a250310f57a40198c37f Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:28:02 2022 +0000 Save .nemo on new best val score Signed-off-by: Virginia Adams <[email protected]> commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 18:03:19 2022 +0000 Added automatic checkpoint to nemo file method Signed-off-by: Virginia Adams <[email protected]> commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b Author: Virginia Adams <[email protected]> Date: Wed Sep 14 02:11:56 2022 +0000 Make GPT use base prompt learning model class: Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yi Dong <[email protected]> * fix LGTM Signed-off-by: Yi Dong <[email protected]> * fix validation Signed-off-by: Yi Dong <[email protected]> * change for the lm eval Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * make text generation work in data parallel environment Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * implement the service with rest service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * surpress log Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Update config Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Restore function needed for NMT Signed-off-by: MaximumEntropy <[email protected]> * handles no answer only Signed-off-by: Yi Dong <[email protected]> * Fix config Signed-off-by: MaximumEntropy <[email protected]> * added knn to web Signed-off-by: Yi Dong <[email protected]> * fix lgtm.com comments Signed-off-by: Yi Dong <[email protected]> * output the retrieved context Signed-off-by: Yi Dong <[email protected]> * allow no neighbor query Signed-off-by: Yi Dong <[email protected]> * remove the imports Signed-off-by: Yi Dong <[email protected]> * warn only once Signed-off-by: Yi Dong <[email protected]> * Change output file format from JSON to JSONL Signed-off-by: MaximumEntropy <[email protected]> * new t0 dataset Signed-off-by: Yi Dong <[email protected]> * Add T0 data preproc scripts Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Merge and multiprocessing Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix for is_correct Signed-off-by: MaximumEntropy <[email protected]> * fix epoch > 2 Signed-off-by: Yi Dong <[email protected]> * handles multiple dataloader Signed-off-by: Yi Dong <[email protected]> * remove template Signed-off-by: Yi Dong <[email protected]> * Refactor T0 dataset Signed-off-by: MaximumEntropy <[email protected]> * Add script to merge train folder into individual training files to minimize number of blends Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added on the fly service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add combo instance Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added combo service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * send weights back to server Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix index store Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Minor changes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add reset button Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add add eos Signed-off-by: Yi Dong <[email protected]> * use a seperate bert service Signed-off-by: Yi Dong <[email protected]> * no loss of accuracy Signed-off-by: Yi Dong <[email protected]> * pin the gradio version Signed-off-by: Yi Dong <[email protected]> * Remove bin compat Signed-off-by: MaximumEntropy <[email protected]> * Fix header lines Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * evaluate based on text generation Signed-off-by: Yi Dong <[email protected]> * exact match result aggregation Signed-off-by: Yi Dong <[email protected]> * working SP and SA Signed-off-by: Yi Dong <[email protected]> * sync Signed-off-by: Yi Dong <[email protected]> * fix checkpoint Signed-off-by: Yi Dong <[email protected]> * fix eval Signed-off-by: Yi Dong <[email protected]> * backup states Signed-off-by: Yi Dong <[email protected]> * backup states reset Signed-off-by: Yi Dong <[email protected]> * fix the bug Signed-off-by: Yi Dong <[email protected]> * fix evaluation for sentence piece Signed-off-by: Yi Dong <[email protected]> * fix a bug Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * potential fix in the future Signed-off-by: Yi Dong <[email protected]> * remove the universal codes Signed-off-by: Yi Dong <[email protected]> * remove universal strategy Signed-off-by: Yi Dong <[email protected]> * address reviewer comment Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add align function docstrings and make most args optional Signed-off-by: Elena Rastorgueva <[email protected]> * Remove redundant returns of viterbi and log probs matrices Signed-off-by: Elena Rastorgueva <[email protected]> * Rename h# to <initial_silence> Signed-off-by: Elena Rastorgueva <[email protected]> * Update manifest format description in README Signed-off-by: Elena Rastorgueva <[email protected]> * always remove any spaces from utt_id Signed-off-by: Elena Rastorgueva <[email protected]> * Patch the hanging of threads on very large stderr (#5589) (#5590) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * O2 style amp for gpt3 ptuning (#5246) * enable amp o2 plugin Signed-off-by: Jimmy Zhang <[email protected]> * only create master param if param requires gradient Signed-off-by: Jimmy Zhang <[email protected]> * remove pytorch autocast Signed-off-by: Jimmy Zhang <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Jimmy Zhang <[email protected]> * Update optimizer_with_main_params.py Signed-off-by: JimmyZhang12 <[email protected]> * create master grad only if param group requires grad Signed-off-by: Jimmy Zhang <[email protected]> * fix grad scaler for pp > 1 Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: JimmyZhang12 <[email protected]> Co-authored-by: Jimmy Zhang <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Better patch hydra (#5591) (#5592) * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Yet another fix with hydra multirun (#5594) (#5595) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add RETRO model documentation (#5578) * added retro doc Signed-off-by: Yi Dong <[email protected]> * finish data part Signed-off-by: Yi Dong <[email protected]> * added the data format Signed-off-by: Yi Dong <[email protected]> * added training script Signed-off-by: Yi Dong <[email protected]> * added training and evaluation steps Signed-off-by: Yi Dong <[email protected]> * edit the text Signed-off-by: Yi Dong <[email protected]> * added the images Signed-off-by: Yi Dong <[email protected]> * fix beginning Signed-off-by: Yi Dong <[email protected]> * fix the grammar Signed-off-by: Yi Dong <[email protected]> * trim it down Signed-off-by: Yi Dong <[email protected]> * add wandb option Signed-off-by: Yi Dong <[email protected]> * add reference Signed-off-by: Yi Dong <[email protected]> * fix path Signed-off-by: Yi Dong <[email protected]> * added the parameters table Signed-off-by: Yi Dong <[email protected]> * fix section Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix: setup_multiple validation/test data (#5585) Fix: setup_multiple validation/test data (#5585) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move to optimizer based EMA implementation (#5169) * Move to optimizer Signed-off-by: SeanNaren <[email protected]> * Fix replacing weights Signed-off-by: SeanNaren <[email protected]> * Allow swapping of weights be optional Signed-off-by: SeanNaren <[email protected]> * Save 2 models Signed-off-by: SeanNaren <[email protected]> * Use different hook Signed-off-by: SeanNaren <[email protected]> * Expose cpu device Signed-off-by: SeanNaren <[email protected]> * Add clause to see if this fixes issue with O2 optimizer Signed-off-by: SeanNaren <[email protected]> * Try to get O2 working Signed-off-by: SeanNaren <[email protected]> * WIP Signed-off-by: SeanNaren <[email protected]> * Fixes Signed-off-by: SeanNaren <[email protected]> * Fixes to tests Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Remove import Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Add comment Signed-off-by: SeanNaren <[email protected]> * Remove overwrite Signed-off-by: SeanNaren <[email protected]> * Add BatchNorm, currently tests fail Signed-off-by: SeanNaren <[email protected]> * Fix tests/functionality for batch norm Signed-off-by: SeanNaren <[email protected]> * Get rid of NLP changes Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * AIStore for ASR datasets (#5462) AIStore for ASR datasets Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add support for MHA adapters to ASR (#5396) * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * Temporary fixes to new signature of mixin Signed-off-by: smajumdar <[email protected]> * Add adapter util for constants, add all mha adapters. Signed-off-by: smajumdar <[email protected]> * Update name of function Signed-off-by: smajumdar <[email protected]> * Roll back changes to convASR Signed-off-by: smajumdar <[email protected]> * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * First draft of Conformer support for MHA attention Signed-off-by: smajumdar <[email protected]> * Add some preliminary tests Signed-off-by: smajumdar <[email protected]> * Add support for projection of the hidden dimension for attention Signed-off-by: smajumdar <[email protected]> * Add support for squeezeformer Signed-off-by: smajumdar <[email protected]> * Update train adapter config Signed-off-by: smajumdar <[email protected]> * Add tests for squeezeformer and unit tests for new modules Signed-off-by: smajumdar <[email protected]> * Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path Signed-off-by: smajumdar <[email protected]> * Update location of adapters Signed-off-by: smajumdar <[email protected]> * Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v Signed-off-by: smajumdar <[email protected]> * Update expmanager to clean up checkpoints Signed-off-by: smajumdar <[email protected]> * Fix style Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update training scripts Signed-off-by: smajumdar <[email protected]> * Update config and docs Signed-off-by: smajumdar <[email protected]> * Expose nemo delete function Signed-off-by: smajumdar <[email protected]> * Correct adapter partial state saving Signed-off-by: smajumdar <[email protected]> * Correct a bug with state management of adapter tokens Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Pull down EMA test Signed-off-by: smajumdar <[email protected]> * Correct name of adapter module utility class Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Co-authored-by: Jocelyn <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Create separator parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Call align function with hydra config Signed-off-by: Elena Rastorgueva <[email protected]> * update usage example Signed-off-by: Elena Rastorgueva <[email protected]> * Update Dockerfile (#5614) (#5616) Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future. Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Co-authored-by: He Huang (Steve) <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate pretrained_name and model_path parameters Signed-off-by: Elena Rastorgueva <[email protected]> * make "optional" tags bold in markdown Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move non-main functions to utils dir Signed-off-by: Elena Rastorgueva <[email protected]> * Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615) Signed-off-by: Ante Jukić <[email protected]> Co-authored-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] fix ranges of char set for accented letters. (#5607) * [TTS] fix ranges of char set for accented letters. * remove digits pattern and added unit tests for math operators. Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Change success message to reduce confusion (#5621) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update documentation and tutorials for Adapters (#5610) * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update test Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Rename test file Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602) * [TTS] add type hints and change variable names for tokenizers and g2p Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * 1. Added missing import for gather_objects. (#5627) Signed-off-by: Micha Livne <[email protected]> Signed-off-by: Micha Livne <[email protected]> Co-authored-by: Micha Livne <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fixed RadTTS unit test (#5572) Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove tests (#5633) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Conformer local attention (#5525) * local attn and merge Signed-off-by: sam1373 <[email protected]> * optional Signed-off-by: sam1373 <[email protected]> * override Signed-off-by: sam1373 <[email protected]> * incorporate comments Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> * fix Signed-off-by: sam1373 <[email protected]> * comment Signed-off-by: sam1373 <[email protected]> * changes, test Signed-off-by: sam1373 <[email protected]> * changes Signed-off-by: sam1373 <[email protected]> * check att context Signed-off-by: sam1373 <[email protected]> * readme link Signed-off-by: sam1373 <[email protected]> * utils Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> Signed-off-by: sam1373 <[email protected]> Signed-off-by: Samuel Kriman <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add core classes and functions for online clustering diarizer part 1 (#5526) * Add core classes and functions for online clustering diarizer Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add audio to labels code Signed-off-by: Taejin Park <[email protected]> * resolve type errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added unit=tests for very short audio Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Filled all missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved conflict and added missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixed unit-test errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix the wrongly added file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * Fix wrongly included file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * resolve code quality issue Signed-off-by: Taejin Park <[email protected]> * Fixed unit-test errors and bugs Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * changed total_sec for offline_clustering toy_data in unit-tests Signed-off-by: Taejin Park <[email protected]> * fixed merging index offset bug Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * only including part 1 files Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * removed unused function Signed-off-by: Taejin Park <[email protected]> * fixed unused imports Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * divided nmesc_clustering.py into two and reflected first-pass comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * adding offline/online_clustering.py Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix code QL autocomment Signed-off-by: Taejin Park <[email protected]> * Removed unused imports Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/online_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Reflected comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved code scanning issue Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/offline_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641) * add stt_eo_conformer_ctc_large model * stt_eo_conformer_transducer_large Co-authored-by: Andrei Andrusenko <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Removed unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Specify that filepaths need to be absolute Signed-off-by: Elena Rastorgueva <[email protected]> * replaces any spaces in utt_id with dashes Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make hydra script callable by another script Signed-off-by: Elena Rastorgueva <[email protected]> * do not specify default model or model_downsample_factor Signed-off-by: Elena Rastorgueva <[email protected]> * [Dockerfile] Remove AIS archive from docker image (#5629) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Measure audio_sr from audio instead of needing to specify Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541) * Chinese TTS replaces default pypinyin dict * Add jieba word segmenter as an option Signed-off-by: Yuekai Zhang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate parameters for device of transcription and viterbi steps Signed-off-by: Elena Rastorgueva <[email protected]> * Add mention of gecko Signed-off-by: Elena Rastorgueva <[email protected]> * [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [Add] ASR+VAD Inference Pipeline (#5575) Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets. Signed-off-by: stevehuang52 <[email protected]> Co-authored-by: fayejf <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * rename separator to ctm_grouping_separator and refactor Signed-off-by: Elena Rastorgueva <[email protected]> * Bert interleaved (#5556) * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * Update bert_model.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * Adding tests * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * Fix bug fix sequence parallel and Interleaved * Fix bug fix sequence parallel and Interleaved Signed-off-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add duration padding support for RADTTS inference (#5650) * Added duration padding support for RADTTS inference * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Kevin Shih <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Add remove_blank_tokens_from_ctm parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Dont save initial_silence line in CTM Signed-off-by: Elena Rastorgueva <[email protected]> * Add DLLogger support to exp_manager (#5658) * Add DLLogger support to exp_manager Signed-off-by: Alexandre Milesi <[email protected]> * Move dllogger to separate file and check import Signed-off-by: Alexandre Milesi <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Remove unused import Signed-off-by: Alexandre Milesi <[email protected]> Signed-off-by: Alexandre Milesi <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add minimum_timestamp_duration parameter Signed-off-by: Elena Rastorgueva <[email protected]> * add suggestion about removing blanks to README Signed-off-by: Elena Rastorgueva <[email protected]> * reorder args Signed-off-by: Elena Rastorgueva <[email protected]> * clarify description of ctm_grouping_separator in README Signed-off-by: Elena Rastorgueva <[email protected]> * update docstring Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Port stateless timer to exp manager (#5584) * Port stateless timer to exp manager Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes and remove from all megatron code Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Change message Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix EMA restart by allowing device to be set by the class init (#5668) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP (moved to separate repo) - merge to main (#5630) * Remove sdp files from tools folder Signed-off-by: Elena Rastorgueva <[email protected]> * Add page to docs with new SDP location Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add interface for making amax reduction optional for FP8 (#5447) * add TE interface for making amax reduction optional Signed-off-by: Kirthi Shankar Sivamani <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Kirthi Shankar Sivamani <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add tts dict cust notebook (#5662) * add tts dict cust notebook Signed-off-by: ekmb <[email protected]> * review Signed-off-by: ekmb <[email protected]> * fixed audio links Signed-off-by: ekmb <[email protected]> * remove old notebook Signed-off-by: ekmb <[email protected]> * fix typo Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [ASR] Audio processing base, multi-channel enhancement models (#5356) * Audio processing base model, enc-mask-dec enhancement, tests and modules Signed-off-by: Ante Jukić <[email protected]> * Addressed review comments Signed-off-by: Ante Jukić <[email protected]> * Fixed CodeQL warnings Signed-off-by: Ante Jukić <[email protected]> * Addressed PR comments Signed-off-by: Ante Jukić <[email protected]> * Addressed PR comments: - renamed AudioProcessingModel to AudioToAudioModel - various small modifications - updated unit tests Signed-off-by: Ante Jukić <[email protected]> * Addressed comments - Moved spectrogram to audio_preprocessing - Renamed MultichannelFeatures - Updated config and unit tests Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Expose ClusteringDiarizer device (#5681) * Expose device for users to set Signed-off-by: SeanNaren <[email protected]> * Expose device for users to set Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add Beam Search support to ASR transcribe() (#5443) * Add support for beam decoding via high level API. Signed-off-by: smajumdar <[email protected]> * Add ctc decoding section Signed-off-by: smajumdar <[email protected]> * Update ctc transcribe API to return results from beam search Signed-off-by: smajumdar <[email protected]> * Add argument to preserve arpa file Signed-off-by: smajumdar <[email protected]> * Update script to use hydra config, add some support for future compute timesteps, add doc for ctc decoding Signed-off-by: smajumdar <[email protected]> * Update eval script and doc to use new API Signed-off-by: smajumdar <[email protected]> * Add tests for ctc greedy decoding Signed-off-by: smajumdar <[email protected]> * Address reviewer comments and add docstrings Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix changes and address comments Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: Samuel Kriman <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Propagate attention_dropout flag for GPT-3 (#5669) * Propagate attention_dropout flag for GPT-3 Signed-off-by: Mikołaj Błaż <[email protected]> * Add default to megatron_gpt_config Signed-off-by: Mikołaj Błaż <[email protected]> Signed-off-by: Mikołaj Błaż <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <complex451@gmail…

* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <[email protected]> * remove cmudict downloading Signed-off-by: Oktai Tatanov <[email protected]> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <[email protected]> * add libritts processing Signed-off-by: Oktai Tatanov <[email protected]> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <[email protected]> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <[email protected]> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <[email protected]> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <[email protected]> * fix optimizers Signed-off-by: Oktai Tatanov <[email protected]> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <[email protected]> Co-authored-by: Ryan Hong <[email protected]> Co-authored-by: [email protected] <[email protected]> Co-authored-by: Jason <[email protected]> Signed-off-by: Jason <[email protected]> * make new commit Signed-off-by: Jason <[email protected]> * add copyright headers Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * rename README Signed-off-by: Oktai Tatanov <[email protected]> * fix style without vits_modules Signed-off-by: Oktai Tatanov <[email protected]> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <[email protected]> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <[email protected]> * update package info Signed-off-by: ericharper <[email protected]> * new exp * update branch Signed-off-by: ericharper <[email protected]> * Restored tests previously disabled for 22.03 base (#4109) Signed-off-by: Boris Fomitchev <[email protected]> * add augmentation to label models (#4113) * add augmentation to label models Signed-off-by: nithinraok <[email protected]> * duration fix Signed-off-by: nithinraok <[email protected]> * Call register_bert_model after assigning self.bert_model variable (#4116) Signed-off-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> * Tutorial on ITN with Thutmose tagger and small fixes (#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <[email protected]> * fixes for code review Signed-off-by: Alexandra Antonova <[email protected]> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <[email protected]> * fix typo Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <[email protected]> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <[email protected]> * update the default (#4135) Signed-off-by: ekmb <[email protected]> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <[email protected]> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <[email protected]> * Fix typo Signed-off-by: PeganovAnton <[email protected]> * Fix typo Signed-off-by: PeganovAnton <[email protected]> * Do not create tmp directory Signed-off-by: PeganovAnton <[email protected]> * Fix parameter name Signed-off-by: PeganovAnton <[email protected]> * finish cherry-pick op Signed-off-by: PeganovAnton <[email protected]> * Fix labels errors Signed-off-by: PeganovAnton <[email protected]> * Remove duplicate stage Signed-off-by: PeganovAnton <[email protected]> * Change target branch Signed-off-by: PeganovAnton <[email protected]> * fix typo (#4140) Signed-off-by: Yang Zhang <[email protected]> * Fix/punctuation avoid overwritting tmp files (#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <[email protected]> * Remove accidental changes Signed-off-by: PeganovAnton <[email protected]> * Remove accidental changes Signed-off-by: PeganovAnton <[email protected]> * Use built-in tempfile library Signed-off-by: PeganovAnton <[email protected]> * Fix code style Signed-off-by: PeganovAnton <[email protected]> * bug_fix_diarization_manifest_creation (#4125) Signed-off-by: Yang Zhang <[email protected]> Co-authored-by: Nithin Rao <[email protected]> * fix doc (#4146) Signed-off-by: Yang Zhang <[email protected]> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <[email protected]> * Fix for new TTSDataset class Signed-off-by: treacker <[email protected]> * added wandb logging Signed-off-by: treacker <[email protected]> * added wandb logging Signed-off-by: treacker <[email protected]> * fix numpy version Signed-off-by: treacker <[email protected]> * fix numpy version Signed-off-by: treacker <[email protected]> * inference fix Signed-off-by: treacker <[email protected]> * removed old code Signed-off-by: treacker <[email protected]> * updated parser logic Signed-off-by: treacker <[email protected]> * reverted version update Signed-off-by: treacker <[email protected]> * refactored parser logic Signed-off-by: treacker <[email protected]> * Updated Jenkinsfile Signed-off-by: treacker <[email protected]> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <[email protected]> * Made backward compatibility Signed-off-by: treacker <[email protected]> * Made backward compatibility Signed-off-by: treacker <[email protected]> * Update Jenkinsfile Signed-off-by: treacker <[email protected]> * Update tacotron.yaml Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <[email protected]> * fix typo Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: treacker <[email protected]> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <[email protected]> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Fixed jenkins Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <[email protected]> * start fix Signed-off-by: nithinraok <[email protected]> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <[email protected]> * updated scp to filelist Signed-off-by: nithinraok <[email protected]> * WaveGlow input type fixes (#4151) Signed-off-by: Jocelyn Huang <[email protected]> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <[email protected]> * remove extremely unreliable links Signed-off-by: fayejf <[email protected]> * Thutmose tagger bug fixes (#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <[email protected]> * fix model location Signed-off-by: Alexandra Antonova <[email protected]> * fix model location Signed-off-by: Alexandra Antonova <[email protected]> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <[email protected]> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <[email protected]> * changed model location to nemo Signed-off-by: Alexandra Antonova <[email protected]> * style fix Signed-off-by: Alexandra Antonova <[email protected]> * fix space Signed-off-by: Alexandra Antonova <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <[email protected]> * chunks -> segments Signed-off-by: nithinraok <[email protected]> * Khz -> kHz Signed-off-by: nithinraok <[email protected]> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <[email protected]> * Fix link to inference notebook (#5247) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> * Update ASR scores table (#5254) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * Fix links to speaker identification notebook (#5260) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> * Minor typo fixes in TTS tutorial (#5266) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> * Pcla tutorial fixes (#5271) * Fixed typos Signed-off-by: Matvei Novikov <[email protected]> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <[email protected]> * Fixed typo Signed-off-by: Matvei Novikov <[email protected]> * Fixed branch variable Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Fix bug into Dialogue tutorial (#5277) * Typo fix (#5288) Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Fix dialogue tutorial bug (#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (#5310) * typo fix Signed-off-by: fayejf <[email protected]> * udpate transcribe Signed-off-by: fayejf <[email protected]> Signed-off-by: fayejf <[email protected]> * Add italian model checkpoints (#5316) Signed-off-by: Igor Gitman <[email protected]> Signed-off-by: Igor Gitman <[email protected]> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <[email protected]> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5313) * fixes Signed-off-by: Matvei Novikov <[email protected]> * fixes Signed-off-by: Matvei Novikov <[email protected]> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (#5359) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * tutorial fixes (#5354) Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Add SDP documentation (#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <[email protected]> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <[email protected]> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <[email protected]> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <[email protected]> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <[email protected]> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <[email protected]> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <[email protected]> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <[email protected]> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <[email protected]> * Make config example less confusing Signed-off-by: Elena Rastorgueva <[email protected]> * Fix typo Signed-off-by: Elena Rastorgueva <[email protected]> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused imports Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <[email protected]> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Co-authored-by: Igor Gitman <[email protected]> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <[email protected]> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) * Fixed bug in notebook (#5382) Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Virginia Adams <[email protected]> * Force MHA QKV onto fp32 (#5391) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * Added scheduling variety * ref * Fix for prompt table restore error (#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <[email protected]> * Added more saftey checks Signed-off-by: Virginia Adams <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (#5421) Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * disable pc test (#5426) Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> * Fix GPT generation when using sentencepiece tokenizer (#5413) * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <[email protected]> * Fix in config Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (#5242) (#5446) Signed-off-by: Vahid <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> * Set sync_batch_comm in other places (#5448) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 plus (#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Add num layers check (#5470) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Change to kwargs (#5475) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * minor bug fix (#5521) Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * Remove broadcast (#5558) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * cleaning * Fix all gather while writing to a file during T5 finetuning (#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <[email protected]> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <[email protected]> * storing * updated version Signed-off-by: Evgeniy Shabalin <[email protected]> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <[email protected]> * fixed issues Signed-off-by: Evgeniy Shabalin <[email protected]> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <[email protected]> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <[email protected]> * remove old cython code Signed-off-by: Evgeniy Shabalin <[email protected]> * remove old cython code Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <[email protected]> * Enhancements Signed-off-by: Evgeniy Shabalin <[email protected]> * Enhancements Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <[email protected]> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <[email protected]> * typecheck fix Signed-off-by: Evgeniy Shabalin <[email protected]> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <[email protected]> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <[email protected]> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <[email protected]> Signed-off-by: Jason <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: nithinraok <[email protected]> Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: PeganovAnton <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: fayejf <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Igor Gitman <[email protected]> Signed-off-by: Sasha Meister <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yu Yao <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Evgeniy Shabalin <[email protected]> Co-authored-by: jasonjjl1999 <[email protected]> Co-authored-by: [email protected] <[email protected]> Co-authored-by: Oktai Tatanov <[email protected]> Co-authored-by: jasonjjl1999 <[email protected]> Co-authored-by: martynwei <[email protected]> Co-authored-by: Ryan Hong <[email protected]> Co-authored-by: Jason <[email protected]> Co-authored-by: ericharper <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: bene-ges <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Evelina <[email protected]> Co-authored-by: PeganovAnton <[email protected]> Co-authored-by: Jocelyn <[email protected]> Co-authored-by: fayejf <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Sean Naren <[email protected]> Co-authored-by: Matvei Novikov <[email protected]> Co-authored-by: Zhilin Wang <[email protected]> Co-authored-by: Igor Gitman <[email protected]> Co-authored-by: Sasha Meister <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: Taejin Park <[email protected]> Co-authored-by: Adi Renduchintala <[email protected]> Co-authored-by: Virginia Adams <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Vladimir Bataev <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Co-authored-by: Xuesong Yang <[email protected]>

* Merge r1.13.0 main (#5570) * update branch Signed-off-by: ericharper <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Optimized loop and bugfix in SDE (#5573) - Fixed bug with loading custom data attributes from JSON in Speech Data Explorer Signed-off-by: George Zelenfroynd <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update torchmetrics (#5566) * add task arg Signed-off-by: nithinraok <[email protected]> * update state Signed-off-by: nithinraok <[email protected]> Signed-off-by: nithinraok <[email protected]> Co-authored-by: Taejin Park <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove useless files. (#5580) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add initial NFA code Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make use of the specified device during viterbi decoding Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL notes Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL warning Signed-off-by: Elena Rastorgueva <[email protected]> * Add an option to defer data setup from ``__init__`` to ``setup`` (#5569) * Add an option to defer dataloader setup from __init__ to setup Signed-off-by: Ante Jukić <[email protected]> * Updated doc Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make utt_id specified by number of parts of audio_filepath user wishes to use Signed-off-by: Elena Rastorgueva <[email protected]> * remove audio_sr TODO - reduce risk of silent bugs Signed-off-by: Elena Rastorgueva <[email protected]> * Add check that model is CTC Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Text generation improvement (UI client, data parallel support) (#5437) * Squashed commit of the following: commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 15:07:42 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 35b424044fe80c3081e7756ab21244f701716f7e Author: Yi Dong <[email protected]> Date: Thu Oct 13 08:04:49 2022 -0700 get rid of base Signed-off-by: Yi Dong <[email protected]> commit 2955210e2311791543538cfbb5ad26b79414c954 Merge: d52edef8c eaf6757ca Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:17:02 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit d52edef8cd7b36593838fb270047e80f8ccb652e Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:16:24 2022 +0000 align with main Signed-off-by: Yi Dong <[email protected]> commit eaf6757ca5be8e099492f57c81d984429b0ad49c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 13:12:11 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:10:58 2022 +0000 same as main Signed-off-by: Yi Dong <[email protected]> commit e335de51bcc0d681c58b568c3d8c238bc5687c3b Merge: c231086e0 4463a9fe9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:08:09 2022 +0000 Merge branch 'main' into universal_prompt commit c231086e057f1efaa915f691d84664cb3d5aad85 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 19:59:12 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:56:17 2022 +0000 default to pad Signed-off-by: Yi Dong <[email protected]> commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:55:44 2022 +0000 add the option to pad the tokens Signed-off-by: Yi Dong <[email protected]> commit 876dc395b43fdeeaa2bcbbe13c76523633764c33 Merge: fbb0f4035 fe3c77ee9 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:20:47 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392 Author: Yi Dong <[email protected]> Date: Wed Oct 12 18:59:49 2022 +0000 fix import again Signed-off-by: Yi Dong <[email protected]> commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 16:00:24 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937 Author: Yi Dong <[email protected]> Date: Wed Oct 12 15:58:32 2022 +0000 enable server Signed-off-by: Yi Dong <[email protected]> commit cbe05d9fbc978f812cfbb671f45f147f300713c4 Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:07:28 2022 +0000 fix comment error Signed-off-by: Yi Dong <[email protected]> commit 1948048922e726ec6131e44b1a745389f18d4ef2 Merge: 232c2cce3 984f5c09a Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:05:30 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 232c2cce34d7a8b902da406706f3dd9b39475091 Merge: 34c8a68df 658243fb6 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:50:00 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f Merge: 658243fb6 3fda5de46 Author: Yi Dong <[email protected]> Date: Wed Oct 12 08:42:11 2022 -0400 Merge branch 'main' into fix_global_init commit 658243fb6580191b5d60edd30cde16dcc23cbb85 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:40:57 2022 +0000 fix import error Signed-off-by: Yi Dong <[email protected]> commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c Author: Yi Dong <[email protected]> Date: Tue Oct 11 22:44:12 2022 +0000 update the fused kernel Signed-off-by: Yi Dong <[email protected]> commit 536cf6bef9447b75843fad630729c47a2fba35f3 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:44:52 2022 -0700 add the missing file Signed-off-by: Yi Dong <[email protected]> commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:43:14 2022 -0700 fix fused softmax Signed-off-by: Yi Dong <[email protected]> commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:16:48 2022 -0700 move global step to base Signed-off-by: Yi Dong <[email protected]> commit 34c8a68df084b18d377e84415d9f07b2cd6673dd Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:50:11 2022 +0000 fix pipeline for eval Signed-off-by: Yi Dong <[email protected]> commit eee5d38218f26660c3ffebe9f615c850c80a1f0d Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:48:22 2022 +0000 fix for pipleline parallel Signed-off-by: Yi Dong <[email protected]> commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d Merge: 125e49947 e3b4c4d1f Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:13 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 125e4994760448ff75dd9328395813eda1c87547 Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:04 2022 +0000 add share option Signed-off-by: Yi Dong <[email protected]> commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368 Author: Yi Dong <[email protected]> Date: Wed Oct 5 11:43:48 2022 -0700 make sure consolidation works Signed-off-by: Yi Dong <[email protected]> commit a5c833964ecf05dc460ca1da69275c4019742150 Merge: 2a07ab52d abcb74be2 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:29 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 2a07ab52d95f15ba666823028c69e23825666c05 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:23 2022 +0000 added requirement Signed-off-by: Yi Dong <[email protected]> commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:39:42 2022 +0000 added a simple web server Signed-off-by: Yi Dong <[email protected]> commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b Author: Yi Dong <[email protected]> Date: Wed Oct 5 06:54:12 2022 -0700 fix empty val loss Signed-off-by: Yi Dong <[email protected]> commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26 Author: Yi Dong <[email protected]> Date: Tue Oct 4 19:25:30 2022 -0700 text gen working Signed-off-by: Yi Dong <[email protected]> commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba Author: Yi Dong <[email protected]> Date: Tue Oct 4 16:08:40 2022 -0700 first change Signed-off-by: Yi Dong <[email protected]> commit 59d077585e6962a669b824af58f64e8a0bea6547 Author: Yi Dong <[email protected]> Date: Tue Oct 4 15:00:40 2022 -0700 revert Signed-off-by: Yi Dong <[email protected]> commit 12a0f3902d99e9179403644bd951c045df716ca7 Author: Yi Dong <[email protected]> Date: Tue Oct 4 21:26:23 2022 +0000 init imp Signed-off-by: Yi Dong <[email protected]> commit 62a15dfd943cc48be495ac61b9f2f00995775c5f Merge: 82c90d2cd e0cc6b767 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:58:26 2022 -0700 Merge branch 'main' into universal_prompt commit 82c90d2cd0fd156f16a4b899f8c741d598f33990 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:17:13 2022 -0700 add sync Signed-off-by: Yi Dong <[email protected]> commit 9819b703eef877d90cd1257bf3610c69de9b4d7e Author: Yi Dong <[email protected]> Date: Sun Oct 2 17:52:34 2022 -0700 fix save model Signed-off-by: root <[email protected]> commit e4937e2fc5fb7d70754c97668416e4a69c3079fe Author: Yi Dong <[email protected]> Date: Sat Oct 1 18:56:09 2022 +0000 working Signed-off-by: Yi Dong <[email protected]> commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b Author: Yi Dong <[email protected]> Date: Sat Oct 1 17:34:03 2022 +0000 calcuate the mask Signed-off-by: Yi Dong <[email protected]> commit 9db3bc13eb65a94a475b837603351da68e3745bc Author: Yi Dong <[email protected]> Date: Fri Sep 30 23:26:32 2022 +0000 fix bug in datasets Signed-off-by: Yi Dong <[email protected]> commit f289900375d4412f53f8110be00fec6587627550 Author: Yi Dong <[email protected]> Date: Fri Sep 30 22:29:40 2022 +0000 update the code Signed-off-by: Yi Dong <[email protected]> commit 8e28a1f208aabaab72dbe769e72756baada04d99 Author: Yi Dong <[email protected]> Date: Fri Sep 30 21:52:52 2022 +0000 added new ds Signed-off-by: Yi Dong <[email protected]> commit 8d41315bab7ce90e200a8a7d1023c34f8e046897 Author: Yi Dong <[email protected]> Date: Fri Sep 30 18:57:09 2022 +0000 added new files Signed-off-by: Yi Dong <[email protected]> commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f Merge: cbe8b7ab1 fa6cd8588 Author: Yi Dong <[email protected]> Date: Thu Sep 29 21:43:29 2022 +0000 Merge branch 'llm-prompt-learning-improvements' into universal_prompt commit fa6cd858839277939446afe7275976078d54c512 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Sep 29 16:47:30 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0 Merge: 7d6d46742 8d670bc77 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 09:43:27 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 7d6d46742170a66758287a207d67e1b1bfd15613 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 16:42:43 2022 +0000 Removed inference step and added sentence peice check to predict step Signed-off-by: Virginia Adams <[email protected]> commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 15:26:32 2022 +0000 fixed first stage check for pipeline parallel T5 pt Signed-off-by: Virginia Adams <[email protected]> commit 3637be2b258c8d9028856f9971edb7da4a8121f0 Merge: a3ea722fd 986a76612 Author: Virginia Adams <[email protected]> Date: Wed Sep 28 10:23:30 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4 Merge: 770967a52 971485ce7 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 13:35:52 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 770967a5251a474b6dcc2d44bf9a2076adbcb604 Merge: d23bf6c30 e3ac280a8 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:17:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit d23bf6c30acc0e3f6af9b4e24547669866a34d62 Merge: de6a31651 333d2b749 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:05:16 2022 -0700 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit de6a31651e63d88a42b971794d93f18ff5a3cdff Author: Virginia Adams <[email protected]> Date: Mon Sep 26 17:00:53 2022 +0000 Updated PP check to be on first stage pipeline only Signed-off-by: Virginia Adams <[email protected]> commit 333d2b7498e6742ce66436f733c980a74616900c Merge: 592c0986a a39fc925a Author: Virginia Adams <[email protected]> Date: Fri Sep 23 16:11:21 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 592c0986a476a91b57b8605d7b70830d7acfa021 Author: Virginia Adams <[email protected]> Date: Fri Sep 23 23:08:41 2022 +0000 Fixed unused import and CI test bug Signed-off-by: Virginia Adams <[email protected]> commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455 Merge: ce4b72c8c b566c2d0e Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:25 2022 +0000 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit ce4b72c8c52f32be336e323dd78a38089edc3e7c Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:16 2022 +0000 Switch to import from base class Signed-off-by: Virginia Adams <[email protected]> commit b566c2d0e35a068f758fd1310bc620a47be4590b Merge: 6621f2854 e872061ac Author: Virginia Adams <[email protected]> Date: Fri Sep 23 10:09:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 6621f28543828a48484a5637f6c9f3ccb23a5b02 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:47:35 2022 +0000 python format fix Signed-off-by: Virginia Adams <[email protected]> commit 8deafc8987b6af5f7b99a250310f57a40198c37f Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:28:02 2022 +0000 Save .nemo on new best val score Signed-off-by: Virginia Adams <[email protected]> commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 18:03:19 2022 +0000 Added automatic checkpoint to nemo file method Signed-off-by: Virginia Adams <[email protected]> commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b Author: Virginia Adams <[email protected]> Date: Wed Sep 14 02:11:56 2022 +0000 Make GPT use base prompt learning model class: Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yi Dong <[email protected]> * fix LGTM Signed-off-by: Yi Dong <[email protected]> * fix validation Signed-off-by: Yi Dong <[email protected]> * change for the lm eval Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * make text generation work in data parallel environment Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * implement the service with rest service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * surpress log Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Update config Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Restore function needed for NMT Signed-off-by: MaximumEntropy <[email protected]> * handles no answer only Signed-off-by: Yi Dong <[email protected]> * Fix config Signed-off-by: MaximumEntropy <[email protected]> * added knn to web Signed-off-by: Yi Dong <[email protected]> * fix lgtm.com comments Signed-off-by: Yi Dong <[email protected]> * output the retrieved context Signed-off-by: Yi Dong <[email protected]> * allow no neighbor query Signed-off-by: Yi Dong <[email protected]> * remove the imports Signed-off-by: Yi Dong <[email protected]> * warn only once Signed-off-by: Yi Dong <[email protected]> * Change output file format from JSON to JSONL Signed-off-by: MaximumEntropy <[email protected]> * new t0 dataset Signed-off-by: Yi Dong <[email protected]> * Add T0 data preproc scripts Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Merge and multiprocessing Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix for is_correct Signed-off-by: MaximumEntropy <[email protected]> * fix epoch > 2 Signed-off-by: Yi Dong <[email protected]> * handles multiple dataloader Signed-off-by: Yi Dong <[email protected]> * remove template Signed-off-by: Yi Dong <[email protected]> * Refactor T0 dataset Signed-off-by: MaximumEntropy <[email protected]> * Add script to merge train folder into individual training files to minimize number of blends Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added on the fly service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add combo instance Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added combo service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * send weights back to server Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix index store Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Minor changes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add reset button Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add add eos Signed-off-by: Yi Dong <[email protected]> * use a seperate bert service Signed-off-by: Yi Dong <[email protected]> * no loss of accuracy Signed-off-by: Yi Dong <[email protected]> * pin the gradio version Signed-off-by: Yi Dong <[email protected]> * Remove bin compat Signed-off-by: MaximumEntropy <[email protected]> * Fix header lines Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * evaluate based on text generation Signed-off-by: Yi Dong <[email protected]> * exact match result aggregation Signed-off-by: Yi Dong <[email protected]> * working SP and SA Signed-off-by: Yi Dong <[email protected]> * sync Signed-off-by: Yi Dong <[email protected]> * fix checkpoint Signed-off-by: Yi Dong <[email protected]> * fix eval Signed-off-by: Yi Dong <[email protected]> * backup states Signed-off-by: Yi Dong <[email protected]> * backup states reset Signed-off-by: Yi Dong <[email protected]> * fix the bug Signed-off-by: Yi Dong <[email protected]> * fix evaluation for sentence piece Signed-off-by: Yi Dong <[email protected]> * fix a bug Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * potential fix in the future Signed-off-by: Yi Dong <[email protected]> * remove the universal codes Signed-off-by: Yi Dong <[email protected]> * remove universal strategy Signed-off-by: Yi Dong <[email protected]> * address reviewer comment Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add align function docstrings and make most args optional Signed-off-by: Elena Rastorgueva <[email protected]> * Remove redundant returns of viterbi and log probs matrices Signed-off-by: Elena Rastorgueva <[email protected]> * Rename h# to <initial_silence> Signed-off-by: Elena Rastorgueva <[email protected]> * Update manifest format description in README Signed-off-by: Elena Rastorgueva <[email protected]> * always remove any spaces from utt_id Signed-off-by: Elena Rastorgueva <[email protected]> * Patch the hanging of threads on very large stderr (#5589) (#5590) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * O2 style amp for gpt3 ptuning (#5246) * enable amp o2 plugin Signed-off-by: Jimmy Zhang <[email protected]> * only create master param if param requires gradient Signed-off-by: Jimmy Zhang <[email protected]> * remove pytorch autocast Signed-off-by: Jimmy Zhang <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Jimmy Zhang <[email protected]> * Update optimizer_with_main_params.py Signed-off-by: JimmyZhang12 <[email protected]> * create master grad only if param group requires grad Signed-off-by: Jimmy Zhang <[email protected]> * fix grad scaler for pp > 1 Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: JimmyZhang12 <[email protected]> Co-authored-by: Jimmy Zhang <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Better patch hydra (#5591) (#5592) * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Yet another fix with hydra multirun (#5594) (#5595) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add RETRO model documentation (#5578) * added retro doc Signed-off-by: Yi Dong <[email protected]> * finish data part Signed-off-by: Yi Dong <[email protected]> * added the data format Signed-off-by: Yi Dong <[email protected]> * added training script Signed-off-by: Yi Dong <[email protected]> * added training and evaluation steps Signed-off-by: Yi Dong <[email protected]> * edit the text Signed-off-by: Yi Dong <[email protected]> * added the images Signed-off-by: Yi Dong <[email protected]> * fix beginning Signed-off-by: Yi Dong <[email protected]> * fix the grammar Signed-off-by: Yi Dong <[email protected]> * trim it down Signed-off-by: Yi Dong <[email protected]> * add wandb option Signed-off-by: Yi Dong <[email protected]> * add reference Signed-off-by: Yi Dong <[email protected]> * fix path Signed-off-by: Yi Dong <[email protected]> * added the parameters table Signed-off-by: Yi Dong <[email protected]> * fix section Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix: setup_multiple validation/test data (#5585) Fix: setup_multiple validation/test data (#5585) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move to optimizer based EMA implementation (#5169) * Move to optimizer Signed-off-by: SeanNaren <[email protected]> * Fix replacing weights Signed-off-by: SeanNaren <[email protected]> * Allow swapping of weights be optional Signed-off-by: SeanNaren <[email protected]> * Save 2 models Signed-off-by: SeanNaren <[email protected]> * Use different hook Signed-off-by: SeanNaren <[email protected]> * Expose cpu device Signed-off-by: SeanNaren <[email protected]> * Add clause to see if this fixes issue with O2 optimizer Signed-off-by: SeanNaren <[email protected]> * Try to get O2 working Signed-off-by: SeanNaren <[email protected]> * WIP Signed-off-by: SeanNaren <[email protected]> * Fixes Signed-off-by: SeanNaren <[email protected]> * Fixes to tests Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Remove import Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Add comment Signed-off-by: SeanNaren <[email protected]> * Remove overwrite Signed-off-by: SeanNaren <[email protected]> * Add BatchNorm, currently tests fail Signed-off-by: SeanNaren <[email protected]> * Fix tests/functionality for batch norm Signed-off-by: SeanNaren <[email protected]> * Get rid of NLP changes Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * AIStore for ASR datasets (#5462) AIStore for ASR datasets Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add support for MHA adapters to ASR (#5396) * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * Temporary fixes to new signature of mixin Signed-off-by: smajumdar <[email protected]> * Add adapter util for constants, add all mha adapters. Signed-off-by: smajumdar <[email protected]> * Update name of function Signed-off-by: smajumdar <[email protected]> * Roll back changes to convASR Signed-off-by: smajumdar <[email protected]> * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * First draft of Conformer support for MHA attention Signed-off-by: smajumdar <[email protected]> * Add some preliminary tests Signed-off-by: smajumdar <[email protected]> * Add support for projection of the hidden dimension for attention Signed-off-by: smajumdar <[email protected]> * Add support for squeezeformer Signed-off-by: smajumdar <[email protected]> * Update train adapter config Signed-off-by: smajumdar <[email protected]> * Add tests for squeezeformer and unit tests for new modules Signed-off-by: smajumdar <[email protected]> * Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path Signed-off-by: smajumdar <[email protected]> * Update location of adapters Signed-off-by: smajumdar <[email protected]> * Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v Signed-off-by: smajumdar <[email protected]> * Update expmanager to clean up checkpoints Signed-off-by: smajumdar <[email protected]> * Fix style Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update training scripts Signed-off-by: smajumdar <[email protected]> * Update config and docs Signed-off-by: smajumdar <[email protected]> * Expose nemo delete function Signed-off-by: smajumdar <[email protected]> * Correct adapter partial state saving Signed-off-by: smajumdar <[email protected]> * Correct a bug with state management of adapter tokens Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Pull down EMA test Signed-off-by: smajumdar <[email protected]> * Correct name of adapter module utility class Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Co-authored-by: Jocelyn <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Create separator parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Call align function with hydra config Signed-off-by: Elena Rastorgueva <[email protected]> * update usage example Signed-off-by: Elena Rastorgueva <[email protected]> * Update Dockerfile (#5614) (#5616) Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future. Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Co-authored-by: He Huang (Steve) <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate pretrained_name and model_path parameters Signed-off-by: Elena Rastorgueva <[email protected]> * make "optional" tags bold in markdown Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move non-main functions to utils dir Signed-off-by: Elena Rastorgueva <[email protected]> * Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615) Signed-off-by: Ante Jukić <[email protected]> Co-authored-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] fix ranges of char set for accented letters. (#5607) * [TTS] fix ranges of char set for accented letters. * remove digits pattern and added unit tests for math operators. Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Change success message to reduce confusion (#5621) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update documentation and tutorials for Adapters (#5610) * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update test Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Rename test file Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602) * [TTS] add type hints and change variable names for tokenizers and g2p Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * 1. Added missing import for gather_objects. (#5627) Signed-off-by: Micha Livne <[email protected]> Signed-off-by: Micha Livne <[email protected]> Co-authored-by: Micha Livne <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fixed RadTTS unit test (#5572) Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove tests (#5633) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Conformer local attention (#5525) * local attn and merge Signed-off-by: sam1373 <[email protected]> * optional Signed-off-by: sam1373 <[email protected]> * override Signed-off-by: sam1373 <[email protected]> * incorporate comments Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> * fix Signed-off-by: sam1373 <[email protected]> * comment Signed-off-by: sam1373 <[email protected]> * changes, test Signed-off-by: sam1373 <[email protected]> * changes Signed-off-by: sam1373 <[email protected]> * check att context Signed-off-by: sam1373 <[email protected]> * readme link Signed-off-by: sam1373 <[email protected]> * utils Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> Signed-off-by: sam1373 <[email protected]> Signed-off-by: Samuel Kriman <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add core classes and functions for online clustering diarizer part 1 (#5526) * Add core classes and functions for online clustering diarizer Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add audio to labels code Signed-off-by: Taejin Park <[email protected]> * resolve type errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added unit=tests for very short audio Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Filled all missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved conflict and added missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixed unit-test errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix the wrongly added file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * Fix wrongly included file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * resolve code quality issue Signed-off-by: Taejin Park <[email protected]> * Fixed unit-test errors and bugs Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * changed total_sec for offline_clustering toy_data in unit-tests Signed-off-by: Taejin Park <[email protected]> * fixed merging index offset bug Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * only including part 1 files Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * removed unused function Signed-off-by: Taejin Park <[email protected]> * fixed unused imports Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * divided nmesc_clustering.py into two and reflected first-pass comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * adding offline/online_clustering.py Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix code QL autocomment Signed-off-by: Taejin Park <[email protected]> * Removed unused imports Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/online_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Reflected comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved code scanning issue Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/offline_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641) * add stt_eo_conformer_ctc_large model * stt_eo_conformer_transducer_large Co-authored-by: Andrei Andrusenko <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Removed unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Specify that filepaths need to be absolute Signed-off-by: Elena Rastorgueva <[email protected]> * replaces any spaces in utt_id with dashes Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make hydra script callable by another script Signed-off-by: Elena Rastorgueva <[email protected]> * do not specify default model or model_downsample_factor Signed-off-by: Elena Rastorgueva <[email protected]> * [Dockerfile] Remove AIS archive from docker image (#5629) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Measure audio_sr from audio instead of needing to specify Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541) * Chinese TTS replaces default pypinyin dict * Add jieba word segmenter as an option Signed-off-by: Yuekai Zhang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate parameters for device of transcription and viterbi steps Signed-off-by: Elena Rastorgueva <[email protected]> * Add mention of gecko Signed-off-by: Elena Rastorgueva <[email protected]> * [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [Add] ASR+VAD Inference Pipeline (#5575) Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets. Signed-off-by: stevehuang52 <[email protected]> Co-authored-by: fayejf <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * rename separator to ctm_grouping_separator and refactor Signed-off-by: Elena Rastorgueva <[email protected]> * Bert interleaved (#5556) * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * Update bert_model.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * Adding tests * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * Fix bug fix sequence parallel and Interleaved * Fix bug fix sequence parallel and Interleaved Signed-off-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add duration padding support for RADTTS inference (#5650) * Added duration padding support for RADTTS inference * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Kevin Shih <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Add remove_blank_tokens_from_ctm parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Dont save initial_silence line in CTM Signed-off-by: Elena Rastorgueva <[email protected]> * Add DLLogger support to exp_manager (#5658) * Add DLLogger support to exp_manager Signed-off-by: Alexandre Milesi <[email protected]> * Move dllogger to separate file and check import Signed-off-by: Alexandre Milesi <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Remove unused import Signed-off-by: Alexandre Milesi <[email protected]> Signed-off-by: Alexandre Milesi <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add minimum_timestamp_duration parameter Signed-off-by: Elena Rastorgueva <[email protected]> * add suggestion about removing blanks to README Signed-off-by: Elena Rastorgueva <[email protected]> * reorder args Signed-off-by: Elena Rastorgueva <[email protected]> * clarify description of ctm_grouping_separator in README Signed-off-by: Elena Rastorgueva <[email protected]> * update docstring Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Port stateless timer to exp manager (#5584) * Port stateless timer to exp manager Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes and remove from all megatron code Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Change message Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix EMA restart by allowing device to be set by the class init (#5668) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP (moved to separate repo) - merge to main (#5630) * Remove sdp files from tools folder Signed-off-by: Elena Rastorgueva <[email protected]> * Add page to docs with new SDP location Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add interface for making amax reduction optional for FP8 (#5447) * add TE interface for making amax reduction optional Signed-off-by: Kirthi Shankar Sivamani <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Kirthi Shankar Sivamani <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add tts dict cust notebook (#5662) * add tts dict cust notebook Signed-off-by: ekmb <[email protected]> * review Signed-off-by: ekmb <[email protected]> * fixed audio links Signed-off-by: ekmb <[email protected]> * remove old notebook Signed-off-by: ekmb <[email protected]> * fix typo Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [ASR] Audio processing base, multi-channel enhancement models (#5356) * Audio processing base model, enc-mask-dec enhancement, tests and modules Signed-off-by: Ante Jukić <[email protected]> * Addressed review comments Signed-off-by: Ante Jukić <[email protected]> * Fixed CodeQL warnings Signed-off-by: Ante Jukić <[email protected]> * Addressed PR comments Signed-off-by: Ante Jukić <[email protected]> * Addressed PR comments: - renamed AudioProcessingModel to AudioToAudioModel - various small modifications - updated unit tests Signed-off-by: Ante Jukić <[email protected]> * Addressed comments - Moved spectrogram to audio_preprocessing - Renamed MultichannelFeatures - Updated config and unit tests Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Expose ClusteringDiarizer device (#5681) * Expose device for users to set Signed-off-by: SeanNaren <[email protected]> * Expose device for users to set Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add Beam Search support to ASR transcribe() (#5443) * Add support for beam decoding via high level API. Signed-off-by: smajumdar <[email protected]> * Add ctc decoding section Signed-off-by: smajumdar <[email protected]> * Update ctc transcribe API to return results from beam search Signed-off-by: smajumdar <[email protected]> * Add argument to preserve arpa file Signed-off-by: smajumdar <[email protected]> * Update script to use hydra config, add some support for future compute timesteps, add doc for ctc decoding Signed-off-by: smajumdar <[email protected]> * Update eval script and doc to use new API Signed-off-by: smajumdar <[email protected]> * Add tests for ctc greedy decoding Signed-off-by: smajumdar <[email protected]> * Address reviewer comments and add docstrings Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix changes and address comments Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: Samuel Kriman <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Propagate attention_dropout flag for GPT-3 (#5669) * Propagate attention_dropout flag for GPT-3 Signed-off-by: Mikołaj Błaż <[email protected]> * Add default to megatron_gpt_config Signed-off-by: Mikołaj Błaż <[email protected]> Signed-off-by: Mikołaj Błaż <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <complex451@gmail…

* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <[email protected]> * remove cmudict downloading Signed-off-by: Oktai Tatanov <[email protected]> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <[email protected]> * add libritts processing Signed-off-by: Oktai Tatanov <[email protected]> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <[email protected]> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <[email protected]> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <[email protected]> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <[email protected]> * fix optimizers Signed-off-by: Oktai Tatanov <[email protected]> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (NVIDIA#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <[email protected]> Co-authored-by: Ryan Hong <[email protected]> Co-authored-by: [email protected] <[email protected]> Co-authored-by: Jason <[email protected]> Signed-off-by: Jason <[email protected]> * make new commit Signed-off-by: Jason <[email protected]> * add copyright headers Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * rename README Signed-off-by: Oktai Tatanov <[email protected]> * fix style without vits_modules Signed-off-by: Oktai Tatanov <[email protected]> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <[email protected]> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <[email protected]> * update package info Signed-off-by: ericharper <[email protected]> * new exp * update branch Signed-off-by: ericharper <[email protected]> * Restored tests previously disabled for 22.03 base (NVIDIA#4109) Signed-off-by: Boris Fomitchev <[email protected]> * add augmentation to label models (NVIDIA#4113) * add augmentation to label models Signed-off-by: nithinraok <[email protected]> * duration fix Signed-off-by: nithinraok <[email protected]> * Call register_bert_model after assigning self.bert_model variable (NVIDIA#4116) Signed-off-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> * Tutorial on ITN with Thutmose tagger and small fixes (NVIDIA#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <[email protected]> * fixes for code review Signed-off-by: Alexandra Antonova <[email protected]> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> * cleaned up TN/ ITN doc (NVIDIA#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <[email protected]> * fix typo Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * Check implicit grad acc in GLUE dataset building (NVIDIA#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <[email protected]> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <[email protected]> * update the default (NVIDIA#4135) Signed-off-by: ekmb <[email protected]> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <[email protected]> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <[email protected]> * Fix typo Signed-off-by: PeganovAnton <[email protected]> * Fix typo Signed-off-by: PeganovAnton <[email protected]> * Do not create tmp directory Signed-off-by: PeganovAnton <[email protected]> * Fix parameter name Signed-off-by: PeganovAnton <[email protected]> * finish cherry-pick op Signed-off-by: PeganovAnton <[email protected]> * Fix labels errors Signed-off-by: PeganovAnton <[email protected]> * Remove duplicate stage Signed-off-by: PeganovAnton <[email protected]> * Change target branch Signed-off-by: PeganovAnton <[email protected]> * fix typo (NVIDIA#4140) Signed-off-by: Yang Zhang <[email protected]> * Fix/punctuation avoid overwritting tmp files (NVIDIA#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <[email protected]> * Remove accidental changes Signed-off-by: PeganovAnton <[email protected]> * Remove accidental changes Signed-off-by: PeganovAnton <[email protected]> * Use built-in tempfile library Signed-off-by: PeganovAnton <[email protected]> * Fix code style Signed-off-by: PeganovAnton <[email protected]> * bug_fix_diarization_manifest_creation (NVIDIA#4125) Signed-off-by: Yang Zhang <[email protected]> Co-authored-by: Nithin Rao <[email protected]> * fix doc (NVIDIA#4146) Signed-off-by: Yang Zhang <[email protected]> * Tacotron2 retrain (NVIDIA#4103) * fix yaml Signed-off-by: treacker <[email protected]> * Fix for new TTSDataset class Signed-off-by: treacker <[email protected]> * added wandb logging Signed-off-by: treacker <[email protected]> * added wandb logging Signed-off-by: treacker <[email protected]> * fix numpy version Signed-off-by: treacker <[email protected]> * fix numpy version Signed-off-by: treacker <[email protected]> * inference fix Signed-off-by: treacker <[email protected]> * removed old code Signed-off-by: treacker <[email protected]> * updated parser logic Signed-off-by: treacker <[email protected]> * reverted version update Signed-off-by: treacker <[email protected]> * refactored parser logic Signed-off-by: treacker <[email protected]> * Updated Jenkinsfile Signed-off-by: treacker <[email protected]> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <[email protected]> * Made backward compatibility Signed-off-by: treacker <[email protected]> * Made backward compatibility Signed-off-by: treacker <[email protected]> * Update Jenkinsfile Signed-off-by: treacker <[email protected]> * Update tacotron.yaml Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * cleaned up TN/ ITN doc (NVIDIA#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <[email protected]> * fix typo Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: treacker <[email protected]> * Check implicit grad acc in GLUE dataset building (NVIDIA#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <[email protected]> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Fixed jenkins Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> * Multiprocess improvements (NVIDIA#4127) * initial commit Signed-off-by: nithinraok <[email protected]> * start fix Signed-off-by: nithinraok <[email protected]> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <[email protected]> * updated scp to filelist Signed-off-by: nithinraok <[email protected]> * WaveGlow input type fixes (NVIDIA#4151) Signed-off-by: Jocelyn Huang <[email protected]> * notebooks' link, typo and import fix (NVIDIA#4158) * redo missing pr 4007 Signed-off-by: fayejf <[email protected]> * remove extremely unreliable links Signed-off-by: fayejf <[email protected]> * Thutmose tagger bug fixes (NVIDIA#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <[email protected]> * fix model location Signed-off-by: Alexandra Antonova <[email protected]> * fix model location Signed-off-by: Alexandra Antonova <[email protected]> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <[email protected]> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <[email protected]> * changed model location to nemo Signed-off-by: Alexandra Antonova <[email protected]> * style fix Signed-off-by: Alexandra Antonova <[email protected]> * fix space Signed-off-by: Alexandra Antonova <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> * update speaker docs (NVIDIA#4164) * update speaker docs Signed-off-by: nithinraok <[email protected]> * chunks -> segments Signed-off-by: nithinraok <[email protected]> * Khz -> kHz Signed-off-by: nithinraok <[email protected]> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <[email protected]> * Fix link to inference notebook (NVIDIA#5247) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> * Update ASR scores table (NVIDIA#5254) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * Fix links to speaker identification notebook (NVIDIA#5260) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> * Minor typo fixes in TTS tutorial (NVIDIA#5266) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> * Pcla tutorial fixes (NVIDIA#5271) * Fixed typos Signed-off-by: Matvei Novikov <[email protected]> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <[email protected]> * Fixed typo Signed-off-by: Matvei Novikov <[email protected]> * Fixed branch variable Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Fix bug into Dialogue tutorial (NVIDIA#5277) * Typo fix (NVIDIA#5288) Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Fix dialogue tutorial bug (NVIDIA#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (NVIDIA#5310) * typo fix Signed-off-by: fayejf <[email protected]> * udpate transcribe Signed-off-by: fayejf <[email protected]> Signed-off-by: fayejf <[email protected]> * Add italian model checkpoints (NVIDIA#5316) Signed-off-by: Igor Gitman <[email protected]> Signed-off-by: Igor Gitman <[email protected]> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (NVIDIA#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <[email protected]> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (NVIDIA#5313) * fixes Signed-off-by: Matvei Novikov <[email protected]> * fixes Signed-off-by: Matvei Novikov <[email protected]> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (NVIDIA#5359) * Add Gradio App to ASR Docs (NVIDIA#5270) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * tutorial fixes (NVIDIA#5354) Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Add SDP documentation (NVIDIA#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <[email protected]> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <[email protected]> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <[email protected]> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <[email protected]> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <[email protected]> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <[email protected]> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <[email protected]> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <[email protected]> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <[email protected]> * Make config example less confusing Signed-off-by: Elena Rastorgueva <[email protected]> * Fix typo Signed-off-by: Elena Rastorgueva <[email protected]> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused imports Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <[email protected]> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Co-authored-by: Igor Gitman <[email protected]> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (NVIDIA#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <[email protected]> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * fix for num worker 0 causing issues in losses after 1 epoch (NVIDIA#5379) * Fixed bug in notebook (NVIDIA#5382) Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Virginia Adams <[email protected]> * Force MHA QKV onto fp32 (NVIDIA#5391) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * Added scheduling variety * ref * Fix for prompt table restore error (NVIDIA#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <[email protected]> * Added more saftey checks Signed-off-by: Virginia Adams <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA#5410) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421) Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> * Megatron Export Update (NVIDIA#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * disable pc test (NVIDIA#5426) Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA#5413) * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (NVIDIA#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421)" (NVIDIA#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <[email protected]> * Fix in config Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (NVIDIA#5242) (NVIDIA#5446) Signed-off-by: Vahid <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> * Set sync_batch_comm in other places (NVIDIA#5448) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Radtts 1.13 (NVIDIA#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 plus (NVIDIA#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Add num layers check (NVIDIA#5470) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Change to kwargs (NVIDIA#5475) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA#5339) (NVIDIA#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * minor bug fix (NVIDIA#5521) Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> * added set_start_method + function param bugfix (NVIDIA#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (NVIDIA#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * Remove broadcast (NVIDIA#5558) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * cleaning * Fix all gather while writing to a file during T5 finetuning (NVIDIA#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <[email protected]> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <[email protected]> * storing * updated version Signed-off-by: Evgeniy Shabalin <[email protected]> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <[email protected]> * fixed issues Signed-off-by: Evgeniy Shabalin <[email protected]> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <[email protected]> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <[email protected]> * remove old cython code Signed-off-by: Evgeniy Shabalin <[email protected]> * remove old cython code Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <[email protected]> * Enhancements Signed-off-by: Evgeniy Shabalin <[email protected]> * Enhancements Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <[email protected]> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <[email protected]> * typecheck fix Signed-off-by: Evgeniy Shabalin <[email protected]> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <[email protected]> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <[email protected]> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <[email protected]> Signed-off-by: Jason <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: nithinraok <[email protected]> Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: PeganovAnton <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: fayejf <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Igor Gitman <[email protected]> Signed-off-by: Sasha Meister <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yu Yao <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Evgeniy Shabalin <[email protected]> Co-authored-by: jasonjjl1999 <[email protected]> Co-authored-by: [email protected] <[email protected]> Co-authored-by: Oktai Tatanov <[email protected]> Co-authored-by: jasonjjl1999 <[email protected]> Co-authored-by: martynwei <[email protected]> Co-authored-by: Ryan Hong <[email protected]> Co-authored-by: Jason <[email protected]> Co-authored-by: ericharper <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: bene-ges <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Evelina <[email protected]> Co-authored-by: PeganovAnton <[email protected]> Co-authored-by: Jocelyn <[email protected]> Co-authored-by: fayejf <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Sean Naren <[email protected]> Co-authored-by: Matvei Novikov <[email protected]> Co-authored-by: Zhilin Wang <[email protected]> Co-authored-by: Igor Gitman <[email protected]> Co-authored-by: Sasha Meister <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: Taejin Park <[email protected]> Co-authored-by: Adi Renduchintala <[email protected]> Co-authored-by: Virginia Adams <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Vladimir Bataev <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Co-authored-by: Xuesong Yang <[email protected]>

* Merge r1.13.0 main (#5570) * update branch Signed-off-by: ericharper <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Optimized loop and bugfix in SDE (#5573) - Fixed bug with loading custom data attributes from JSON in Speech Data Explorer Signed-off-by: George Zelenfroynd <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update torchmetrics (#5566) * add task arg Signed-off-by: nithinraok <[email protected]> * update state Signed-off-by: nithinraok <[email protected]> Signed-off-by: nithinraok <[email protected]> Co-authored-by: Taejin Park <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove useless files. (#5580) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add initial NFA code Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make use of the specified device during viterbi decoding Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL notes Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL warning Signed-off-by: Elena Rastorgueva <[email protected]> * Add an option to defer data setup from ``__init__`` to ``setup`` (#5569) * Add an option to defer dataloader setup from __init__ to setup Signed-off-by: Ante Jukić <[email protected]> * Updated doc Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make utt_id specified by number of parts of audio_filepath user wishes to use Signed-off-by: Elena Rastorgueva <[email protected]> * remove audio_sr TODO - reduce risk of silent bugs Signed-off-by: Elena Rastorgueva <[email protected]> * Add check that model is CTC Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Text generation improvement (UI client, data parallel support) (#5437) * Squashed commit of the following: commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 15:07:42 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 35b424044fe80c3081e7756ab21244f701716f7e Author: Yi Dong <[email protected]> Date: Thu Oct 13 08:04:49 2022 -0700 get rid of base Signed-off-by: Yi Dong <[email protected]> commit 2955210e2311791543538cfbb5ad26b79414c954 Merge: d52edef8c eaf6757ca Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:17:02 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit d52edef8cd7b36593838fb270047e80f8ccb652e Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:16:24 2022 +0000 align with main Signed-off-by: Yi Dong <[email protected]> commit eaf6757ca5be8e099492f57c81d984429b0ad49c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 13:12:11 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:10:58 2022 +0000 same as main Signed-off-by: Yi Dong <[email protected]> commit e335de51bcc0d681c58b568c3d8c238bc5687c3b Merge: c231086e0 4463a9fe9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:08:09 2022 +0000 Merge branch 'main' into universal_prompt commit c231086e057f1efaa915f691d84664cb3d5aad85 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 19:59:12 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:56:17 2022 +0000 default to pad Signed-off-by: Yi Dong <[email protected]> commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:55:44 2022 +0000 add the option to pad the tokens Signed-off-by: Yi Dong <[email protected]> commit 876dc395b43fdeeaa2bcbbe13c76523633764c33 Merge: fbb0f4035 fe3c77ee9 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:20:47 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392 Author: Yi Dong <[email protected]> Date: Wed Oct 12 18:59:49 2022 +0000 fix import again Signed-off-by: Yi Dong <[email protected]> commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 16:00:24 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937 Author: Yi Dong <[email protected]> Date: Wed Oct 12 15:58:32 2022 +0000 enable server Signed-off-by: Yi Dong <[email protected]> commit cbe05d9fbc978f812cfbb671f45f147f300713c4 Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:07:28 2022 +0000 fix comment error Signed-off-by: Yi Dong <[email protected]> commit 1948048922e726ec6131e44b1a745389f18d4ef2 Merge: 232c2cce3 984f5c09a Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:05:30 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 232c2cce34d7a8b902da406706f3dd9b39475091 Merge: 34c8a68df 658243fb6 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:50:00 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f Merge: 658243fb6 3fda5de46 Author: Yi Dong <[email protected]> Date: Wed Oct 12 08:42:11 2022 -0400 Merge branch 'main' into fix_global_init commit 658243fb6580191b5d60edd30cde16dcc23cbb85 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:40:57 2022 +0000 fix import error Signed-off-by: Yi Dong <[email protected]> commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c Author: Yi Dong <[email protected]> Date: Tue Oct 11 22:44:12 2022 +0000 update the fused kernel Signed-off-by: Yi Dong <[email protected]> commit 536cf6bef9447b75843fad630729c47a2fba35f3 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:44:52 2022 -0700 add the missing file Signed-off-by: Yi Dong <[email protected]> commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:43:14 2022 -0700 fix fused softmax Signed-off-by: Yi Dong <[email protected]> commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:16:48 2022 -0700 move global step to base Signed-off-by: Yi Dong <[email protected]> commit 34c8a68df084b18d377e84415d9f07b2cd6673dd Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:50:11 2022 +0000 fix pipeline for eval Signed-off-by: Yi Dong <[email protected]> commit eee5d38218f26660c3ffebe9f615c850c80a1f0d Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:48:22 2022 +0000 fix for pipleline parallel Signed-off-by: Yi Dong <[email protected]> commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d Merge: 125e49947 e3b4c4d1f Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:13 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 125e4994760448ff75dd9328395813eda1c87547 Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:04 2022 +0000 add share option Signed-off-by: Yi Dong <[email protected]> commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368 Author: Yi Dong <[email protected]> Date: Wed Oct 5 11:43:48 2022 -0700 make sure consolidation works Signed-off-by: Yi Dong <[email protected]> commit a5c833964ecf05dc460ca1da69275c4019742150 Merge: 2a07ab52d abcb74be2 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:29 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 2a07ab52d95f15ba666823028c69e23825666c05 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:23 2022 +0000 added requirement Signed-off-by: Yi Dong <[email protected]> commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:39:42 2022 +0000 added a simple web server Signed-off-by: Yi Dong <[email protected]> commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b Author: Yi Dong <[email protected]> Date: Wed Oct 5 06:54:12 2022 -0700 fix empty val loss Signed-off-by: Yi Dong <[email protected]> commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26 Author: Yi Dong <[email protected]> Date: Tue Oct 4 19:25:30 2022 -0700 text gen working Signed-off-by: Yi Dong <[email protected]> commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba Author: Yi Dong <[email protected]> Date: Tue Oct 4 16:08:40 2022 -0700 first change Signed-off-by: Yi Dong <[email protected]> commit 59d077585e6962a669b824af58f64e8a0bea6547 Author: Yi Dong <[email protected]> Date: Tue Oct 4 15:00:40 2022 -0700 revert Signed-off-by: Yi Dong <[email protected]> commit 12a0f3902d99e9179403644bd951c045df716ca7 Author: Yi Dong <[email protected]> Date: Tue Oct 4 21:26:23 2022 +0000 init imp Signed-off-by: Yi Dong <[email protected]> commit 62a15dfd943cc48be495ac61b9f2f00995775c5f Merge: 82c90d2cd e0cc6b767 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:58:26 2022 -0700 Merge branch 'main' into universal_prompt commit 82c90d2cd0fd156f16a4b899f8c741d598f33990 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:17:13 2022 -0700 add sync Signed-off-by: Yi Dong <[email protected]> commit 9819b703eef877d90cd1257bf3610c69de9b4d7e Author: Yi Dong <[email protected]> Date: Sun Oct 2 17:52:34 2022 -0700 fix save model Signed-off-by: root <[email protected]> commit e4937e2fc5fb7d70754c97668416e4a69c3079fe Author: Yi Dong <[email protected]> Date: Sat Oct 1 18:56:09 2022 +0000 working Signed-off-by: Yi Dong <[email protected]> commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b Author: Yi Dong <[email protected]> Date: Sat Oct 1 17:34:03 2022 +0000 calcuate the mask Signed-off-by: Yi Dong <[email protected]> commit 9db3bc13eb65a94a475b837603351da68e3745bc Author: Yi Dong <[email protected]> Date: Fri Sep 30 23:26:32 2022 +0000 fix bug in datasets Signed-off-by: Yi Dong <[email protected]> commit f289900375d4412f53f8110be00fec6587627550 Author: Yi Dong <[email protected]> Date: Fri Sep 30 22:29:40 2022 +0000 update the code Signed-off-by: Yi Dong <[email protected]> commit 8e28a1f208aabaab72dbe769e72756baada04d99 Author: Yi Dong <[email protected]> Date: Fri Sep 30 21:52:52 2022 +0000 added new ds Signed-off-by: Yi Dong <[email protected]> commit 8d41315bab7ce90e200a8a7d1023c34f8e046897 Author: Yi Dong <[email protected]> Date: Fri Sep 30 18:57:09 2022 +0000 added new files Signed-off-by: Yi Dong <[email protected]> commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f Merge: cbe8b7ab1 fa6cd8588 Author: Yi Dong <[email protected]> Date: Thu Sep 29 21:43:29 2022 +0000 Merge branch 'llm-prompt-learning-improvements' into universal_prompt commit fa6cd858839277939446afe7275976078d54c512 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Sep 29 16:47:30 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0 Merge: 7d6d46742 8d670bc77 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 09:43:27 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 7d6d46742170a66758287a207d67e1b1bfd15613 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 16:42:43 2022 +0000 Removed inference step and added sentence peice check to predict step Signed-off-by: Virginia Adams <[email protected]> commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 15:26:32 2022 +0000 fixed first stage check for pipeline parallel T5 pt Signed-off-by: Virginia Adams <[email protected]> commit 3637be2b258c8d9028856f9971edb7da4a8121f0 Merge: a3ea722fd 986a76612 Author: Virginia Adams <[email protected]> Date: Wed Sep 28 10:23:30 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4 Merge: 770967a52 971485ce7 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 13:35:52 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 770967a5251a474b6dcc2d44bf9a2076adbcb604 Merge: d23bf6c30 e3ac280a8 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:17:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit d23bf6c30acc0e3f6af9b4e24547669866a34d62 Merge: de6a31651 333d2b749 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:05:16 2022 -0700 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit de6a31651e63d88a42b971794d93f18ff5a3cdff Author: Virginia Adams <[email protected]> Date: Mon Sep 26 17:00:53 2022 +0000 Updated PP check to be on first stage pipeline only Signed-off-by: Virginia Adams <[email protected]> commit 333d2b7498e6742ce66436f733c980a74616900c Merge: 592c0986a a39fc925a Author: Virginia Adams <[email protected]> Date: Fri Sep 23 16:11:21 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 592c0986a476a91b57b8605d7b70830d7acfa021 Author: Virginia Adams <[email protected]> Date: Fri Sep 23 23:08:41 2022 +0000 Fixed unused import and CI test bug Signed-off-by: Virginia Adams <[email protected]> commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455 Merge: ce4b72c8c b566c2d0e Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:25 2022 +0000 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit ce4b72c8c52f32be336e323dd78a38089edc3e7c Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:16 2022 +0000 Switch to import from base class Signed-off-by: Virginia Adams <[email protected]> commit b566c2d0e35a068f758fd1310bc620a47be4590b Merge: 6621f2854 e872061ac Author: Virginia Adams <[email protected]> Date: Fri Sep 23 10:09:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 6621f28543828a48484a5637f6c9f3ccb23a5b02 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:47:35 2022 +0000 python format fix Signed-off-by: Virginia Adams <[email protected]> commit 8deafc8987b6af5f7b99a250310f57a40198c37f Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:28:02 2022 +0000 Save .nemo on new best val score Signed-off-by: Virginia Adams <[email protected]> commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 18:03:19 2022 +0000 Added automatic checkpoint to nemo file method Signed-off-by: Virginia Adams <[email protected]> commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b Author: Virginia Adams <[email protected]> Date: Wed Sep 14 02:11:56 2022 +0000 Make GPT use base prompt learning model class: Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yi Dong <[email protected]> * fix LGTM Signed-off-by: Yi Dong <[email protected]> * fix validation Signed-off-by: Yi Dong <[email protected]> * change for the lm eval Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * make text generation work in data parallel environment Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * implement the service with rest service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * surpress log Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Update config Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Restore function needed for NMT Signed-off-by: MaximumEntropy <[email protected]> * handles no answer only Signed-off-by: Yi Dong <[email protected]> * Fix config Signed-off-by: MaximumEntropy <[email protected]> * added knn to web Signed-off-by: Yi Dong <[email protected]> * fix lgtm.com comments Signed-off-by: Yi Dong <[email protected]> * output the retrieved context Signed-off-by: Yi Dong <[email protected]> * allow no neighbor query Signed-off-by: Yi Dong <[email protected]> * remove the imports Signed-off-by: Yi Dong <[email protected]> * warn only once Signed-off-by: Yi Dong <[email protected]> * Change output file format from JSON to JSONL Signed-off-by: MaximumEntropy <[email protected]> * new t0 dataset Signed-off-by: Yi Dong <[email protected]> * Add T0 data preproc scripts Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Merge and multiprocessing Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix for is_correct Signed-off-by: MaximumEntropy <[email protected]> * fix epoch > 2 Signed-off-by: Yi Dong <[email protected]> * handles multiple dataloader Signed-off-by: Yi Dong <[email protected]> * remove template Signed-off-by: Yi Dong <[email protected]> * Refactor T0 dataset Signed-off-by: MaximumEntropy <[email protected]> * Add script to merge train folder into individual training files to minimize number of blends Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added on the fly service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add combo instance Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added combo service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * send weights back to server Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix index store Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Minor changes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add reset button Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add add eos Signed-off-by: Yi Dong <[email protected]> * use a seperate bert service Signed-off-by: Yi Dong <[email protected]> * no loss of accuracy Signed-off-by: Yi Dong <[email protected]> * pin the gradio version Signed-off-by: Yi Dong <[email protected]> * Remove bin compat Signed-off-by: MaximumEntropy <[email protected]> * Fix header lines Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * evaluate based on text generation Signed-off-by: Yi Dong <[email protected]> * exact match result aggregation Signed-off-by: Yi Dong <[email protected]> * working SP and SA Signed-off-by: Yi Dong <[email protected]> * sync Signed-off-by: Yi Dong <[email protected]> * fix checkpoint Signed-off-by: Yi Dong <[email protected]> * fix eval Signed-off-by: Yi Dong <[email protected]> * backup states Signed-off-by: Yi Dong <[email protected]> * backup states reset Signed-off-by: Yi Dong <[email protected]> * fix the bug Signed-off-by: Yi Dong <[email protected]> * fix evaluation for sentence piece Signed-off-by: Yi Dong <[email protected]> * fix a bug Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * potential fix in the future Signed-off-by: Yi Dong <[email protected]> * remove the universal codes Signed-off-by: Yi Dong <[email protected]> * remove universal strategy Signed-off-by: Yi Dong <[email protected]> * address reviewer comment Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add align function docstrings and make most args optional Signed-off-by: Elena Rastorgueva <[email protected]> * Remove redundant returns of viterbi and log probs matrices Signed-off-by: Elena Rastorgueva <[email protected]> * Rename h# to <initial_silence> Signed-off-by: Elena Rastorgueva <[email protected]> * Update manifest format description in README Signed-off-by: Elena Rastorgueva <[email protected]> * always remove any spaces from utt_id Signed-off-by: Elena Rastorgueva <[email protected]> * Patch the hanging of threads on very large stderr (#5589) (#5590) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * O2 style amp for gpt3 ptuning (#5246) * enable amp o2 plugin Signed-off-by: Jimmy Zhang <[email protected]> * only create master param if param requires gradient Signed-off-by: Jimmy Zhang <[email protected]> * remove pytorch autocast Signed-off-by: Jimmy Zhang <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Jimmy Zhang <[email protected]> * Update optimizer_with_main_params.py Signed-off-by: JimmyZhang12 <[email protected]> * create master grad only if param group requires grad Signed-off-by: Jimmy Zhang <[email protected]> * fix grad scaler for pp > 1 Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: JimmyZhang12 <[email protected]> Co-authored-by: Jimmy Zhang <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Better patch hydra (#5591) (#5592) * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Yet another fix with hydra multirun (#5594) (#5595) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add RETRO model documentation (#5578) * added retro doc Signed-off-by: Yi Dong <[email protected]> * finish data part Signed-off-by: Yi Dong <[email protected]> * added the data format Signed-off-by: Yi Dong <[email protected]> * added training script Signed-off-by: Yi Dong <[email protected]> * added training and evaluation steps Signed-off-by: Yi Dong <[email protected]> * edit the text Signed-off-by: Yi Dong <[email protected]> * added the images Signed-off-by: Yi Dong <[email protected]> * fix beginning Signed-off-by: Yi Dong <[email protected]> * fix the grammar Signed-off-by: Yi Dong <[email protected]> * trim it down Signed-off-by: Yi Dong <[email protected]> * add wandb option Signed-off-by: Yi Dong <[email protected]> * add reference Signed-off-by: Yi Dong <[email protected]> * fix path Signed-off-by: Yi Dong <[email protected]> * added the parameters table Signed-off-by: Yi Dong <[email protected]> * fix section Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix: setup_multiple validation/test data (#5585) Fix: setup_multiple validation/test data (#5585) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move to optimizer based EMA implementation (#5169) * Move to optimizer Signed-off-by: SeanNaren <[email protected]> * Fix replacing weights Signed-off-by: SeanNaren <[email protected]> * Allow swapping of weights be optional Signed-off-by: SeanNaren <[email protected]> * Save 2 models Signed-off-by: SeanNaren <[email protected]> * Use different hook Signed-off-by: SeanNaren <[email protected]> * Expose cpu device Signed-off-by: SeanNaren <[email protected]> * Add clause to see if this fixes issue with O2 optimizer Signed-off-by: SeanNaren <[email protected]> * Try to get O2 working Signed-off-by: SeanNaren <[email protected]> * WIP Signed-off-by: SeanNaren <[email protected]> * Fixes Signed-off-by: SeanNaren <[email protected]> * Fixes to tests Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Remove import Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Add comment Signed-off-by: SeanNaren <[email protected]> * Remove overwrite Signed-off-by: SeanNaren <[email protected]> * Add BatchNorm, currently tests fail Signed-off-by: SeanNaren <[email protected]> * Fix tests/functionality for batch norm Signed-off-by: SeanNaren <[email protected]> * Get rid of NLP changes Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * AIStore for ASR datasets (#5462) AIStore for ASR datasets Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add support for MHA adapters to ASR (#5396) * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * Temporary fixes to new signature of mixin Signed-off-by: smajumdar <[email protected]> * Add adapter util for constants, add all mha adapters. Signed-off-by: smajumdar <[email protected]> * Update name of function Signed-off-by: smajumdar <[email protected]> * Roll back changes to convASR Signed-off-by: smajumdar <[email protected]> * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * First draft of Conformer support for MHA attention Signed-off-by: smajumdar <[email protected]> * Add some preliminary tests Signed-off-by: smajumdar <[email protected]> * Add support for projection of the hidden dimension for attention Signed-off-by: smajumdar <[email protected]> * Add support for squeezeformer Signed-off-by: smajumdar <[email protected]> * Update train adapter config Signed-off-by: smajumdar <[email protected]> * Add tests for squeezeformer and unit tests for new modules Signed-off-by: smajumdar <[email protected]> * Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path Signed-off-by: smajumdar <[email protected]> * Update location of adapters Signed-off-by: smajumdar <[email protected]> * Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v Signed-off-by: smajumdar <[email protected]> * Update expmanager to clean up checkpoints Signed-off-by: smajumdar <[email protected]> * Fix style Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update training scripts Signed-off-by: smajumdar <[email protected]> * Update config and docs Signed-off-by: smajumdar <[email protected]> * Expose nemo delete function Signed-off-by: smajumdar <[email protected]> * Correct adapter partial state saving Signed-off-by: smajumdar <[email protected]> * Correct a bug with state management of adapter tokens Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Pull down EMA test Signed-off-by: smajumdar <[email protected]> * Correct name of adapter module utility class Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Co-authored-by: Jocelyn <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Create separator parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Call align function with hydra config Signed-off-by: Elena Rastorgueva <[email protected]> * update usage example Signed-off-by: Elena Rastorgueva <[email protected]> * Update Dockerfile (#5614) (#5616) Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future. Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Co-authored-by: He Huang (Steve) <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate pretrained_name and model_path parameters Signed-off-by: Elena Rastorgueva <[email protected]> * make "optional" tags bold in markdown Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move non-main functions to utils dir Signed-off-by: Elena Rastorgueva <[email protected]> * Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615) Signed-off-by: Ante Jukić <[email protected]> Co-authored-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] fix ranges of char set for accented letters. (#5607) * [TTS] fix ranges of char set for accented letters. * remove digits pattern and added unit tests for math operators. Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Change success message to reduce confusion (#5621) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update documentation and tutorials for Adapters (#5610) * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update test Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Rename test file Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602) * [TTS] add type hints and change variable names for tokenizers and g2p Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * 1. Added missing import for gather_objects. (#5627) Signed-off-by: Micha Livne <[email protected]> Signed-off-by: Micha Livne <[email protected]> Co-authored-by: Micha Livne <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fixed RadTTS unit test (#5572) Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove tests (#5633) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Conformer local attention (#5525) * local attn and merge Signed-off-by: sam1373 <[email protected]> * optional Signed-off-by: sam1373 <[email protected]> * override Signed-off-by: sam1373 <[email protected]> * incorporate comments Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> * fix Signed-off-by: sam1373 <[email protected]> * comment Signed-off-by: sam1373 <[email protected]> * changes, test Signed-off-by: sam1373 <[email protected]> * changes Signed-off-by: sam1373 <[email protected]> * check att context Signed-off-by: sam1373 <[email protected]> * readme link Signed-off-by: sam1373 <[email protected]> * utils Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> Signed-off-by: sam1373 <[email protected]> Signed-off-by: Samuel Kriman <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add core classes and functions for online clustering diarizer part 1 (#5526) * Add core classes and functions for online clustering diarizer Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add audio to labels code Signed-off-by: Taejin Park <[email protected]> * resolve type errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added unit=tests for very short audio Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Filled all missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved conflict and added missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixed unit-test errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix the wrongly added file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * Fix wrongly included file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * resolve code quality issue Signed-off-by: Taejin Park <[email protected]> * Fixed unit-test errors and bugs Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * changed total_sec for offline_clustering toy_data in unit-tests Signed-off-by: Taejin Park <[email protected]> * fixed merging index offset bug Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * only including part 1 files Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * removed unused function Signed-off-by: Taejin Park <[email protected]> * fixed unused imports Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * divided nmesc_clustering.py into two and reflected first-pass comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * adding offline/online_clustering.py Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix code QL autocomment Signed-off-by: Taejin Park <[email protected]> * Removed unused imports Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/online_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Reflected comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved code scanning issue Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/offline_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641) * add stt_eo_conformer_ctc_large model * stt_eo_conformer_transducer_large Co-authored-by: Andrei Andrusenko <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Removed unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Specify that filepaths need to be absolute Signed-off-by: Elena Rastorgueva <[email protected]> * replaces any spaces in utt_id with dashes Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make hydra script callable by another script Signed-off-by: Elena Rastorgueva <[email protected]> * do not specify default model or model_downsample_factor Signed-off-by: Elena Rastorgueva <[email protected]> * [Dockerfile] Remove AIS archive from docker image (#5629) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Measure audio_sr from audio instead of needing to specify Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541) * Chinese TTS replaces default pypinyin dict * Add jieba word segmenter as an option Signed-off-by: Yuekai Zhang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate parameters for device of transcription and viterbi steps Signed-off-by: Elena Rastorgueva <[email protected]> * Add mention of gecko Signed-off-by: Elena Rastorgueva <[email protected]> * [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [Add] ASR+VAD Inference Pipeline (#5575) Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets. Signed-off-by: stevehuang52 <[email protected]> Co-authored-by: fayejf <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * rename separator to ctm_grouping_separator and refactor Signed-off-by: Elena Rastorgueva <[email protected]> * Bert interleaved (#5556) * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * Update bert_model.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * Adding tests * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * Fix bug fix sequence parallel and Interleaved * Fix bug fix sequence parallel and Interleaved Signed-off-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add duration padding support for RADTTS inference (#5650) * Added duration padding support for RADTTS inference * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Kevin Shih <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Add remove_blank_tokens_from_ctm parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Dont save initial_silence line in CTM Signed-off-by: Elena Rastorgueva <[email protected]> * Add DLLogger support to exp_manager (#5658) * Add DLLogger support to exp_manager Signed-off-by: Alexandre Milesi <[email protected]> * Move dllogger to separate file and check import Signed-off-by: Alexandre Milesi <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Remove unused import Signed-off-by: Alexandre Milesi <[email protected]> Signed-off-by: Alexandre Milesi <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add minimum_timestamp_duration parameter Signed-off-by: Elena Rastorgueva <[email protected]> * add suggestion about removing blanks to README Signed-off-by: Elena Rastorgueva <[email protected]> * reorder args Signed-off-by: Elena Rastorgueva <[email protected]> * clarify description of ctm_grouping_separator in README Signed-off-by: Elena Rastorgueva <[email protected]> * update docstring Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Port stateless timer to exp manager (#5584) * Port stateless timer to exp manager Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes and remove from all megatron code Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Change message Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix EMA restart by allowing device to be set by the class init (#5668) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP (moved to separate repo) - merge to main (#5630) * Remove sdp files from tools folder Signed-off-by: Elena Rastorgueva <[email protected]> * Add page to docs with new SDP location Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add interface for making amax reduction optional for FP8 (#5447) * add TE interface for making amax reduction optional Signed-off-by: Kirthi Shankar Sivamani <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Kirthi Shankar Sivamani <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add tts dict cust notebook (#5662) * add tts dict cust notebook Signed-off-by: ekmb <[email protected]> * review Signed-off-by: ekmb <[email protected]> * fixed audio links Signed-off-by: ekmb <[email protected]> * remove old notebook Signed-off-by: ekmb <[email protected]> * fix typo Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [ASR] Audio processing base, multi-channel enhancement models (#5356) * Audio processing base model, enc-mask-dec enhancement, tests and modules Signed-off-by: Ante Jukić <[email protected]> * Addressed review comments Signed-off-by: Ante Jukić <[email protected]> * Fixed CodeQL warnings Signed-off-by: Ante Jukić <[email protected]> * Addressed PR comments Signed-off-by: Ante Jukić <[email protected]> * Addressed PR comments: - renamed AudioProcessingModel to AudioToAudioModel - various small modifications - updated unit tests Signed-off-by: Ante Jukić <[email protected]> * Addressed comments - Moved spectrogram to audio_preprocessing - Renamed MultichannelFeatures - Updated config and unit tests Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Expose ClusteringDiarizer device (#5681) * Expose device for users to set Signed-off-by: SeanNaren <[email protected]> * Expose device for users to set Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add Beam Search support to ASR transcribe() (#5443) * Add support for beam decoding via high level API. Signed-off-by: smajumdar <[email protected]> * Add ctc decoding section Signed-off-by: smajumdar <[email protected]> * Update ctc transcribe API to return results from beam search Signed-off-by: smajumdar <[email protected]> * Add argument to preserve arpa file Signed-off-by: smajumdar <[email protected]> * Update script to use hydra config, add some support for future compute timesteps, add doc for ctc decoding Signed-off-by: smajumdar <[email protected]> * Update eval script and doc to use new API Signed-off-by: smajumdar <[email protected]> * Add tests for ctc greedy decoding Signed-off-by: smajumdar <[email protected]> * Address reviewer comments and add docstrings Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix changes and address comments Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: Samuel Kriman <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Propagate attention_dropout flag for GPT-3 (#5669) * Propagate attention_dropout flag for GPT-3 Signed-off-by: Mikołaj Błaż <[email protected]> * Add default to megatron_gpt_config Signed-off-by: Mikołaj Błaż <[email protected]> Signed-off-by: Mikołaj Błaż <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <complex451@gmail…

* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <[email protected]> * remove cmudict downloading Signed-off-by: Oktai Tatanov <[email protected]> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <[email protected]> * add libritts processing Signed-off-by: Oktai Tatanov <[email protected]> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <[email protected]> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <[email protected]> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <[email protected]> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <[email protected]> * fix optimizers Signed-off-by: Oktai Tatanov <[email protected]> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <[email protected]> Co-authored-by: Ryan Hong <[email protected]> Co-authored-by: [email protected] <[email protected]> Co-authored-by: Jason <[email protected]> Signed-off-by: Jason <[email protected]> * make new commit Signed-off-by: Jason <[email protected]> * add copyright headers Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * rename README Signed-off-by: Oktai Tatanov <[email protected]> * fix style without vits_modules Signed-off-by: Oktai Tatanov <[email protected]> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <[email protected]> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <[email protected]> * update package info Signed-off-by: ericharper <[email protected]> * new exp * update branch Signed-off-by: ericharper <[email protected]> * Restored tests previously disabled for 22.03 base (#4109) Signed-off-by: Boris Fomitchev <[email protected]> * add augmentation to label models (#4113) * add augmentation to label models Signed-off-by: nithinraok <[email protected]> * duration fix Signed-off-by: nithinraok <[email protected]> * Call register_bert_model after assigning self.bert_model variable (#4116) Signed-off-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> * Tutorial on ITN with Thutmose tagger and small fixes (#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <[email protected]> * fixes for code review Signed-off-by: Alexandra Antonova <[email protected]> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <[email protected]> * fix typo Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <[email protected]> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <[email protected]> * update the default (#4135) Signed-off-by: ekmb <[email protected]> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <[email protected]> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <[email protected]> * Fix typo Signed-off-by: PeganovAnton <[email protected]> * Fix typo Signed-off-by: PeganovAnton <[email protected]> * Do not create tmp directory Signed-off-by: PeganovAnton <[email protected]> * Fix parameter name Signed-off-by: PeganovAnton <[email protected]> * finish cherry-pick op Signed-off-by: PeganovAnton <[email protected]> * Fix labels errors Signed-off-by: PeganovAnton <[email protected]> * Remove duplicate stage Signed-off-by: PeganovAnton <[email protected]> * Change target branch Signed-off-by: PeganovAnton <[email protected]> * fix typo (#4140) Signed-off-by: Yang Zhang <[email protected]> * Fix/punctuation avoid overwritting tmp files (#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <[email protected]> * Remove accidental changes Signed-off-by: PeganovAnton <[email protected]> * Remove accidental changes Signed-off-by: PeganovAnton <[email protected]> * Use built-in tempfile library Signed-off-by: PeganovAnton <[email protected]> * Fix code style Signed-off-by: PeganovAnton <[email protected]> * bug_fix_diarization_manifest_creation (#4125) Signed-off-by: Yang Zhang <[email protected]> Co-authored-by: Nithin Rao <[email protected]> * fix doc (#4146) Signed-off-by: Yang Zhang <[email protected]> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <[email protected]> * Fix for new TTSDataset class Signed-off-by: treacker <[email protected]> * added wandb logging Signed-off-by: treacker <[email protected]> * added wandb logging Signed-off-by: treacker <[email protected]> * fix numpy version Signed-off-by: treacker <[email protected]> * fix numpy version Signed-off-by: treacker <[email protected]> * inference fix Signed-off-by: treacker <[email protected]> * removed old code Signed-off-by: treacker <[email protected]> * updated parser logic Signed-off-by: treacker <[email protected]> * reverted version update Signed-off-by: treacker <[email protected]> * refactored parser logic Signed-off-by: treacker <[email protected]> * Updated Jenkinsfile Signed-off-by: treacker <[email protected]> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <[email protected]> * Made backward compatibility Signed-off-by: treacker <[email protected]> * Made backward compatibility Signed-off-by: treacker <[email protected]> * Update Jenkinsfile Signed-off-by: treacker <[email protected]> * Update tacotron.yaml Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <[email protected]> * fix typo Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: treacker <[email protected]> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <[email protected]> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Fixed jenkins Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <[email protected]> * start fix Signed-off-by: nithinraok <[email protected]> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <[email protected]> * updated scp to filelist Signed-off-by: nithinraok <[email protected]> * WaveGlow input type fixes (#4151) Signed-off-by: Jocelyn Huang <[email protected]> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <[email protected]> * remove extremely unreliable links Signed-off-by: fayejf <[email protected]> * Thutmose tagger bug fixes (#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <[email protected]> * fix model location Signed-off-by: Alexandra Antonova <[email protected]> * fix model location Signed-off-by: Alexandra Antonova <[email protected]> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <[email protected]> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <[email protected]> * changed model location to nemo Signed-off-by: Alexandra Antonova <[email protected]> * style fix Signed-off-by: Alexandra Antonova <[email protected]> * fix space Signed-off-by: Alexandra Antonova <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <[email protected]> * chunks -> segments Signed-off-by: nithinraok <[email protected]> * Khz -> kHz Signed-off-by: nithinraok <[email protected]> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <[email protected]> * Fix link to inference notebook (#5247) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> * Update ASR scores table (#5254) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * Fix links to speaker identification notebook (#5260) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> * Minor typo fixes in TTS tutorial (#5266) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> * Pcla tutorial fixes (#5271) * Fixed typos Signed-off-by: Matvei Novikov <[email protected]> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <[email protected]> * Fixed typo Signed-off-by: Matvei Novikov <[email protected]> * Fixed branch variable Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Fix bug into Dialogue tutorial (#5277) * Typo fix (#5288) Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Fix dialogue tutorial bug (#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (#5310) * typo fix Signed-off-by: fayejf <[email protected]> * udpate transcribe Signed-off-by: fayejf <[email protected]> Signed-off-by: fayejf <[email protected]> * Add italian model checkpoints (#5316) Signed-off-by: Igor Gitman <[email protected]> Signed-off-by: Igor Gitman <[email protected]> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <[email protected]> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5313) * fixes Signed-off-by: Matvei Novikov <[email protected]> * fixes Signed-off-by: Matvei Novikov <[email protected]> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (#5359) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * tutorial fixes (#5354) Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Add SDP documentation (#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <[email protected]> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <[email protected]> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <[email protected]> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <[email protected]> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <[email protected]> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <[email protected]> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <[email protected]> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <[email protected]> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <[email protected]> * Make config example less confusing Signed-off-by: Elena Rastorgueva <[email protected]> * Fix typo Signed-off-by: Elena Rastorgueva <[email protected]> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused imports Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <[email protected]> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Co-authored-by: Igor Gitman <[email protected]> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <[email protected]> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) * Fixed bug in notebook (#5382) Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Virginia Adams <[email protected]> * Force MHA QKV onto fp32 (#5391) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * Added scheduling variety * ref * Fix for prompt table restore error (#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <[email protected]> * Added more saftey checks Signed-off-by: Virginia Adams <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (#5421) Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * disable pc test (#5426) Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> * Fix GPT generation when using sentencepiece tokenizer (#5413) * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <[email protected]> * Fix in config Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (#5242) (#5446) Signed-off-by: Vahid <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> * Set sync_batch_comm in other places (#5448) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 plus (#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Add num layers check (#5470) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Change to kwargs (#5475) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * minor bug fix (#5521) Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * Remove broadcast (#5558) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * cleaning * Fix all gather while writing to a file during T5 finetuning (#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <[email protected]> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <[email protected]> * storing * updated version Signed-off-by: Evgeniy Shabalin <[email protected]> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <[email protected]> * fixed issues Signed-off-by: Evgeniy Shabalin <[email protected]> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <[email protected]> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <[email protected]> * remove old cython code Signed-off-by: Evgeniy Shabalin <[email protected]> * remove old cython code Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <[email protected]> * Enhancements Signed-off-by: Evgeniy Shabalin <[email protected]> * Enhancements Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <[email protected]> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <[email protected]> * typecheck fix Signed-off-by: Evgeniy Shabalin <[email protected]> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <[email protected]> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <[email protected]> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <[email protected]> Signed-off-by: Jason <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: nithinraok <[email protected]> Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: PeganovAnton <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: fayejf <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Igor Gitman <[email protected]> Signed-off-by: Sasha Meister <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yu Yao <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Evgeniy Shabalin <[email protected]> Co-authored-by: jasonjjl1999 <[email protected]> Co-authored-by: [email protected] <[email protected]> Co-authored-by: Oktai Tatanov <[email protected]> Co-authored-by: jasonjjl1999 <[email protected]> Co-authored-by: martynwei <[email protected]> Co-authored-by: Ryan Hong <[email protected]> Co-authored-by: Jason <[email protected]> Co-authored-by: ericharper <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: bene-ges <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Evelina <[email protected]> Co-authored-by: PeganovAnton <[email protected]> Co-authored-by: Jocelyn <[email protected]> Co-authored-by: fayejf <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Sean Naren <[email protected]> Co-authored-by: Matvei Novikov <[email protected]> Co-authored-by: Zhilin Wang <[email protected]> Co-authored-by: Igor Gitman <[email protected]> Co-authored-by: Sasha Meister <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: Taejin Park <[email protected]> Co-authored-by: Adi Renduchintala <[email protected]> Co-authored-by: Virginia Adams <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Vladimir Bataev <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Co-authored-by: Xuesong Yang <[email protected]>

* Flashlight Decoder for Nemo (#5790) * Added initial flashlight decoding files Signed-off-by: Daniel Egert <[email protected]> * Fixed some minor bugs Signed-off-by: Daniel Egert <[email protected]> * Added fixes from auto style thingamajig Signed-off-by: Daniel Egert <[email protected]> * Ran pre-commit and fixed script file formatting Signed-off-by: Daniel Egert <[email protected]> * Minor fixes brought up during PR review Signed-off-by: Daniel Egert <[email protected]> * Added Windows-compatible code to eval_beamsearch_ngram.py Signed-off-by: Daniel Egert <[email protected]> * Added initial flashlight decoding files Signed-off-by: Daniel Egert <[email protected]> * Fixed some minor bugs Signed-off-by: Daniel Egert <[email protected]> * Added fixes from auto style thingamajig Signed-off-by: Daniel Egert <[email protected]> * Ran pre-commit and fixed script file formatting Signed-off-by: Daniel Egert <[email protected]> * Minor fixes brought up during PR review Signed-off-by: Daniel Egert <[email protected]> * Added Windows-compatible code to eval_beamsearch_ngram.py Signed-off-by: Daniel Egert <[email protected]> Signed-off-by: Daniel Egert <[email protected]> Co-authored-by: Daniel Egert <[email protected]> * update torchmetrics to latest version (#5801) (#5802) * update torchmetrics to latest version Signed-off-by: nithinraok <[email protected]> * pin min version for metrics Signed-off-by: nithinraok <[email protected]> Signed-off-by: nithinraok <[email protected]> Signed-off-by: nithinraok <[email protected]> Co-authored-by: Nithin Rao <[email protected]> * Update radtts' infer path (#5788) * Update radtts' infer path Signed-off-by: Jason <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update to newer code Signed-off-by: Jason <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add import Signed-off-by: Jason <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * move code to model Signed-off-by: Jason <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add input and output types property Signed-off-by: Jason <[email protected]> * remove comment Signed-off-by: Jason <[email protected]> * update volume Signed-off-by: Jason <[email protected]> * align volume Signed-off-by: Jason <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Jason <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * add constraint info on batch size for tar dataset (#5812) * add constraint info on batch size for tar dataset Signed-off-by: Yang Zhang <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * style fix Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: Yang Zhang <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * NeMo Forced Aligner (#5571) * Merge r1.13.0 main (#5570) * update branch Signed-off-by: ericharper <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Optimized loop and bugfix in SDE (#5573) - Fixed bug with loading custom data attributes from JSON in Speech Data Explorer Signed-off-by: George Zelenfroynd <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update torchmetrics (#5566) * add task arg Signed-off-by: nithinraok <[email protected]> * update state Signed-off-by: nithinraok <[email protected]> Signed-off-by: nithinraok <[email protected]> Co-authored-by: Taejin Park <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove useless files. (#5580) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add initial NFA code Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make use of the specified device during viterbi decoding Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL notes Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL warning Signed-off-by: Elena Rastorgueva <[email protected]> * Add an option to defer data setup from ``__init__`` to ``setup`` (#5569) * Add an option to defer dataloader setup from __init__ to setup Signed-off-by: Ante Jukić <[email protected]> * Updated doc Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make utt_id specified by number of parts of audio_filepath user wishes to use Signed-off-by: Elena Rastorgueva <[email protected]> * remove audio_sr TODO - reduce risk of silent bugs Signed-off-by: Elena Rastorgueva <[email protected]> * Add check that model is CTC Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Text generation improvement (UI client, data parallel support) (#5437) * Squashed commit of the following: commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 15:07:42 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 35b424044fe80c3081e7756ab21244f701716f7e Author: Yi Dong <[email protected]> Date: Thu Oct 13 08:04:49 2022 -0700 get rid of base Signed-off-by: Yi Dong <[email protected]> commit 2955210e2311791543538cfbb5ad26b79414c954 Merge: d52edef8c eaf6757ca Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:17:02 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit d52edef8cd7b36593838fb270047e80f8ccb652e Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:16:24 2022 +0000 align with main Signed-off-by: Yi Dong <[email protected]> commit eaf6757ca5be8e099492f57c81d984429b0ad49c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 13:12:11 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:10:58 2022 +0000 same as main Signed-off-by: Yi Dong <[email protected]> commit e335de51bcc0d681c58b568c3d8c238bc5687c3b Merge: c231086e0 4463a9fe9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:08:09 2022 +0000 Merge branch 'main' into universal_prompt commit c231086e057f1efaa915f691d84664cb3d5aad85 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 19:59:12 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:56:17 2022 +0000 default to pad Signed-off-by: Yi Dong <[email protected]> commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:55:44 2022 +0000 add the option to pad the tokens Signed-off-by: Yi Dong <[email protected]> commit 876dc395b43fdeeaa2bcbbe13c76523633764c33 Merge: fbb0f4035 fe3c77ee9 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:20:47 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392 Author: Yi Dong <[email protected]> Date: Wed Oct 12 18:59:49 2022 +0000 fix import again Signed-off-by: Yi Dong <[email protected]> commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 16:00:24 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937 Author: Yi Dong <[email protected]> Date: Wed Oct 12 15:58:32 2022 +0000 enable server Signed-off-by: Yi Dong <[email protected]> commit cbe05d9fbc978f812cfbb671f45f147f300713c4 Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:07:28 2022 +0000 fix comment error Signed-off-by: Yi Dong <[email protected]> commit 1948048922e726ec6131e44b1a745389f18d4ef2 Merge: 232c2cce3 984f5c09a Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:05:30 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 232c2cce34d7a8b902da406706f3dd9b39475091 Merge: 34c8a68df 658243fb6 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:50:00 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f Merge: 658243fb6 3fda5de46 Author: Yi Dong <[email protected]> Date: Wed Oct 12 08:42:11 2022 -0400 Merge branch 'main' into fix_global_init commit 658243fb6580191b5d60edd30cde16dcc23cbb85 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:40:57 2022 +0000 fix import error Signed-off-by: Yi Dong <[email protected]> commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c Author: Yi Dong <[email protected]> Date: Tue Oct 11 22:44:12 2022 +0000 update the fused kernel Signed-off-by: Yi Dong <[email protected]> commit 536cf6bef9447b75843fad630729c47a2fba35f3 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:44:52 2022 -0700 add the missing file Signed-off-by: Yi Dong <[email protected]> commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:43:14 2022 -0700 fix fused softmax Signed-off-by: Yi Dong <[email protected]> commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:16:48 2022 -0700 move global step to base Signed-off-by: Yi Dong <[email protected]> commit 34c8a68df084b18d377e84415d9f07b2cd6673dd Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:50:11 2022 +0000 fix pipeline for eval Signed-off-by: Yi Dong <[email protected]> commit eee5d38218f26660c3ffebe9f615c850c80a1f0d Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:48:22 2022 +0000 fix for pipleline parallel Signed-off-by: Yi Dong <[email protected]> commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d Merge: 125e49947 e3b4c4d1f Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:13 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 125e4994760448ff75dd9328395813eda1c87547 Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:04 2022 +0000 add share option Signed-off-by: Yi Dong <[email protected]> commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368 Author: Yi Dong <[email protected]> Date: Wed Oct 5 11:43:48 2022 -0700 make sure consolidation works Signed-off-by: Yi Dong <[email protected]> commit a5c833964ecf05dc460ca1da69275c4019742150 Merge: 2a07ab52d abcb74be2 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:29 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 2a07ab52d95f15ba666823028c69e23825666c05 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:23 2022 +0000 added requirement Signed-off-by: Yi Dong <[email protected]> commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:39:42 2022 +0000 added a simple web server Signed-off-by: Yi Dong <[email protected]> commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b Author: Yi Dong <[email protected]> Date: Wed Oct 5 06:54:12 2022 -0700 fix empty val loss Signed-off-by: Yi Dong <[email protected]> commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26 Author: Yi Dong <[email protected]> Date: Tue Oct 4 19:25:30 2022 -0700 text gen working Signed-off-by: Yi Dong <[email protected]> commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba Author: Yi Dong <[email protected]> Date: Tue Oct 4 16:08:40 2022 -0700 first change Signed-off-by: Yi Dong <[email protected]> commit 59d077585e6962a669b824af58f64e8a0bea6547 Author: Yi Dong <[email protected]> Date: Tue Oct 4 15:00:40 2022 -0700 revert Signed-off-by: Yi Dong <[email protected]> commit 12a0f3902d99e9179403644bd951c045df716ca7 Author: Yi Dong <[email protected]> Date: Tue Oct 4 21:26:23 2022 +0000 init imp Signed-off-by: Yi Dong <[email protected]> commit 62a15dfd943cc48be495ac61b9f2f00995775c5f Merge: 82c90d2cd e0cc6b767 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:58:26 2022 -0700 Merge branch 'main' into universal_prompt commit 82c90d2cd0fd156f16a4b899f8c741d598f33990 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:17:13 2022 -0700 add sync Signed-off-by: Yi Dong <[email protected]> commit 9819b703eef877d90cd1257bf3610c69de9b4d7e Author: Yi Dong <[email protected]> Date: Sun Oct 2 17:52:34 2022 -0700 fix save model Signed-off-by: root <[email protected]> commit e4937e2fc5fb7d70754c97668416e4a69c3079fe Author: Yi Dong <[email protected]> Date: Sat Oct 1 18:56:09 2022 +0000 working Signed-off-by: Yi Dong <[email protected]> commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b Author: Yi Dong <[email protected]> Date: Sat Oct 1 17:34:03 2022 +0000 calcuate the mask Signed-off-by: Yi Dong <[email protected]> commit 9db3bc13eb65a94a475b837603351da68e3745bc Author: Yi Dong <[email protected]> Date: Fri Sep 30 23:26:32 2022 +0000 fix bug in datasets Signed-off-by: Yi Dong <[email protected]> commit f289900375d4412f53f8110be00fec6587627550 Author: Yi Dong <[email protected]> Date: Fri Sep 30 22:29:40 2022 +0000 update the code Signed-off-by: Yi Dong <[email protected]> commit 8e28a1f208aabaab72dbe769e72756baada04d99 Author: Yi Dong <[email protected]> Date: Fri Sep 30 21:52:52 2022 +0000 added new ds Signed-off-by: Yi Dong <[email protected]> commit 8d41315bab7ce90e200a8a7d1023c34f8e046897 Author: Yi Dong <[email protected]> Date: Fri Sep 30 18:57:09 2022 +0000 added new files Signed-off-by: Yi Dong <[email protected]> commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f Merge: cbe8b7ab1 fa6cd8588 Author: Yi Dong <[email protected]> Date: Thu Sep 29 21:43:29 2022 +0000 Merge branch 'llm-prompt-learning-improvements' into universal_prompt commit fa6cd858839277939446afe7275976078d54c512 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Sep 29 16:47:30 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0 Merge: 7d6d46742 8d670bc77 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 09:43:27 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 7d6d46742170a66758287a207d67e1b1bfd15613 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 16:42:43 2022 +0000 Removed inference step and added sentence peice check to predict step Signed-off-by: Virginia Adams <[email protected]> commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 15:26:32 2022 +0000 fixed first stage check for pipeline parallel T5 pt Signed-off-by: Virginia Adams <[email protected]> commit 3637be2b258c8d9028856f9971edb7da4a8121f0 Merge: a3ea722fd 986a76612 Author: Virginia Adams <[email protected]> Date: Wed Sep 28 10:23:30 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4 Merge: 770967a52 971485ce7 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 13:35:52 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 770967a5251a474b6dcc2d44bf9a2076adbcb604 Merge: d23bf6c30 e3ac280a8 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:17:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit d23bf6c30acc0e3f6af9b4e24547669866a34d62 Merge: de6a31651 333d2b749 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:05:16 2022 -0700 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit de6a31651e63d88a42b971794d93f18ff5a3cdff Author: Virginia Adams <[email protected]> Date: Mon Sep 26 17:00:53 2022 +0000 Updated PP check to be on first stage pipeline only Signed-off-by: Virginia Adams <[email protected]> commit 333d2b7498e6742ce66436f733c980a74616900c Merge: 592c0986a a39fc925a Author: Virginia Adams <[email protected]> Date: Fri Sep 23 16:11:21 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 592c0986a476a91b57b8605d7b70830d7acfa021 Author: Virginia Adams <[email protected]> Date: Fri Sep 23 23:08:41 2022 +0000 Fixed unused import and CI test bug Signed-off-by: Virginia Adams <[email protected]> commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455 Merge: ce4b72c8c b566c2d0e Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:25 2022 +0000 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit ce4b72c8c52f32be336e323dd78a38089edc3e7c Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:16 2022 +0000 Switch to import from base class Signed-off-by: Virginia Adams <[email protected]> commit b566c2d0e35a068f758fd1310bc620a47be4590b Merge: 6621f2854 e872061ac Author: Virginia Adams <[email protected]> Date: Fri Sep 23 10:09:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 6621f28543828a48484a5637f6c9f3ccb23a5b02 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:47:35 2022 +0000 python format fix Signed-off-by: Virginia Adams <[email protected]> commit 8deafc8987b6af5f7b99a250310f57a40198c37f Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:28:02 2022 +0000 Save .nemo on new best val score Signed-off-by: Virginia Adams <[email protected]> commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 18:03:19 2022 +0000 Added automatic checkpoint to nemo file method Signed-off-by: Virginia Adams <[email protected]> commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b Author: Virginia Adams <[email protected]> Date: Wed Sep 14 02:11:56 2022 +0000 Make GPT use base prompt learning model class: Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yi Dong <[email protected]> * fix LGTM Signed-off-by: Yi Dong <[email protected]> * fix validation Signed-off-by: Yi Dong <[email protected]> * change for the lm eval Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * make text generation work in data parallel environment Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * implement the service with rest service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * surpress log Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Update config Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Restore function needed for NMT Signed-off-by: MaximumEntropy <[email protected]> * handles no answer only Signed-off-by: Yi Dong <[email protected]> * Fix config Signed-off-by: MaximumEntropy <[email protected]> * added knn to web Signed-off-by: Yi Dong <[email protected]> * fix lgtm.com comments Signed-off-by: Yi Dong <[email protected]> * output the retrieved context Signed-off-by: Yi Dong <[email protected]> * allow no neighbor query Signed-off-by: Yi Dong <[email protected]> * remove the imports Signed-off-by: Yi Dong <[email protected]> * warn only once Signed-off-by: Yi Dong <[email protected]> * Change output file format from JSON to JSONL Signed-off-by: MaximumEntropy <[email protected]> * new t0 dataset Signed-off-by: Yi Dong <[email protected]> * Add T0 data preproc scripts Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Merge and multiprocessing Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix for is_correct Signed-off-by: MaximumEntropy <[email protected]> * fix epoch > 2 Signed-off-by: Yi Dong <[email protected]> * handles multiple dataloader Signed-off-by: Yi Dong <[email protected]> * remove template Signed-off-by: Yi Dong <[email protected]> * Refactor T0 dataset Signed-off-by: MaximumEntropy <[email protected]> * Add script to merge train folder into individual training files to minimize number of blends Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added on the fly service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add combo instance Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added combo service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * send weights back to server Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix index store Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Minor changes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add reset button Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add add eos Signed-off-by: Yi Dong <[email protected]> * use a seperate bert service Signed-off-by: Yi Dong <[email protected]> * no loss of accuracy Signed-off-by: Yi Dong <[email protected]> * pin the gradio version Signed-off-by: Yi Dong <[email protected]> * Remove bin compat Signed-off-by: MaximumEntropy <[email protected]> * Fix header lines Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * evaluate based on text generation Signed-off-by: Yi Dong <[email protected]> * exact match result aggregation Signed-off-by: Yi Dong <[email protected]> * working SP and SA Signed-off-by: Yi Dong <[email protected]> * sync Signed-off-by: Yi Dong <[email protected]> * fix checkpoint Signed-off-by: Yi Dong <[email protected]> * fix eval Signed-off-by: Yi Dong <[email protected]> * backup states Signed-off-by: Yi Dong <[email protected]> * backup states reset Signed-off-by: Yi Dong <[email protected]> * fix the bug Signed-off-by: Yi Dong <[email protected]> * fix evaluation for sentence piece Signed-off-by: Yi Dong <[email protected]> * fix a bug Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * potential fix in the future Signed-off-by: Yi Dong <[email protected]> * remove the universal codes Signed-off-by: Yi Dong <[email protected]> * remove universal strategy Signed-off-by: Yi Dong <[email protected]> * address reviewer comment Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add align function docstrings and make most args optional Signed-off-by: Elena Rastorgueva <[email protected]> * Remove redundant returns of viterbi and log probs matrices Signed-off-by: Elena Rastorgueva <[email protected]> * Rename h# to <initial_silence> Signed-off-by: Elena Rastorgueva <[email protected]> * Update manifest format description in README Signed-off-by: Elena Rastorgueva <[email protected]> * always remove any spaces from utt_id Signed-off-by: Elena Rastorgueva <[email protected]> * Patch the hanging of threads on very large stderr (#5589) (#5590) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * O2 style amp for gpt3 ptuning (#5246) * enable amp o2 plugin Signed-off-by: Jimmy Zhang <[email protected]> * only create master param if param requires gradient Signed-off-by: Jimmy Zhang <[email protected]> * remove pytorch autocast Signed-off-by: Jimmy Zhang <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Jimmy Zhang <[email protected]> * Update optimizer_with_main_params.py Signed-off-by: JimmyZhang12 <[email protected]> * create master grad only if param group requires grad Signed-off-by: Jimmy Zhang <[email protected]> * fix grad scaler for pp > 1 Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: JimmyZhang12 <[email protected]> Co-authored-by: Jimmy Zhang <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Better patch hydra (#5591) (#5592) * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Yet another fix with hydra multirun (#5594) (#5595) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add RETRO model documentation (#5578) * added retro doc Signed-off-by: Yi Dong <[email protected]> * finish data part Signed-off-by: Yi Dong <[email protected]> * added the data format Signed-off-by: Yi Dong <[email protected]> * added training script Signed-off-by: Yi Dong <[email protected]> * added training and evaluation steps Signed-off-by: Yi Dong <[email protected]> * edit the text Signed-off-by: Yi Dong <[email protected]> * added the images Signed-off-by: Yi Dong <[email protected]> * fix beginning Signed-off-by: Yi Dong <[email protected]> * fix the grammar Signed-off-by: Yi Dong <[email protected]> * trim it down Signed-off-by: Yi Dong <[email protected]> * add wandb option Signed-off-by: Yi Dong <[email protected]> * add reference Signed-off-by: Yi Dong <[email protected]> * fix path Signed-off-by: Yi Dong <[email protected]> * added the parameters table Signed-off-by: Yi Dong <[email protected]> * fix section Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix: setup_multiple validation/test data (#5585) Fix: setup_multiple validation/test data (#5585) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move to optimizer based EMA implementation (#5169) * Move to optimizer Signed-off-by: SeanNaren <[email protected]> * Fix replacing weights Signed-off-by: SeanNaren <[email protected]> * Allow swapping of weights be optional Signed-off-by: SeanNaren <[email protected]> * Save 2 models Signed-off-by: SeanNaren <[email protected]> * Use different hook Signed-off-by: SeanNaren <[email protected]> * Expose cpu device Signed-off-by: SeanNaren <[email protected]> * Add clause to see if this fixes issue with O2 optimizer Signed-off-by: SeanNaren <[email protected]> * Try to get O2 working Signed-off-by: SeanNaren <[email protected]> * WIP Signed-off-by: SeanNaren <[email protected]> * Fixes Signed-off-by: SeanNaren <[email protected]> * Fixes to tests Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Remove import Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Add comment Signed-off-by: SeanNaren <[email protected]> * Remove overwrite Signed-off-by: SeanNaren <[email protected]> * Add BatchNorm, currently tests fail Signed-off-by: SeanNaren <[email protected]> * Fix tests/functionality for batch norm Signed-off-by: SeanNaren <[email protected]> * Get rid of NLP changes Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * AIStore for ASR datasets (#5462) AIStore for ASR datasets Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add support for MHA adapters to ASR (#5396) * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * Temporary fixes to new signature of mixin Signed-off-by: smajumdar <[email protected]> * Add adapter util for constants, add all mha adapters. Signed-off-by: smajumdar <[email protected]> * Update name of function Signed-off-by: smajumdar <[email protected]> * Roll back changes to convASR Signed-off-by: smajumdar <[email protected]> * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * First draft of Conformer support for MHA attention Signed-off-by: smajumdar <[email protected]> * Add some preliminary tests Signed-off-by: smajumdar <[email protected]> * Add support for projection of the hidden dimension for attention Signed-off-by: smajumdar <[email protected]> * Add support for squeezeformer Signed-off-by: smajumdar <[email protected]> * Update train adapter config Signed-off-by: smajumdar <[email protected]> * Add tests for squeezeformer and unit tests for new modules Signed-off-by: smajumdar <[email protected]> * Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path Signed-off-by: smajumdar <[email protected]> * Update location of adapters Signed-off-by: smajumdar <[email protected]> * Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v Signed-off-by: smajumdar <[email protected]> * Update expmanager to clean up checkpoints Signed-off-by: smajumdar <[email protected]> * Fix style Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update training scripts Signed-off-by: smajumdar <[email protected]> * Update config and docs Signed-off-by: smajumdar <[email protected]> * Expose nemo delete function Signed-off-by: smajumdar <[email protected]> * Correct adapter partial state saving Signed-off-by: smajumdar <[email protected]> * Correct a bug with state management of adapter tokens Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Pull down EMA test Signed-off-by: smajumdar <[email protected]> * Correct name of adapter module utility class Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Co-authored-by: Jocelyn <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Create separator parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Call align function with hydra config Signed-off-by: Elena Rastorgueva <[email protected]> * update usage example Signed-off-by: Elena Rastorgueva <[email protected]> * Update Dockerfile (#5614) (#5616) Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future. Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Co-authored-by: He Huang (Steve) <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate pretrained_name and model_path parameters Signed-off-by: Elena Rastorgueva <[email protected]> * make "optional" tags bold in markdown Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move non-main functions to utils dir Signed-off-by: Elena Rastorgueva <[email protected]> * Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615) Signed-off-by: Ante Jukić <[email protected]> Co-authored-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] fix ranges of char set for accented letters. (#5607) * [TTS] fix ranges of char set for accented letters. * remove digits pattern and added unit tests for math operators. Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Change success message to reduce confusion (#5621) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update documentation and tutorials for Adapters (#5610) * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update test Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Rename test file Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602) * [TTS] add type hints and change variable names for tokenizers and g2p Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * 1. Added missing import for gather_objects. (#5627) Signed-off-by: Micha Livne <[email protected]> Signed-off-by: Micha Livne <[email protected]> Co-authored-by: Micha Livne <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fixed RadTTS unit test (#5572) Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove tests (#5633) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Conformer local attention (#5525) * local attn and merge Signed-off-by: sam1373 <[email protected]> * optional Signed-off-by: sam1373 <[email protected]> * override Signed-off-by: sam1373 <[email protected]> * incorporate comments Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> * fix Signed-off-by: sam1373 <[email protected]> * comment Signed-off-by: sam1373 <[email protected]> * changes, test Signed-off-by: sam1373 <[email protected]> * changes Signed-off-by: sam1373 <[email protected]> * check att context Signed-off-by: sam1373 <[email protected]> * readme link Signed-off-by: sam1373 <[email protected]> * utils Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> Signed-off-by: sam1373 <[email protected]> Signed-off-by: Samuel Kriman <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add core classes and functions for online clustering diarizer part 1 (#5526) * Add core classes and functions for online clustering diarizer Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add audio to labels code Signed-off-by: Taejin Park <[email protected]> * resolve type errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added unit=tests for very short audio Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Filled all missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved conflict and added missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixed unit-test errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix the wrongly added file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * Fix wrongly included file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * resolve code quality issue Signed-off-by: Taejin Park <[email protected]> * Fixed unit-test errors and bugs Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * changed total_sec for offline_clustering toy_data in unit-tests Signed-off-by: Taejin Park <[email protected]> * fixed merging index offset bug Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * only including part 1 files Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * removed unused function Signed-off-by: Taejin Park <[email protected]> * fixed unused imports Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * divided nmesc_clustering.py into two and reflected first-pass comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * adding offline/online_clustering.py Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix code QL autocomment Signed-off-by: Taejin Park <[email protected]> * Removed unused imports Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/online_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Reflected comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved code scanning issue Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/offline_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641) * add stt_eo_conformer_ctc_large model * stt_eo_conformer_transducer_large Co-authored-by: Andrei Andrusenko <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Removed unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Specify that filepaths need to be absolute Signed-off-by: Elena Rastorgueva <[email protected]> * replaces any spaces in utt_id with dashes Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make hydra script callable by another script Signed-off-by: Elena Rastorgueva <[email protected]> * do not specify default model or model_downsample_factor Signed-off-by: Elena Rastorgueva <[email protected]> * [Dockerfile] Remove AIS archive from docker image (#5629) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Measure audio_sr from audio instead of needing to specify Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541) * Chinese TTS replaces default pypinyin dict * Add jieba word segmenter as an option Signed-off-by: Yuekai Zhang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate parameters for device of transcription and viterbi steps Signed-off-by: Elena Rastorgueva <[email protected]> * Add mention of gecko Signed-off-by: Elena Rastorgueva <[email protected]> * [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [Add] ASR+VAD Inference Pipeline (#5575) Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets. Signed-off-by: stevehuang52 <[email protected]> Co-authored-by: fayejf <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * rename separator to ctm_grouping_separator and refactor Signed-off-by: Elena Rastorgueva <[email protected]> * Bert interleaved (#5556) * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * Update bert_model.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * Adding tests * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * Fix bug fix sequence parallel and Interleaved * Fix bug fix sequence parallel and Interleaved Signed-off-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add duration padding support for RADTTS inference (#5650) * Added duration padding support for RADTTS inference * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Kevin Shih <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Add remove_blank_tokens_from_ctm parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Dont save initial_silence line in CTM Signed-off-by: Elena Rastorgueva <[email protected]> * Add DLLogger support to exp_manager (#5658) * Add DLLogger support to exp_manager Signed-off-by: Alexandre Milesi <[email protected]> * Move dllogger to separate file and check import Signed-off-by: Alexandre Milesi <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Remove unused import Signed-off-by: Alexandre Milesi <[email protected]> Signed-off-by: Alexandre Milesi <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add minimum_timestamp_duration parameter Signed-off-by: Elena Rastorgueva <[email protected]> * add suggestion about removing blanks to README Signed-off-by: Elena Rastorgueva <[email protected]> * reorder args Signed-off-by: Elena Rastorgueva <[email protected]> * clarify description of ctm_grouping_separator in README Signed-off-by: Elena Rastorgueva <[email protected]> * update docstring Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Port stateless timer to exp manager (#5584) * Port stateless timer to exp manager Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes and remove from all megatron code Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Change message Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix EMA restart by allowing device to be set by the class init (#5668) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP (moved to separate repo) - merge to main (#5630) * Remove sdp files from tools folder Signed-off-by: Elena Rastorgueva <[email protected]> * Add page to docs with new SDP location Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add interface for making amax reduction optional for FP8 (#5447) * add TE interface for making amax reduction optional Signed-off-by: Kirthi Shankar Sivamani <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Kirthi Shankar Sivamani <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add tts …

* Merge r1.13.0 main (#5570) * update branch Signed-off-by: ericharper <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Optimized loop and bugfix in SDE (#5573) - Fixed bug with loading custom data attributes from JSON in Speech Data Explorer Signed-off-by: George Zelenfroynd <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update torchmetrics (#5566) * add task arg Signed-off-by: nithinraok <[email protected]> * update state Signed-off-by: nithinraok <[email protected]> Signed-off-by: nithinraok <[email protected]> Co-authored-by: Taejin Park <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove useless files. (#5580) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add initial NFA code Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make use of the specified device during viterbi decoding Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL notes Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL warning Signed-off-by: Elena Rastorgueva <[email protected]> * Add an option to defer data setup from ``__init__`` to ``setup`` (#5569) * Add an option to defer dataloader setup from __init__ to setup Signed-off-by: Ante Jukić <[email protected]> * Updated doc Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make utt_id specified by number of parts of audio_filepath user wishes to use Signed-off-by: Elena Rastorgueva <[email protected]> * remove audio_sr TODO - reduce risk of silent bugs Signed-off-by: Elena Rastorgueva <[email protected]> * Add check that model is CTC Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Text generation improvement (UI client, data parallel support) (#5437) * Squashed commit of the following: commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 15:07:42 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 35b424044fe80c3081e7756ab21244f701716f7e Author: Yi Dong <[email protected]> Date: Thu Oct 13 08:04:49 2022 -0700 get rid of base Signed-off-by: Yi Dong <[email protected]> commit 2955210e2311791543538cfbb5ad26b79414c954 Merge: d52edef8c eaf6757ca Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:17:02 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit d52edef8cd7b36593838fb270047e80f8ccb652e Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:16:24 2022 +0000 align with main Signed-off-by: Yi Dong <[email protected]> commit eaf6757ca5be8e099492f57c81d984429b0ad49c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 13:12:11 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:10:58 2022 +0000 same as main Signed-off-by: Yi Dong <[email protected]> commit e335de51bcc0d681c58b568c3d8c238bc5687c3b Merge: c231086e0 4463a9fe9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:08:09 2022 +0000 Merge branch 'main' into universal_prompt commit c231086e057f1efaa915f691d84664cb3d5aad85 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 19:59:12 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:56:17 2022 +0000 default to pad Signed-off-by: Yi Dong <[email protected]> commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:55:44 2022 +0000 add the option to pad the tokens Signed-off-by: Yi Dong <[email protected]> commit 876dc395b43fdeeaa2bcbbe13c76523633764c33 Merge: fbb0f4035 fe3c77ee9 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:20:47 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392 Author: Yi Dong <[email protected]> Date: Wed Oct 12 18:59:49 2022 +0000 fix import again Signed-off-by: Yi Dong <[email protected]> commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 16:00:24 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937 Author: Yi Dong <[email protected]> Date: Wed Oct 12 15:58:32 2022 +0000 enable server Signed-off-by: Yi Dong <[email protected]> commit cbe05d9fbc978f812cfbb671f45f147f300713c4 Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:07:28 2022 +0000 fix comment error Signed-off-by: Yi Dong <[email protected]> commit 1948048922e726ec6131e44b1a745389f18d4ef2 Merge: 232c2cce3 984f5c09a Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:05:30 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 232c2cce34d7a8b902da406706f3dd9b39475091 Merge: 34c8a68df 658243fb6 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:50:00 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f Merge: 658243fb6 3fda5de46 Author: Yi Dong <[email protected]> Date: Wed Oct 12 08:42:11 2022 -0400 Merge branch 'main' into fix_global_init commit 658243fb6580191b5d60edd30cde16dcc23cbb85 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:40:57 2022 +0000 fix import error Signed-off-by: Yi Dong <[email protected]> commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c Author: Yi Dong <[email protected]> Date: Tue Oct 11 22:44:12 2022 +0000 update the fused kernel Signed-off-by: Yi Dong <[email protected]> commit 536cf6bef9447b75843fad630729c47a2fba35f3 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:44:52 2022 -0700 add the missing file Signed-off-by: Yi Dong <[email protected]> commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:43:14 2022 -0700 fix fused softmax Signed-off-by: Yi Dong <[email protected]> commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:16:48 2022 -0700 move global step to base Signed-off-by: Yi Dong <[email protected]> commit 34c8a68df084b18d377e84415d9f07b2cd6673dd Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:50:11 2022 +0000 fix pipeline for eval Signed-off-by: Yi Dong <[email protected]> commit eee5d38218f26660c3ffebe9f615c850c80a1f0d Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:48:22 2022 +0000 fix for pipleline parallel Signed-off-by: Yi Dong <[email protected]> commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d Merge: 125e49947 e3b4c4d1f Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:13 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 125e4994760448ff75dd9328395813eda1c87547 Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:04 2022 +0000 add share option Signed-off-by: Yi Dong <[email protected]> commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368 Author: Yi Dong <[email protected]> Date: Wed Oct 5 11:43:48 2022 -0700 make sure consolidation works Signed-off-by: Yi Dong <[email protected]> commit a5c833964ecf05dc460ca1da69275c4019742150 Merge: 2a07ab52d abcb74be2 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:29 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 2a07ab52d95f15ba666823028c69e23825666c05 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:23 2022 +0000 added requirement Signed-off-by: Yi Dong <[email protected]> commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:39:42 2022 +0000 added a simple web server Signed-off-by: Yi Dong <[email protected]> commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b Author: Yi Dong <[email protected]> Date: Wed Oct 5 06:54:12 2022 -0700 fix empty val loss Signed-off-by: Yi Dong <[email protected]> commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26 Author: Yi Dong <[email protected]> Date: Tue Oct 4 19:25:30 2022 -0700 text gen working Signed-off-by: Yi Dong <[email protected]> commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba Author: Yi Dong <[email protected]> Date: Tue Oct 4 16:08:40 2022 -0700 first change Signed-off-by: Yi Dong <[email protected]> commit 59d077585e6962a669b824af58f64e8a0bea6547 Author: Yi Dong <[email protected]> Date: Tue Oct 4 15:00:40 2022 -0700 revert Signed-off-by: Yi Dong <[email protected]> commit 12a0f3902d99e9179403644bd951c045df716ca7 Author: Yi Dong <[email protected]> Date: Tue Oct 4 21:26:23 2022 +0000 init imp Signed-off-by: Yi Dong <[email protected]> commit 62a15dfd943cc48be495ac61b9f2f00995775c5f Merge: 82c90d2cd e0cc6b767 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:58:26 2022 -0700 Merge branch 'main' into universal_prompt commit 82c90d2cd0fd156f16a4b899f8c741d598f33990 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:17:13 2022 -0700 add sync Signed-off-by: Yi Dong <[email protected]> commit 9819b703eef877d90cd1257bf3610c69de9b4d7e Author: Yi Dong <[email protected]> Date: Sun Oct 2 17:52:34 2022 -0700 fix save model Signed-off-by: root <[email protected]> commit e4937e2fc5fb7d70754c97668416e4a69c3079fe Author: Yi Dong <[email protected]> Date: Sat Oct 1 18:56:09 2022 +0000 working Signed-off-by: Yi Dong <[email protected]> commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b Author: Yi Dong <[email protected]> Date: Sat Oct 1 17:34:03 2022 +0000 calcuate the mask Signed-off-by: Yi Dong <[email protected]> commit 9db3bc13eb65a94a475b837603351da68e3745bc Author: Yi Dong <[email protected]> Date: Fri Sep 30 23:26:32 2022 +0000 fix bug in datasets Signed-off-by: Yi Dong <[email protected]> commit f289900375d4412f53f8110be00fec6587627550 Author: Yi Dong <[email protected]> Date: Fri Sep 30 22:29:40 2022 +0000 update the code Signed-off-by: Yi Dong <[email protected]> commit 8e28a1f208aabaab72dbe769e72756baada04d99 Author: Yi Dong <[email protected]> Date: Fri Sep 30 21:52:52 2022 +0000 added new ds Signed-off-by: Yi Dong <[email protected]> commit 8d41315bab7ce90e200a8a7d1023c34f8e046897 Author: Yi Dong <[email protected]> Date: Fri Sep 30 18:57:09 2022 +0000 added new files Signed-off-by: Yi Dong <[email protected]> commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f Merge: cbe8b7ab1 fa6cd8588 Author: Yi Dong <[email protected]> Date: Thu Sep 29 21:43:29 2022 +0000 Merge branch 'llm-prompt-learning-improvements' into universal_prompt commit fa6cd858839277939446afe7275976078d54c512 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Sep 29 16:47:30 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0 Merge: 7d6d46742 8d670bc77 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 09:43:27 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 7d6d46742170a66758287a207d67e1b1bfd15613 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 16:42:43 2022 +0000 Removed inference step and added sentence peice check to predict step Signed-off-by: Virginia Adams <[email protected]> commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 15:26:32 2022 +0000 fixed first stage check for pipeline parallel T5 pt Signed-off-by: Virginia Adams <[email protected]> commit 3637be2b258c8d9028856f9971edb7da4a8121f0 Merge: a3ea722fd 986a76612 Author: Virginia Adams <[email protected]> Date: Wed Sep 28 10:23:30 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4 Merge: 770967a52 971485ce7 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 13:35:52 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 770967a5251a474b6dcc2d44bf9a2076adbcb604 Merge: d23bf6c30 e3ac280a8 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:17:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit d23bf6c30acc0e3f6af9b4e24547669866a34d62 Merge: de6a31651 333d2b749 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:05:16 2022 -0700 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit de6a31651e63d88a42b971794d93f18ff5a3cdff Author: Virginia Adams <[email protected]> Date: Mon Sep 26 17:00:53 2022 +0000 Updated PP check to be on first stage pipeline only Signed-off-by: Virginia Adams <[email protected]> commit 333d2b7498e6742ce66436f733c980a74616900c Merge: 592c0986a a39fc925a Author: Virginia Adams <[email protected]> Date: Fri Sep 23 16:11:21 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 592c0986a476a91b57b8605d7b70830d7acfa021 Author: Virginia Adams <[email protected]> Date: Fri Sep 23 23:08:41 2022 +0000 Fixed unused import and CI test bug Signed-off-by: Virginia Adams <[email protected]> commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455 Merge: ce4b72c8c b566c2d0e Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:25 2022 +0000 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit ce4b72c8c52f32be336e323dd78a38089edc3e7c Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:16 2022 +0000 Switch to import from base class Signed-off-by: Virginia Adams <[email protected]> commit b566c2d0e35a068f758fd1310bc620a47be4590b Merge: 6621f2854 e872061ac Author: Virginia Adams <[email protected]> Date: Fri Sep 23 10:09:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 6621f28543828a48484a5637f6c9f3ccb23a5b02 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:47:35 2022 +0000 python format fix Signed-off-by: Virginia Adams <[email protected]> commit 8deafc8987b6af5f7b99a250310f57a40198c37f Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:28:02 2022 +0000 Save .nemo on new best val score Signed-off-by: Virginia Adams <[email protected]> commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 18:03:19 2022 +0000 Added automatic checkpoint to nemo file method Signed-off-by: Virginia Adams <[email protected]> commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b Author: Virginia Adams <[email protected]> Date: Wed Sep 14 02:11:56 2022 +0000 Make GPT use base prompt learning model class: Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yi Dong <[email protected]> * fix LGTM Signed-off-by: Yi Dong <[email protected]> * fix validation Signed-off-by: Yi Dong <[email protected]> * change for the lm eval Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * make text generation work in data parallel environment Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * implement the service with rest service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * surpress log Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Update config Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Restore function needed for NMT Signed-off-by: MaximumEntropy <[email protected]> * handles no answer only Signed-off-by: Yi Dong <[email protected]> * Fix config Signed-off-by: MaximumEntropy <[email protected]> * added knn to web Signed-off-by: Yi Dong <[email protected]> * fix lgtm.com comments Signed-off-by: Yi Dong <[email protected]> * output the retrieved context Signed-off-by: Yi Dong <[email protected]> * allow no neighbor query Signed-off-by: Yi Dong <[email protected]> * remove the imports Signed-off-by: Yi Dong <[email protected]> * warn only once Signed-off-by: Yi Dong <[email protected]> * Change output file format from JSON to JSONL Signed-off-by: MaximumEntropy <[email protected]> * new t0 dataset Signed-off-by: Yi Dong <[email protected]> * Add T0 data preproc scripts Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Merge and multiprocessing Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix for is_correct Signed-off-by: MaximumEntropy <[email protected]> * fix epoch > 2 Signed-off-by: Yi Dong <[email protected]> * handles multiple dataloader Signed-off-by: Yi Dong <[email protected]> * remove template Signed-off-by: Yi Dong <[email protected]> * Refactor T0 dataset Signed-off-by: MaximumEntropy <[email protected]> * Add script to merge train folder into individual training files to minimize number of blends Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added on the fly service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add combo instance Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added combo service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * send weights back to server Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix index store Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Minor changes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add reset button Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add add eos Signed-off-by: Yi Dong <[email protected]> * use a seperate bert service Signed-off-by: Yi Dong <[email protected]> * no loss of accuracy Signed-off-by: Yi Dong <[email protected]> * pin the gradio version Signed-off-by: Yi Dong <[email protected]> * Remove bin compat Signed-off-by: MaximumEntropy <[email protected]> * Fix header lines Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * evaluate based on text generation Signed-off-by: Yi Dong <[email protected]> * exact match result aggregation Signed-off-by: Yi Dong <[email protected]> * working SP and SA Signed-off-by: Yi Dong <[email protected]> * sync Signed-off-by: Yi Dong <[email protected]> * fix checkpoint Signed-off-by: Yi Dong <[email protected]> * fix eval Signed-off-by: Yi Dong <[email protected]> * backup states Signed-off-by: Yi Dong <[email protected]> * backup states reset Signed-off-by: Yi Dong <[email protected]> * fix the bug Signed-off-by: Yi Dong <[email protected]> * fix evaluation for sentence piece Signed-off-by: Yi Dong <[email protected]> * fix a bug Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * potential fix in the future Signed-off-by: Yi Dong <[email protected]> * remove the universal codes Signed-off-by: Yi Dong <[email protected]> * remove universal strategy Signed-off-by: Yi Dong <[email protected]> * address reviewer comment Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add align function docstrings and make most args optional Signed-off-by: Elena Rastorgueva <[email protected]> * Remove redundant returns of viterbi and log probs matrices Signed-off-by: Elena Rastorgueva <[email protected]> * Rename h# to <initial_silence> Signed-off-by: Elena Rastorgueva <[email protected]> * Update manifest format description in README Signed-off-by: Elena Rastorgueva <[email protected]> * always remove any spaces from utt_id Signed-off-by: Elena Rastorgueva <[email protected]> * Patch the hanging of threads on very large stderr (#5589) (#5590) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * O2 style amp for gpt3 ptuning (#5246) * enable amp o2 plugin Signed-off-by: Jimmy Zhang <[email protected]> * only create master param if param requires gradient Signed-off-by: Jimmy Zhang <[email protected]> * remove pytorch autocast Signed-off-by: Jimmy Zhang <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Jimmy Zhang <[email protected]> * Update optimizer_with_main_params.py Signed-off-by: JimmyZhang12 <[email protected]> * create master grad only if param group requires grad Signed-off-by: Jimmy Zhang <[email protected]> * fix grad scaler for pp > 1 Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: JimmyZhang12 <[email protected]> Co-authored-by: Jimmy Zhang <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Better patch hydra (#5591) (#5592) * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Yet another fix with hydra multirun (#5594) (#5595) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add RETRO model documentation (#5578) * added retro doc Signed-off-by: Yi Dong <[email protected]> * finish data part Signed-off-by: Yi Dong <[email protected]> * added the data format Signed-off-by: Yi Dong <[email protected]> * added training script Signed-off-by: Yi Dong <[email protected]> * added training and evaluation steps Signed-off-by: Yi Dong <[email protected]> * edit the text Signed-off-by: Yi Dong <[email protected]> * added the images Signed-off-by: Yi Dong <[email protected]> * fix beginning Signed-off-by: Yi Dong <[email protected]> * fix the grammar Signed-off-by: Yi Dong <[email protected]> * trim it down Signed-off-by: Yi Dong <[email protected]> * add wandb option Signed-off-by: Yi Dong <[email protected]> * add reference Signed-off-by: Yi Dong <[email protected]> * fix path Signed-off-by: Yi Dong <[email protected]> * added the parameters table Signed-off-by: Yi Dong <[email protected]> * fix section Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix: setup_multiple validation/test data (#5585) Fix: setup_multiple validation/test data (#5585) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move to optimizer based EMA implementation (#5169) * Move to optimizer Signed-off-by: SeanNaren <[email protected]> * Fix replacing weights Signed-off-by: SeanNaren <[email protected]> * Allow swapping of weights be optional Signed-off-by: SeanNaren <[email protected]> * Save 2 models Signed-off-by: SeanNaren <[email protected]> * Use different hook Signed-off-by: SeanNaren <[email protected]> * Expose cpu device Signed-off-by: SeanNaren <[email protected]> * Add clause to see if this fixes issue with O2 optimizer Signed-off-by: SeanNaren <[email protected]> * Try to get O2 working Signed-off-by: SeanNaren <[email protected]> * WIP Signed-off-by: SeanNaren <[email protected]> * Fixes Signed-off-by: SeanNaren <[email protected]> * Fixes to tests Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Remove import Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Add comment Signed-off-by: SeanNaren <[email protected]> * Remove overwrite Signed-off-by: SeanNaren <[email protected]> * Add BatchNorm, currently tests fail Signed-off-by: SeanNaren <[email protected]> * Fix tests/functionality for batch norm Signed-off-by: SeanNaren <[email protected]> * Get rid of NLP changes Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * AIStore for ASR datasets (#5462) AIStore for ASR datasets Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add support for MHA adapters to ASR (#5396) * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * Temporary fixes to new signature of mixin Signed-off-by: smajumdar <[email protected]> * Add adapter util for constants, add all mha adapters. Signed-off-by: smajumdar <[email protected]> * Update name of function Signed-off-by: smajumdar <[email protected]> * Roll back changes to convASR Signed-off-by: smajumdar <[email protected]> * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * First draft of Conformer support for MHA attention Signed-off-by: smajumdar <[email protected]> * Add some preliminary tests Signed-off-by: smajumdar <[email protected]> * Add support for projection of the hidden dimension for attention Signed-off-by: smajumdar <[email protected]> * Add support for squeezeformer Signed-off-by: smajumdar <[email protected]> * Update train adapter config Signed-off-by: smajumdar <[email protected]> * Add tests for squeezeformer and unit tests for new modules Signed-off-by: smajumdar <[email protected]> * Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path Signed-off-by: smajumdar <[email protected]> * Update location of adapters Signed-off-by: smajumdar <[email protected]> * Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v Signed-off-by: smajumdar <[email protected]> * Update expmanager to clean up checkpoints Signed-off-by: smajumdar <[email protected]> * Fix style Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update training scripts Signed-off-by: smajumdar <[email protected]> * Update config and docs Signed-off-by: smajumdar <[email protected]> * Expose nemo delete function Signed-off-by: smajumdar <[email protected]> * Correct adapter partial state saving Signed-off-by: smajumdar <[email protected]> * Correct a bug with state management of adapter tokens Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Pull down EMA test Signed-off-by: smajumdar <[email protected]> * Correct name of adapter module utility class Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Co-authored-by: Jocelyn <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Create separator parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Call align function with hydra config Signed-off-by: Elena Rastorgueva <[email protected]> * update usage example Signed-off-by: Elena Rastorgueva <[email protected]> * Update Dockerfile (#5614) (#5616) Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future. Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Co-authored-by: He Huang (Steve) <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate pretrained_name and model_path parameters Signed-off-by: Elena Rastorgueva <[email protected]> * make "optional" tags bold in markdown Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move non-main functions to utils dir Signed-off-by: Elena Rastorgueva <[email protected]> * Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615) Signed-off-by: Ante Jukić <[email protected]> Co-authored-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] fix ranges of char set for accented letters. (#5607) * [TTS] fix ranges of char set for accented letters. * remove digits pattern and added unit tests for math operators. Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Change success message to reduce confusion (#5621) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update documentation and tutorials for Adapters (#5610) * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update test Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Rename test file Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602) * [TTS] add type hints and change variable names for tokenizers and g2p Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * 1. Added missing import for gather_objects. (#5627) Signed-off-by: Micha Livne <[email protected]> Signed-off-by: Micha Livne <[email protected]> Co-authored-by: Micha Livne <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fixed RadTTS unit test (#5572) Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove tests (#5633) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Conformer local attention (#5525) * local attn and merge Signed-off-by: sam1373 <[email protected]> * optional Signed-off-by: sam1373 <[email protected]> * override Signed-off-by: sam1373 <[email protected]> * incorporate comments Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> * fix Signed-off-by: sam1373 <[email protected]> * comment Signed-off-by: sam1373 <[email protected]> * changes, test Signed-off-by: sam1373 <[email protected]> * changes Signed-off-by: sam1373 <[email protected]> * check att context Signed-off-by: sam1373 <[email protected]> * readme link Signed-off-by: sam1373 <[email protected]> * utils Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> Signed-off-by: sam1373 <[email protected]> Signed-off-by: Samuel Kriman <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add core classes and functions for online clustering diarizer part 1 (#5526) * Add core classes and functions for online clustering diarizer Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add audio to labels code Signed-off-by: Taejin Park <[email protected]> * resolve type errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added unit=tests for very short audio Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Filled all missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved conflict and added missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixed unit-test errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix the wrongly added file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * Fix wrongly included file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * resolve code quality issue Signed-off-by: Taejin Park <[email protected]> * Fixed unit-test errors and bugs Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * changed total_sec for offline_clustering toy_data in unit-tests Signed-off-by: Taejin Park <[email protected]> * fixed merging index offset bug Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * only including part 1 files Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * removed unused function Signed-off-by: Taejin Park <[email protected]> * fixed unused imports Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * divided nmesc_clustering.py into two and reflected first-pass comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * adding offline/online_clustering.py Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix code QL autocomment Signed-off-by: Taejin Park <[email protected]> * Removed unused imports Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/online_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Reflected comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved code scanning issue Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/offline_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641) * add stt_eo_conformer_ctc_large model * stt_eo_conformer_transducer_large Co-authored-by: Andrei Andrusenko <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Removed unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Specify that filepaths need to be absolute Signed-off-by: Elena Rastorgueva <[email protected]> * replaces any spaces in utt_id with dashes Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make hydra script callable by another script Signed-off-by: Elena Rastorgueva <[email protected]> * do not specify default model or model_downsample_factor Signed-off-by: Elena Rastorgueva <[email protected]> * [Dockerfile] Remove AIS archive from docker image (#5629) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Measure audio_sr from audio instead of needing to specify Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541) * Chinese TTS replaces default pypinyin dict * Add jieba word segmenter as an option Signed-off-by: Yuekai Zhang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate parameters for device of transcription and viterbi steps Signed-off-by: Elena Rastorgueva <[email protected]> * Add mention of gecko Signed-off-by: Elena Rastorgueva <[email protected]> * [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [Add] ASR+VAD Inference Pipeline (#5575) Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets. Signed-off-by: stevehuang52 <[email protected]> Co-authored-by: fayejf <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * rename separator to ctm_grouping_separator and refactor Signed-off-by: Elena Rastorgueva <[email protected]> * Bert interleaved (#5556) * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * Update bert_model.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * Adding tests * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * Fix bug fix sequence parallel and Interleaved * Fix bug fix sequence parallel and Interleaved Signed-off-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add duration padding support for RADTTS inference (#5650) * Added duration padding support for RADTTS inference * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Kevin Shih <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Add remove_blank_tokens_from_ctm parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Dont save initial_silence line in CTM Signed-off-by: Elena Rastorgueva <[email protected]> * Add DLLogger support to exp_manager (#5658) * Add DLLogger support to exp_manager Signed-off-by: Alexandre Milesi <[email protected]> * Move dllogger to separate file and check import Signed-off-by: Alexandre Milesi <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Remove unused import Signed-off-by: Alexandre Milesi <[email protected]> Signed-off-by: Alexandre Milesi <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add minimum_timestamp_duration parameter Signed-off-by: Elena Rastorgueva <[email protected]> * add suggestion about removing blanks to README Signed-off-by: Elena Rastorgueva <[email protected]> * reorder args Signed-off-by: Elena Rastorgueva <[email protected]> * clarify description of ctm_grouping_separator in README Signed-off-by: Elena Rastorgueva <[email protected]> * update docstring Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Port stateless timer to exp manager (#5584) * Port stateless timer to exp manager Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes and remove from all megatron code Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Change message Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix EMA restart by allowing device to be set by the class init (#5668) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP (moved to separate repo) - merge to main (#5630) * Remove sdp files from tools folder Signed-off-by: Elena Rastorgueva <[email protected]> * Add page to docs with new SDP location Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add interface for making amax reduction optional for FP8 (#5447) * add TE interface for making amax reduction optional Signed-off-by: Kirthi Shankar Sivamani <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Kirthi Shankar Sivamani <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add tts dict cust notebook (#5662) * add tts dict cust notebook Signed-off-by: ekmb <[email protected]> * review Signed-off-by: ekmb <[email protected]> * fixed audio links Signed-off-by: ekmb <[email protected]> * remove old notebook Signed-off-by: ekmb <[email protected]> * fix typo Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [ASR] Audio processing base, multi-channel enhancement models (#5356) * Audio processing base model, enc-mask-dec enhancement, tests and modules Signed-off-by: Ante Jukić <[email protected]> * Addressed review comments Signed-off-by: Ante Jukić <[email protected]> * Fixed CodeQL warnings Signed-off-by: Ante Jukić <[email protected]> * Addressed PR comments Signed-off-by: Ante Jukić <[email protected]> * Addressed PR comments: - renamed AudioProcessingModel to AudioToAudioModel - various small modifications - updated unit tests Signed-off-by: Ante Jukić <[email protected]> * Addressed comments - Moved spectrogram to audio_preprocessing - Renamed MultichannelFeatures - Updated config and unit tests Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Expose ClusteringDiarizer device (#5681) * Expose device for users to set Signed-off-by: SeanNaren <[email protected]> * Expose device for users to set Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add Beam Search support to ASR transcribe() (#5443) * Add support for beam decoding via high level API. Signed-off-by: smajumdar <[email protected]> * Add ctc decoding section Signed-off-by: smajumdar <[email protected]> * Update ctc transcribe API to return results from beam search Signed-off-by: smajumdar <[email protected]> * Add argument to preserve arpa file Signed-off-by: smajumdar <[email protected]> * Update script to use hydra config, add some support for future compute timesteps, add doc for ctc decoding Signed-off-by: smajumdar <[email protected]> * Update eval script and doc to use new API Signed-off-by: smajumdar <[email protected]> * Add tests for ctc greedy decoding Signed-off-by: smajumdar <[email protected]> * Address reviewer comments and add docstrings Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix changes and address comments Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: Samuel Kriman <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Propagate attention_dropout flag for GPT-3 (#5669) * Propagate attention_dropout flag for GPT-3 Signed-off-by: Mikołaj Błaż <[email protected]> * Add default to megatron_gpt_config Signed-off-by: Mikołaj Błaż <[email protected]> Signed-off-by: Mikołaj Błaż <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <complex451@gmail…

* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <[email protected]> * remove cmudict downloading Signed-off-by: Oktai Tatanov <[email protected]> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <[email protected]> * add libritts processing Signed-off-by: Oktai Tatanov <[email protected]> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <[email protected]> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <[email protected]> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <[email protected]> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <[email protected]> * fix optimizers Signed-off-by: Oktai Tatanov <[email protected]> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <[email protected]> Co-authored-by: Ryan Hong <[email protected]> Co-authored-by: [email protected] <[email protected]> Co-authored-by: Jason <[email protected]> Signed-off-by: Jason <[email protected]> * make new commit Signed-off-by: Jason <[email protected]> * add copyright headers Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * rename README Signed-off-by: Oktai Tatanov <[email protected]> * fix style without vits_modules Signed-off-by: Oktai Tatanov <[email protected]> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <[email protected]> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <[email protected]> * update package info Signed-off-by: ericharper <[email protected]> * new exp * update branch Signed-off-by: ericharper <[email protected]> * Restored tests previously disabled for 22.03 base (#4109) Signed-off-by: Boris Fomitchev <[email protected]> * add augmentation to label models (#4113) * add augmentation to label models Signed-off-by: nithinraok <[email protected]> * duration fix Signed-off-by: nithinraok <[email protected]> * Call register_bert_model after assigning self.bert_model variable (#4116) Signed-off-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> * Tutorial on ITN with Thutmose tagger and small fixes (#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <[email protected]> * fixes for code review Signed-off-by: Alexandra Antonova <[email protected]> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <[email protected]> * fix typo Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <[email protected]> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <[email protected]> * update the default (#4135) Signed-off-by: ekmb <[email protected]> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <[email protected]> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <[email protected]> * Fix typo Signed-off-by: PeganovAnton <[email protected]> * Fix typo Signed-off-by: PeganovAnton <[email protected]> * Do not create tmp directory Signed-off-by: PeganovAnton <[email protected]> * Fix parameter name Signed-off-by: PeganovAnton <[email protected]> * finish cherry-pick op Signed-off-by: PeganovAnton <[email protected]> * Fix labels errors Signed-off-by: PeganovAnton <[email protected]> * Remove duplicate stage Signed-off-by: PeganovAnton <[email protected]> * Change target branch Signed-off-by: PeganovAnton <[email protected]> * fix typo (#4140) Signed-off-by: Yang Zhang <[email protected]> * Fix/punctuation avoid overwritting tmp files (#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <[email protected]> * Remove accidental changes Signed-off-by: PeganovAnton <[email protected]> * Remove accidental changes Signed-off-by: PeganovAnton <[email protected]> * Use built-in tempfile library Signed-off-by: PeganovAnton <[email protected]> * Fix code style Signed-off-by: PeganovAnton <[email protected]> * bug_fix_diarization_manifest_creation (#4125) Signed-off-by: Yang Zhang <[email protected]> Co-authored-by: Nithin Rao <[email protected]> * fix doc (#4146) Signed-off-by: Yang Zhang <[email protected]> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <[email protected]> * Fix for new TTSDataset class Signed-off-by: treacker <[email protected]> * added wandb logging Signed-off-by: treacker <[email protected]> * added wandb logging Signed-off-by: treacker <[email protected]> * fix numpy version Signed-off-by: treacker <[email protected]> * fix numpy version Signed-off-by: treacker <[email protected]> * inference fix Signed-off-by: treacker <[email protected]> * removed old code Signed-off-by: treacker <[email protected]> * updated parser logic Signed-off-by: treacker <[email protected]> * reverted version update Signed-off-by: treacker <[email protected]> * refactored parser logic Signed-off-by: treacker <[email protected]> * Updated Jenkinsfile Signed-off-by: treacker <[email protected]> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <[email protected]> * Made backward compatibility Signed-off-by: treacker <[email protected]> * Made backward compatibility Signed-off-by: treacker <[email protected]> * Update Jenkinsfile Signed-off-by: treacker <[email protected]> * Update tacotron.yaml Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <[email protected]> * fix typo Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: treacker <[email protected]> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <[email protected]> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Fixed jenkins Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <[email protected]> * start fix Signed-off-by: nithinraok <[email protected]> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <[email protected]> * updated scp to filelist Signed-off-by: nithinraok <[email protected]> * WaveGlow input type fixes (#4151) Signed-off-by: Jocelyn Huang <[email protected]> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <[email protected]> * remove extremely unreliable links Signed-off-by: fayejf <[email protected]> * Thutmose tagger bug fixes (#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <[email protected]> * fix model location Signed-off-by: Alexandra Antonova <[email protected]> * fix model location Signed-off-by: Alexandra Antonova <[email protected]> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <[email protected]> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <[email protected]> * changed model location to nemo Signed-off-by: Alexandra Antonova <[email protected]> * style fix Signed-off-by: Alexandra Antonova <[email protected]> * fix space Signed-off-by: Alexandra Antonova <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <[email protected]> * chunks -> segments Signed-off-by: nithinraok <[email protected]> * Khz -> kHz Signed-off-by: nithinraok <[email protected]> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <[email protected]> * Fix link to inference notebook (#5247) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> * Update ASR scores table (#5254) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * Fix links to speaker identification notebook (#5260) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> * Minor typo fixes in TTS tutorial (#5266) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> * Pcla tutorial fixes (#5271) * Fixed typos Signed-off-by: Matvei Novikov <[email protected]> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <[email protected]> * Fixed typo Signed-off-by: Matvei Novikov <[email protected]> * Fixed branch variable Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Fix bug into Dialogue tutorial (#5277) * Typo fix (#5288) Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Fix dialogue tutorial bug (#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (#5310) * typo fix Signed-off-by: fayejf <[email protected]> * udpate transcribe Signed-off-by: fayejf <[email protected]> Signed-off-by: fayejf <[email protected]> * Add italian model checkpoints (#5316) Signed-off-by: Igor Gitman <[email protected]> Signed-off-by: Igor Gitman <[email protected]> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <[email protected]> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5313) * fixes Signed-off-by: Matvei Novikov <[email protected]> * fixes Signed-off-by: Matvei Novikov <[email protected]> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (#5359) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * tutorial fixes (#5354) Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Add SDP documentation (#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <[email protected]> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <[email protected]> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <[email protected]> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <[email protected]> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <[email protected]> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <[email protected]> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <[email protected]> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <[email protected]> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <[email protected]> * Make config example less confusing Signed-off-by: Elena Rastorgueva <[email protected]> * Fix typo Signed-off-by: Elena Rastorgueva <[email protected]> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused imports Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <[email protected]> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Co-authored-by: Igor Gitman <[email protected]> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <[email protected]> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) * Fixed bug in notebook (#5382) Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Virginia Adams <[email protected]> * Force MHA QKV onto fp32 (#5391) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * Added scheduling variety * ref * Fix for prompt table restore error (#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <[email protected]> * Added more saftey checks Signed-off-by: Virginia Adams <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (#5421) Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * disable pc test (#5426) Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> * Fix GPT generation when using sentencepiece tokenizer (#5413) * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <[email protected]> * Fix in config Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (#5242) (#5446) Signed-off-by: Vahid <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> * Set sync_batch_comm in other places (#5448) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 plus (#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Add num layers check (#5470) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Change to kwargs (#5475) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * minor bug fix (#5521) Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * Remove broadcast (#5558) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * cleaning * Fix all gather while writing to a file during T5 finetuning (#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <[email protected]> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <[email protected]> * storing * updated version Signed-off-by: Evgeniy Shabalin <[email protected]> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <[email protected]> * fixed issues Signed-off-by: Evgeniy Shabalin <[email protected]> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <[email protected]> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <[email protected]> * remove old cython code Signed-off-by: Evgeniy Shabalin <[email protected]> * remove old cython code Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <[email protected]> * Enhancements Signed-off-by: Evgeniy Shabalin <[email protected]> * Enhancements Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <[email protected]> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <[email protected]> * typecheck fix Signed-off-by: Evgeniy Shabalin <[email protected]> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <[email protected]> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <[email protected]> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <[email protected]> Signed-off-by: Jason <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: nithinraok <[email protected]> Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: PeganovAnton <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: fayejf <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Igor Gitman <[email protected]> Signed-off-by: Sasha Meister <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yu Yao <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Evgeniy Shabalin <[email protected]> Co-authored-by: jasonjjl1999 <[email protected]> Co-authored-by: [email protected] <[email protected]> Co-authored-by: Oktai Tatanov <[email protected]> Co-authored-by: jasonjjl1999 <[email protected]> Co-authored-by: martynwei <[email protected]> Co-authored-by: Ryan Hong <[email protected]> Co-authored-by: Jason <[email protected]> Co-authored-by: ericharper <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: bene-ges <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Evelina <[email protected]> Co-authored-by: PeganovAnton <[email protected]> Co-authored-by: Jocelyn <[email protected]> Co-authored-by: fayejf <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Sean Naren <[email protected]> Co-authored-by: Matvei Novikov <[email protected]> Co-authored-by: Zhilin Wang <[email protected]> Co-authored-by: Igor Gitman <[email protected]> Co-authored-by: Sasha Meister <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: Taejin Park <[email protected]> Co-authored-by: Adi Renduchintala <[email protected]> Co-authored-by: Virginia Adams <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Vladimir Bataev <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Co-authored-by: Xuesong Yang <[email protected]>

* Merge r1.13.0 main (#5570) * update branch Signed-off-by: ericharper <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Optimized loop and bugfix in SDE (#5573) - Fixed bug with loading custom data attributes from JSON in Speech Data Explorer Signed-off-by: George Zelenfroynd <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update torchmetrics (#5566) * add task arg Signed-off-by: nithinraok <[email protected]> * update state Signed-off-by: nithinraok <[email protected]> Signed-off-by: nithinraok <[email protected]> Co-authored-by: Taejin Park <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove useless files. (#5580) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add initial NFA code Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make use of the specified device during viterbi decoding Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL notes Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL warning Signed-off-by: Elena Rastorgueva <[email protected]> * Add an option to defer data setup from ``__init__`` to ``setup`` (#5569) * Add an option to defer dataloader setup from __init__ to setup Signed-off-by: Ante Jukić <[email protected]> * Updated doc Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make utt_id specified by number of parts of audio_filepath user wishes to use Signed-off-by: Elena Rastorgueva <[email protected]> * remove audio_sr TODO - reduce risk of silent bugs Signed-off-by: Elena Rastorgueva <[email protected]> * Add check that model is CTC Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Text generation improvement (UI client, data parallel support) (#5437) * Squashed commit of the following: commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 15:07:42 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 35b424044fe80c3081e7756ab21244f701716f7e Author: Yi Dong <[email protected]> Date: Thu Oct 13 08:04:49 2022 -0700 get rid of base Signed-off-by: Yi Dong <[email protected]> commit 2955210e2311791543538cfbb5ad26b79414c954 Merge: d52edef8c eaf6757ca Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:17:02 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit d52edef8cd7b36593838fb270047e80f8ccb652e Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:16:24 2022 +0000 align with main Signed-off-by: Yi Dong <[email protected]> commit eaf6757ca5be8e099492f57c81d984429b0ad49c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 13:12:11 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:10:58 2022 +0000 same as main Signed-off-by: Yi Dong <[email protected]> commit e335de51bcc0d681c58b568c3d8c238bc5687c3b Merge: c231086e0 4463a9fe9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:08:09 2022 +0000 Merge branch 'main' into universal_prompt commit c231086e057f1efaa915f691d84664cb3d5aad85 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 19:59:12 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:56:17 2022 +0000 default to pad Signed-off-by: Yi Dong <[email protected]> commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:55:44 2022 +0000 add the option to pad the tokens Signed-off-by: Yi Dong <[email protected]> commit 876dc395b43fdeeaa2bcbbe13c76523633764c33 Merge: fbb0f4035 fe3c77ee9 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:20:47 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392 Author: Yi Dong <[email protected]> Date: Wed Oct 12 18:59:49 2022 +0000 fix import again Signed-off-by: Yi Dong <[email protected]> commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 16:00:24 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937 Author: Yi Dong <[email protected]> Date: Wed Oct 12 15:58:32 2022 +0000 enable server Signed-off-by: Yi Dong <[email protected]> commit cbe05d9fbc978f812cfbb671f45f147f300713c4 Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:07:28 2022 +0000 fix comment error Signed-off-by: Yi Dong <[email protected]> commit 1948048922e726ec6131e44b1a745389f18d4ef2 Merge: 232c2cce3 984f5c09a Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:05:30 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 232c2cce34d7a8b902da406706f3dd9b39475091 Merge: 34c8a68df 658243fb6 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:50:00 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f Merge: 658243fb6 3fda5de46 Author: Yi Dong <[email protected]> Date: Wed Oct 12 08:42:11 2022 -0400 Merge branch 'main' into fix_global_init commit 658243fb6580191b5d60edd30cde16dcc23cbb85 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:40:57 2022 +0000 fix import error Signed-off-by: Yi Dong <[email protected]> commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c Author: Yi Dong <[email protected]> Date: Tue Oct 11 22:44:12 2022 +0000 update the fused kernel Signed-off-by: Yi Dong <[email protected]> commit 536cf6bef9447b75843fad630729c47a2fba35f3 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:44:52 2022 -0700 add the missing file Signed-off-by: Yi Dong <[email protected]> commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:43:14 2022 -0700 fix fused softmax Signed-off-by: Yi Dong <[email protected]> commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:16:48 2022 -0700 move global step to base Signed-off-by: Yi Dong <[email protected]> commit 34c8a68df084b18d377e84415d9f07b2cd6673dd Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:50:11 2022 +0000 fix pipeline for eval Signed-off-by: Yi Dong <[email protected]> commit eee5d38218f26660c3ffebe9f615c850c80a1f0d Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:48:22 2022 +0000 fix for pipleline parallel Signed-off-by: Yi Dong <[email protected]> commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d Merge: 125e49947 e3b4c4d1f Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:13 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 125e4994760448ff75dd9328395813eda1c87547 Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:04 2022 +0000 add share option Signed-off-by: Yi Dong <[email protected]> commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368 Author: Yi Dong <[email protected]> Date: Wed Oct 5 11:43:48 2022 -0700 make sure consolidation works Signed-off-by: Yi Dong <[email protected]> commit a5c833964ecf05dc460ca1da69275c4019742150 Merge: 2a07ab52d abcb74be2 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:29 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 2a07ab52d95f15ba666823028c69e23825666c05 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:23 2022 +0000 added requirement Signed-off-by: Yi Dong <[email protected]> commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:39:42 2022 +0000 added a simple web server Signed-off-by: Yi Dong <[email protected]> commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b Author: Yi Dong <[email protected]> Date: Wed Oct 5 06:54:12 2022 -0700 fix empty val loss Signed-off-by: Yi Dong <[email protected]> commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26 Author: Yi Dong <[email protected]> Date: Tue Oct 4 19:25:30 2022 -0700 text gen working Signed-off-by: Yi Dong <[email protected]> commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba Author: Yi Dong <[email protected]> Date: Tue Oct 4 16:08:40 2022 -0700 first change Signed-off-by: Yi Dong <[email protected]> commit 59d077585e6962a669b824af58f64e8a0bea6547 Author: Yi Dong <[email protected]> Date: Tue Oct 4 15:00:40 2022 -0700 revert Signed-off-by: Yi Dong <[email protected]> commit 12a0f3902d99e9179403644bd951c045df716ca7 Author: Yi Dong <[email protected]> Date: Tue Oct 4 21:26:23 2022 +0000 init imp Signed-off-by: Yi Dong <[email protected]> commit 62a15dfd943cc48be495ac61b9f2f00995775c5f Merge: 82c90d2cd e0cc6b767 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:58:26 2022 -0700 Merge branch 'main' into universal_prompt commit 82c90d2cd0fd156f16a4b899f8c741d598f33990 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:17:13 2022 -0700 add sync Signed-off-by: Yi Dong <[email protected]> commit 9819b703eef877d90cd1257bf3610c69de9b4d7e Author: Yi Dong <[email protected]> Date: Sun Oct 2 17:52:34 2022 -0700 fix save model Signed-off-by: root <[email protected]> commit e4937e2fc5fb7d70754c97668416e4a69c3079fe Author: Yi Dong <[email protected]> Date: Sat Oct 1 18:56:09 2022 +0000 working Signed-off-by: Yi Dong <[email protected]> commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b Author: Yi Dong <[email protected]> Date: Sat Oct 1 17:34:03 2022 +0000 calcuate the mask Signed-off-by: Yi Dong <[email protected]> commit 9db3bc13eb65a94a475b837603351da68e3745bc Author: Yi Dong <[email protected]> Date: Fri Sep 30 23:26:32 2022 +0000 fix bug in datasets Signed-off-by: Yi Dong <[email protected]> commit f289900375d4412f53f8110be00fec6587627550 Author: Yi Dong <[email protected]> Date: Fri Sep 30 22:29:40 2022 +0000 update the code Signed-off-by: Yi Dong <[email protected]> commit 8e28a1f208aabaab72dbe769e72756baada04d99 Author: Yi Dong <[email protected]> Date: Fri Sep 30 21:52:52 2022 +0000 added new ds Signed-off-by: Yi Dong <[email protected]> commit 8d41315bab7ce90e200a8a7d1023c34f8e046897 Author: Yi Dong <[email protected]> Date: Fri Sep 30 18:57:09 2022 +0000 added new files Signed-off-by: Yi Dong <[email protected]> commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f Merge: cbe8b7ab1 fa6cd8588 Author: Yi Dong <[email protected]> Date: Thu Sep 29 21:43:29 2022 +0000 Merge branch 'llm-prompt-learning-improvements' into universal_prompt commit fa6cd858839277939446afe7275976078d54c512 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Sep 29 16:47:30 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0 Merge: 7d6d46742 8d670bc77 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 09:43:27 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 7d6d46742170a66758287a207d67e1b1bfd15613 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 16:42:43 2022 +0000 Removed inference step and added sentence peice check to predict step Signed-off-by: Virginia Adams <[email protected]> commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 15:26:32 2022 +0000 fixed first stage check for pipeline parallel T5 pt Signed-off-by: Virginia Adams <[email protected]> commit 3637be2b258c8d9028856f9971edb7da4a8121f0 Merge: a3ea722fd 986a76612 Author: Virginia Adams <[email protected]> Date: Wed Sep 28 10:23:30 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4 Merge: 770967a52 971485ce7 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 13:35:52 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 770967a5251a474b6dcc2d44bf9a2076adbcb604 Merge: d23bf6c30 e3ac280a8 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:17:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit d23bf6c30acc0e3f6af9b4e24547669866a34d62 Merge: de6a31651 333d2b749 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:05:16 2022 -0700 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit de6a31651e63d88a42b971794d93f18ff5a3cdff Author: Virginia Adams <[email protected]> Date: Mon Sep 26 17:00:53 2022 +0000 Updated PP check to be on first stage pipeline only Signed-off-by: Virginia Adams <[email protected]> commit 333d2b7498e6742ce66436f733c980a74616900c Merge: 592c0986a a39fc925a Author: Virginia Adams <[email protected]> Date: Fri Sep 23 16:11:21 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 592c0986a476a91b57b8605d7b70830d7acfa021 Author: Virginia Adams <[email protected]> Date: Fri Sep 23 23:08:41 2022 +0000 Fixed unused import and CI test bug Signed-off-by: Virginia Adams <[email protected]> commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455 Merge: ce4b72c8c b566c2d0e Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:25 2022 +0000 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit ce4b72c8c52f32be336e323dd78a38089edc3e7c Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:16 2022 +0000 Switch to import from base class Signed-off-by: Virginia Adams <[email protected]> commit b566c2d0e35a068f758fd1310bc620a47be4590b Merge: 6621f2854 e872061ac Author: Virginia Adams <[email protected]> Date: Fri Sep 23 10:09:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 6621f28543828a48484a5637f6c9f3ccb23a5b02 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:47:35 2022 +0000 python format fix Signed-off-by: Virginia Adams <[email protected]> commit 8deafc8987b6af5f7b99a250310f57a40198c37f Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:28:02 2022 +0000 Save .nemo on new best val score Signed-off-by: Virginia Adams <[email protected]> commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 18:03:19 2022 +0000 Added automatic checkpoint to nemo file method Signed-off-by: Virginia Adams <[email protected]> commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b Author: Virginia Adams <[email protected]> Date: Wed Sep 14 02:11:56 2022 +0000 Make GPT use base prompt learning model class: Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yi Dong <[email protected]> * fix LGTM Signed-off-by: Yi Dong <[email protected]> * fix validation Signed-off-by: Yi Dong <[email protected]> * change for the lm eval Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * make text generation work in data parallel environment Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * implement the service with rest service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * surpress log Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Update config Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Restore function needed for NMT Signed-off-by: MaximumEntropy <[email protected]> * handles no answer only Signed-off-by: Yi Dong <[email protected]> * Fix config Signed-off-by: MaximumEntropy <[email protected]> * added knn to web Signed-off-by: Yi Dong <[email protected]> * fix lgtm.com comments Signed-off-by: Yi Dong <[email protected]> * output the retrieved context Signed-off-by: Yi Dong <[email protected]> * allow no neighbor query Signed-off-by: Yi Dong <[email protected]> * remove the imports Signed-off-by: Yi Dong <[email protected]> * warn only once Signed-off-by: Yi Dong <[email protected]> * Change output file format from JSON to JSONL Signed-off-by: MaximumEntropy <[email protected]> * new t0 dataset Signed-off-by: Yi Dong <[email protected]> * Add T0 data preproc scripts Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Merge and multiprocessing Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix for is_correct Signed-off-by: MaximumEntropy <[email protected]> * fix epoch > 2 Signed-off-by: Yi Dong <[email protected]> * handles multiple dataloader Signed-off-by: Yi Dong <[email protected]> * remove template Signed-off-by: Yi Dong <[email protected]> * Refactor T0 dataset Signed-off-by: MaximumEntropy <[email protected]> * Add script to merge train folder into individual training files to minimize number of blends Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added on the fly service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add combo instance Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added combo service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * send weights back to server Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix index store Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Minor changes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add reset button Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add add eos Signed-off-by: Yi Dong <[email protected]> * use a seperate bert service Signed-off-by: Yi Dong <[email protected]> * no loss of accuracy Signed-off-by: Yi Dong <[email protected]> * pin the gradio version Signed-off-by: Yi Dong <[email protected]> * Remove bin compat Signed-off-by: MaximumEntropy <[email protected]> * Fix header lines Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * evaluate based on text generation Signed-off-by: Yi Dong <[email protected]> * exact match result aggregation Signed-off-by: Yi Dong <[email protected]> * working SP and SA Signed-off-by: Yi Dong <[email protected]> * sync Signed-off-by: Yi Dong <[email protected]> * fix checkpoint Signed-off-by: Yi Dong <[email protected]> * fix eval Signed-off-by: Yi Dong <[email protected]> * backup states Signed-off-by: Yi Dong <[email protected]> * backup states reset Signed-off-by: Yi Dong <[email protected]> * fix the bug Signed-off-by: Yi Dong <[email protected]> * fix evaluation for sentence piece Signed-off-by: Yi Dong <[email protected]> * fix a bug Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * potential fix in the future Signed-off-by: Yi Dong <[email protected]> * remove the universal codes Signed-off-by: Yi Dong <[email protected]> * remove universal strategy Signed-off-by: Yi Dong <[email protected]> * address reviewer comment Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add align function docstrings and make most args optional Signed-off-by: Elena Rastorgueva <[email protected]> * Remove redundant returns of viterbi and log probs matrices Signed-off-by: Elena Rastorgueva <[email protected]> * Rename h# to <initial_silence> Signed-off-by: Elena Rastorgueva <[email protected]> * Update manifest format description in README Signed-off-by: Elena Rastorgueva <[email protected]> * always remove any spaces from utt_id Signed-off-by: Elena Rastorgueva <[email protected]> * Patch the hanging of threads on very large stderr (#5589) (#5590) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * O2 style amp for gpt3 ptuning (#5246) * enable amp o2 plugin Signed-off-by: Jimmy Zhang <[email protected]> * only create master param if param requires gradient Signed-off-by: Jimmy Zhang <[email protected]> * remove pytorch autocast Signed-off-by: Jimmy Zhang <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Jimmy Zhang <[email protected]> * Update optimizer_with_main_params.py Signed-off-by: JimmyZhang12 <[email protected]> * create master grad only if param group requires grad Signed-off-by: Jimmy Zhang <[email protected]> * fix grad scaler for pp > 1 Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: JimmyZhang12 <[email protected]> Co-authored-by: Jimmy Zhang <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Better patch hydra (#5591) (#5592) * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Yet another fix with hydra multirun (#5594) (#5595) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add RETRO model documentation (#5578) * added retro doc Signed-off-by: Yi Dong <[email protected]> * finish data part Signed-off-by: Yi Dong <[email protected]> * added the data format Signed-off-by: Yi Dong <[email protected]> * added training script Signed-off-by: Yi Dong <[email protected]> * added training and evaluation steps Signed-off-by: Yi Dong <[email protected]> * edit the text Signed-off-by: Yi Dong <[email protected]> * added the images Signed-off-by: Yi Dong <[email protected]> * fix beginning Signed-off-by: Yi Dong <[email protected]> * fix the grammar Signed-off-by: Yi Dong <[email protected]> * trim it down Signed-off-by: Yi Dong <[email protected]> * add wandb option Signed-off-by: Yi Dong <[email protected]> * add reference Signed-off-by: Yi Dong <[email protected]> * fix path Signed-off-by: Yi Dong <[email protected]> * added the parameters table Signed-off-by: Yi Dong <[email protected]> * fix section Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix: setup_multiple validation/test data (#5585) Fix: setup_multiple validation/test data (#5585) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move to optimizer based EMA implementation (#5169) * Move to optimizer Signed-off-by: SeanNaren <[email protected]> * Fix replacing weights Signed-off-by: SeanNaren <[email protected]> * Allow swapping of weights be optional Signed-off-by: SeanNaren <[email protected]> * Save 2 models Signed-off-by: SeanNaren <[email protected]> * Use different hook Signed-off-by: SeanNaren <[email protected]> * Expose cpu device Signed-off-by: SeanNaren <[email protected]> * Add clause to see if this fixes issue with O2 optimizer Signed-off-by: SeanNaren <[email protected]> * Try to get O2 working Signed-off-by: SeanNaren <[email protected]> * WIP Signed-off-by: SeanNaren <[email protected]> * Fixes Signed-off-by: SeanNaren <[email protected]> * Fixes to tests Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Remove import Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Add comment Signed-off-by: SeanNaren <[email protected]> * Remove overwrite Signed-off-by: SeanNaren <[email protected]> * Add BatchNorm, currently tests fail Signed-off-by: SeanNaren <[email protected]> * Fix tests/functionality for batch norm Signed-off-by: SeanNaren <[email protected]> * Get rid of NLP changes Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * AIStore for ASR datasets (#5462) AIStore for ASR datasets Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add support for MHA adapters to ASR (#5396) * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * Temporary fixes to new signature of mixin Signed-off-by: smajumdar <[email protected]> * Add adapter util for constants, add all mha adapters. Signed-off-by: smajumdar <[email protected]> * Update name of function Signed-off-by: smajumdar <[email protected]> * Roll back changes to convASR Signed-off-by: smajumdar <[email protected]> * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * First draft of Conformer support for MHA attention Signed-off-by: smajumdar <[email protected]> * Add some preliminary tests Signed-off-by: smajumdar <[email protected]> * Add support for projection of the hidden dimension for attention Signed-off-by: smajumdar <[email protected]> * Add support for squeezeformer Signed-off-by: smajumdar <[email protected]> * Update train adapter config Signed-off-by: smajumdar <[email protected]> * Add tests for squeezeformer and unit tests for new modules Signed-off-by: smajumdar <[email protected]> * Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path Signed-off-by: smajumdar <[email protected]> * Update location of adapters Signed-off-by: smajumdar <[email protected]> * Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v Signed-off-by: smajumdar <[email protected]> * Update expmanager to clean up checkpoints Signed-off-by: smajumdar <[email protected]> * Fix style Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update training scripts Signed-off-by: smajumdar <[email protected]> * Update config and docs Signed-off-by: smajumdar <[email protected]> * Expose nemo delete function Signed-off-by: smajumdar <[email protected]> * Correct adapter partial state saving Signed-off-by: smajumdar <[email protected]> * Correct a bug with state management of adapter tokens Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Pull down EMA test Signed-off-by: smajumdar <[email protected]> * Correct name of adapter module utility class Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Co-authored-by: Jocelyn <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Create separator parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Call align function with hydra config Signed-off-by: Elena Rastorgueva <[email protected]> * update usage example Signed-off-by: Elena Rastorgueva <[email protected]> * Update Dockerfile (#5614) (#5616) Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future. Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Co-authored-by: He Huang (Steve) <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate pretrained_name and model_path parameters Signed-off-by: Elena Rastorgueva <[email protected]> * make "optional" tags bold in markdown Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move non-main functions to utils dir Signed-off-by: Elena Rastorgueva <[email protected]> * Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615) Signed-off-by: Ante Jukić <[email protected]> Co-authored-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] fix ranges of char set for accented letters. (#5607) * [TTS] fix ranges of char set for accented letters. * remove digits pattern and added unit tests for math operators. Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Change success message to reduce confusion (#5621) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update documentation and tutorials for Adapters (#5610) * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update test Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Rename test file Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602) * [TTS] add type hints and change variable names for tokenizers and g2p Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * 1. Added missing import for gather_objects. (#5627) Signed-off-by: Micha Livne <[email protected]> Signed-off-by: Micha Livne <[email protected]> Co-authored-by: Micha Livne <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fixed RadTTS unit test (#5572) Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove tests (#5633) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Conformer local attention (#5525) * local attn and merge Signed-off-by: sam1373 <[email protected]> * optional Signed-off-by: sam1373 <[email protected]> * override Signed-off-by: sam1373 <[email protected]> * incorporate comments Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> * fix Signed-off-by: sam1373 <[email protected]> * comment Signed-off-by: sam1373 <[email protected]> * changes, test Signed-off-by: sam1373 <[email protected]> * changes Signed-off-by: sam1373 <[email protected]> * check att context Signed-off-by: sam1373 <[email protected]> * readme link Signed-off-by: sam1373 <[email protected]> * utils Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> Signed-off-by: sam1373 <[email protected]> Signed-off-by: Samuel Kriman <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add core classes and functions for online clustering diarizer part 1 (#5526) * Add core classes and functions for online clustering diarizer Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add audio to labels code Signed-off-by: Taejin Park <[email protected]> * resolve type errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added unit=tests for very short audio Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Filled all missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved conflict and added missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixed unit-test errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix the wrongly added file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * Fix wrongly included file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * resolve code quality issue Signed-off-by: Taejin Park <[email protected]> * Fixed unit-test errors and bugs Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * changed total_sec for offline_clustering toy_data in unit-tests Signed-off-by: Taejin Park <[email protected]> * fixed merging index offset bug Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * only including part 1 files Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * removed unused function Signed-off-by: Taejin Park <[email protected]> * fixed unused imports Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * divided nmesc_clustering.py into two and reflected first-pass comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * adding offline/online_clustering.py Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix code QL autocomment Signed-off-by: Taejin Park <[email protected]> * Removed unused imports Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/online_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Reflected comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved code scanning issue Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/offline_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641) * add stt_eo_conformer_ctc_large model * stt_eo_conformer_transducer_large Co-authored-by: Andrei Andrusenko <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Removed unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Specify that filepaths need to be absolute Signed-off-by: Elena Rastorgueva <[email protected]> * replaces any spaces in utt_id with dashes Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make hydra script callable by another script Signed-off-by: Elena Rastorgueva <[email protected]> * do not specify default model or model_downsample_factor Signed-off-by: Elena Rastorgueva <[email protected]> * [Dockerfile] Remove AIS archive from docker image (#5629) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Measure audio_sr from audio instead of needing to specify Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541) * Chinese TTS replaces default pypinyin dict * Add jieba word segmenter as an option Signed-off-by: Yuekai Zhang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate parameters for device of transcription and viterbi steps Signed-off-by: Elena Rastorgueva <[email protected]> * Add mention of gecko Signed-off-by: Elena Rastorgueva <[email protected]> * [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [Add] ASR+VAD Inference Pipeline (#5575) Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets. Signed-off-by: stevehuang52 <[email protected]> Co-authored-by: fayejf <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * rename separator to ctm_grouping_separator and refactor Signed-off-by: Elena Rastorgueva <[email protected]> * Bert interleaved (#5556) * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * Update bert_model.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * Adding tests * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * Fix bug fix sequence parallel and Interleaved * Fix bug fix sequence parallel and Interleaved Signed-off-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add duration padding support for RADTTS inference (#5650) * Added duration padding support for RADTTS inference * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Kevin Shih <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Add remove_blank_tokens_from_ctm parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Dont save initial_silence line in CTM Signed-off-by: Elena Rastorgueva <[email protected]> * Add DLLogger support to exp_manager (#5658) * Add DLLogger support to exp_manager Signed-off-by: Alexandre Milesi <[email protected]> * Move dllogger to separate file and check import Signed-off-by: Alexandre Milesi <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Remove unused import Signed-off-by: Alexandre Milesi <[email protected]> Signed-off-by: Alexandre Milesi <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add minimum_timestamp_duration parameter Signed-off-by: Elena Rastorgueva <[email protected]> * add suggestion about removing blanks to README Signed-off-by: Elena Rastorgueva <[email protected]> * reorder args Signed-off-by: Elena Rastorgueva <[email protected]> * clarify description of ctm_grouping_separator in README Signed-off-by: Elena Rastorgueva <[email protected]> * update docstring Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Port stateless timer to exp manager (#5584) * Port stateless timer to exp manager Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes and remove from all megatron code Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Change message Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix EMA restart by allowing device to be set by the class init (#5668) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP (moved to separate repo) - merge to main (#5630) * Remove sdp files from tools folder Signed-off-by: Elena Rastorgueva <[email protected]> * Add page to docs with new SDP location Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add interface for making amax reduction optional for FP8 (#5447) * add TE interface for making amax reduction optional Signed-off-by: Kirthi Shankar Sivamani <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Kirthi Shankar Sivamani <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add tts dict cust notebook (#5662) * add tts dict cust notebook Signed-off-by: ekmb <[email protected]> * review Signed-off-by: ekmb <[email protected]> * fixed audio links Signed-off-by: ekmb <[email protected]> * remove old notebook Signed-off-by: ekmb <[email protected]> * fix typo Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [ASR] Audio processing base, multi-channel enhancement models (#5356) * Audio processing base model, enc-mask-dec enhancement, tests and modules Signed-off-by: Ante Jukić <[email protected]> * Addressed review comments Signed-off-by: Ante Jukić <[email protected]> * Fixed CodeQL warnings Signed-off-by: Ante Jukić <[email protected]> * Addressed PR comments Signed-off-by: Ante Jukić <[email protected]> * Addressed PR comments: - renamed AudioProcessingModel to AudioToAudioModel - various small modifications - updated unit tests Signed-off-by: Ante Jukić <[email protected]> * Addressed comments - Moved spectrogram to audio_preprocessing - Renamed MultichannelFeatures - Updated config and unit tests Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Expose ClusteringDiarizer device (#5681) * Expose device for users to set Signed-off-by: SeanNaren <[email protected]> * Expose device for users to set Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add Beam Search support to ASR transcribe() (#5443) * Add support for beam decoding via high level API. Signed-off-by: smajumdar <[email protected]> * Add ctc decoding section Signed-off-by: smajumdar <[email protected]> * Update ctc transcribe API to return results from beam search Signed-off-by: smajumdar <[email protected]> * Add argument to preserve arpa file Signed-off-by: smajumdar <[email protected]> * Update script to use hydra config, add some support for future compute timesteps, add doc for ctc decoding Signed-off-by: smajumdar <[email protected]> * Update eval script and doc to use new API Signed-off-by: smajumdar <[email protected]> * Add tests for ctc greedy decoding Signed-off-by: smajumdar <[email protected]> * Address reviewer comments and add docstrings Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix changes and address comments Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: Samuel Kriman <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Propagate attention_dropout flag for GPT-3 (#5669) * Propagate attention_dropout flag for GPT-3 Signed-off-by: Mikołaj Błaż <[email protected]> * Add default to megatron_gpt_config Signed-off-by: Mikołaj Błaż <[email protected]> Signed-off-by: Mikołaj Błaż <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <complex451@gmail…

* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <[email protected]> * remove cmudict downloading Signed-off-by: Oktai Tatanov <[email protected]> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <[email protected]> * add libritts processing Signed-off-by: Oktai Tatanov <[email protected]> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <[email protected]> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <[email protected]> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <[email protected]> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <[email protected]> * fix optimizers Signed-off-by: Oktai Tatanov <[email protected]> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (NVIDIA#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <[email protected]> Co-authored-by: Ryan Hong <[email protected]> Co-authored-by: [email protected] <[email protected]> Co-authored-by: Jason <[email protected]> Signed-off-by: Jason <[email protected]> * make new commit Signed-off-by: Jason <[email protected]> * add copyright headers Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * rename README Signed-off-by: Oktai Tatanov <[email protected]> * fix style without vits_modules Signed-off-by: Oktai Tatanov <[email protected]> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <[email protected]> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <[email protected]> * update package info Signed-off-by: ericharper <[email protected]> * new exp * update branch Signed-off-by: ericharper <[email protected]> * Restored tests previously disabled for 22.03 base (NVIDIA#4109) Signed-off-by: Boris Fomitchev <[email protected]> * add augmentation to label models (NVIDIA#4113) * add augmentation to label models Signed-off-by: nithinraok <[email protected]> * duration fix Signed-off-by: nithinraok <[email protected]> * Call register_bert_model after assigning self.bert_model variable (NVIDIA#4116) Signed-off-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> * Tutorial on ITN with Thutmose tagger and small fixes (NVIDIA#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <[email protected]> * fixes for code review Signed-off-by: Alexandra Antonova <[email protected]> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> * cleaned up TN/ ITN doc (NVIDIA#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <[email protected]> * fix typo Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * Check implicit grad acc in GLUE dataset building (NVIDIA#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <[email protected]> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <[email protected]> * update the default (NVIDIA#4135) Signed-off-by: ekmb <[email protected]> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <[email protected]> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <[email protected]> * Fix typo Signed-off-by: PeganovAnton <[email protected]> * Fix typo Signed-off-by: PeganovAnton <[email protected]> * Do not create tmp directory Signed-off-by: PeganovAnton <[email protected]> * Fix parameter name Signed-off-by: PeganovAnton <[email protected]> * finish cherry-pick op Signed-off-by: PeganovAnton <[email protected]> * Fix labels errors Signed-off-by: PeganovAnton <[email protected]> * Remove duplicate stage Signed-off-by: PeganovAnton <[email protected]> * Change target branch Signed-off-by: PeganovAnton <[email protected]> * fix typo (NVIDIA#4140) Signed-off-by: Yang Zhang <[email protected]> * Fix/punctuation avoid overwritting tmp files (NVIDIA#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <[email protected]> * Remove accidental changes Signed-off-by: PeganovAnton <[email protected]> * Remove accidental changes Signed-off-by: PeganovAnton <[email protected]> * Use built-in tempfile library Signed-off-by: PeganovAnton <[email protected]> * Fix code style Signed-off-by: PeganovAnton <[email protected]> * bug_fix_diarization_manifest_creation (NVIDIA#4125) Signed-off-by: Yang Zhang <[email protected]> Co-authored-by: Nithin Rao <[email protected]> * fix doc (NVIDIA#4146) Signed-off-by: Yang Zhang <[email protected]> * Tacotron2 retrain (NVIDIA#4103) * fix yaml Signed-off-by: treacker <[email protected]> * Fix for new TTSDataset class Signed-off-by: treacker <[email protected]> * added wandb logging Signed-off-by: treacker <[email protected]> * added wandb logging Signed-off-by: treacker <[email protected]> * fix numpy version Signed-off-by: treacker <[email protected]> * fix numpy version Signed-off-by: treacker <[email protected]> * inference fix Signed-off-by: treacker <[email protected]> * removed old code Signed-off-by: treacker <[email protected]> * updated parser logic Signed-off-by: treacker <[email protected]> * reverted version update Signed-off-by: treacker <[email protected]> * refactored parser logic Signed-off-by: treacker <[email protected]> * Updated Jenkinsfile Signed-off-by: treacker <[email protected]> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <[email protected]> * Made backward compatibility Signed-off-by: treacker <[email protected]> * Made backward compatibility Signed-off-by: treacker <[email protected]> * Update Jenkinsfile Signed-off-by: treacker <[email protected]> * Update tacotron.yaml Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * cleaned up TN/ ITN doc (NVIDIA#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <[email protected]> * fix typo Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: treacker <[email protected]> * Check implicit grad acc in GLUE dataset building (NVIDIA#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <[email protected]> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Fixed jenkins Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> * Multiprocess improvements (NVIDIA#4127) * initial commit Signed-off-by: nithinraok <[email protected]> * start fix Signed-off-by: nithinraok <[email protected]> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <[email protected]> * updated scp to filelist Signed-off-by: nithinraok <[email protected]> * WaveGlow input type fixes (NVIDIA#4151) Signed-off-by: Jocelyn Huang <[email protected]> * notebooks' link, typo and import fix (NVIDIA#4158) * redo missing pr 4007 Signed-off-by: fayejf <[email protected]> * remove extremely unreliable links Signed-off-by: fayejf <[email protected]> * Thutmose tagger bug fixes (NVIDIA#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <[email protected]> * fix model location Signed-off-by: Alexandra Antonova <[email protected]> * fix model location Signed-off-by: Alexandra Antonova <[email protected]> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <[email protected]> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <[email protected]> * changed model location to nemo Signed-off-by: Alexandra Antonova <[email protected]> * style fix Signed-off-by: Alexandra Antonova <[email protected]> * fix space Signed-off-by: Alexandra Antonova <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> * update speaker docs (NVIDIA#4164) * update speaker docs Signed-off-by: nithinraok <[email protected]> * chunks -> segments Signed-off-by: nithinraok <[email protected]> * Khz -> kHz Signed-off-by: nithinraok <[email protected]> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <[email protected]> * Fix link to inference notebook (NVIDIA#5247) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> * Update ASR scores table (NVIDIA#5254) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * Fix links to speaker identification notebook (NVIDIA#5260) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> * Minor typo fixes in TTS tutorial (NVIDIA#5266) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> * Pcla tutorial fixes (NVIDIA#5271) * Fixed typos Signed-off-by: Matvei Novikov <[email protected]> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <[email protected]> * Fixed typo Signed-off-by: Matvei Novikov <[email protected]> * Fixed branch variable Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Fix bug into Dialogue tutorial (NVIDIA#5277) * Typo fix (NVIDIA#5288) Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Fix dialogue tutorial bug (NVIDIA#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (NVIDIA#5310) * typo fix Signed-off-by: fayejf <[email protected]> * udpate transcribe Signed-off-by: fayejf <[email protected]> Signed-off-by: fayejf <[email protected]> * Add italian model checkpoints (NVIDIA#5316) Signed-off-by: Igor Gitman <[email protected]> Signed-off-by: Igor Gitman <[email protected]> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (NVIDIA#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <[email protected]> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (NVIDIA#5313) * fixes Signed-off-by: Matvei Novikov <[email protected]> * fixes Signed-off-by: Matvei Novikov <[email protected]> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (NVIDIA#5359) * Add Gradio App to ASR Docs (NVIDIA#5270) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * tutorial fixes (NVIDIA#5354) Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Add SDP documentation (NVIDIA#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <[email protected]> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <[email protected]> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <[email protected]> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <[email protected]> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <[email protected]> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <[email protected]> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <[email protected]> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <[email protected]> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <[email protected]> * Make config example less confusing Signed-off-by: Elena Rastorgueva <[email protected]> * Fix typo Signed-off-by: Elena Rastorgueva <[email protected]> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused imports Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <[email protected]> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Co-authored-by: Igor Gitman <[email protected]> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (NVIDIA#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <[email protected]> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * fix for num worker 0 causing issues in losses after 1 epoch (NVIDIA#5379) * Fixed bug in notebook (NVIDIA#5382) Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Virginia Adams <[email protected]> * Force MHA QKV onto fp32 (NVIDIA#5391) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * Added scheduling variety * ref * Fix for prompt table restore error (NVIDIA#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <[email protected]> * Added more saftey checks Signed-off-by: Virginia Adams <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA#5410) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421) Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> * Megatron Export Update (NVIDIA#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * disable pc test (NVIDIA#5426) Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA#5413) * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (NVIDIA#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421)" (NVIDIA#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <[email protected]> * Fix in config Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (NVIDIA#5242) (NVIDIA#5446) Signed-off-by: Vahid <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> * Set sync_batch_comm in other places (NVIDIA#5448) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Radtts 1.13 (NVIDIA#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 plus (NVIDIA#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Add num layers check (NVIDIA#5470) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Change to kwargs (NVIDIA#5475) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA#5339) (NVIDIA#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * minor bug fix (NVIDIA#5521) Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> * added set_start_method + function param bugfix (NVIDIA#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (NVIDIA#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * Remove broadcast (NVIDIA#5558) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * cleaning * Fix all gather while writing to a file during T5 finetuning (NVIDIA#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <[email protected]> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <[email protected]> * storing * updated version Signed-off-by: Evgeniy Shabalin <[email protected]> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <[email protected]> * fixed issues Signed-off-by: Evgeniy Shabalin <[email protected]> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <[email protected]> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <[email protected]> * remove old cython code Signed-off-by: Evgeniy Shabalin <[email protected]> * remove old cython code Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <[email protected]> * Enhancements Signed-off-by: Evgeniy Shabalin <[email protected]> * Enhancements Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <[email protected]> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <[email protected]> * typecheck fix Signed-off-by: Evgeniy Shabalin <[email protected]> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <[email protected]> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <[email protected]> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <[email protected]> Signed-off-by: Jason <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: nithinraok <[email protected]> Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: PeganovAnton <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: fayejf <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Igor Gitman <[email protected]> Signed-off-by: Sasha Meister <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yu Yao <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Evgeniy Shabalin <[email protected]> Co-authored-by: jasonjjl1999 <[email protected]> Co-authored-by: [email protected] <[email protected]> Co-authored-by: Oktai Tatanov <[email protected]> Co-authored-by: jasonjjl1999 <[email protected]> Co-authored-by: martynwei <[email protected]> Co-authored-by: Ryan Hong <[email protected]> Co-authored-by: Jason <[email protected]> Co-authored-by: ericharper <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: bene-ges <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Evelina <[email protected]> Co-authored-by: PeganovAnton <[email protected]> Co-authored-by: Jocelyn <[email protected]> Co-authored-by: fayejf <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Sean Naren <[email protected]> Co-authored-by: Matvei Novikov <[email protected]> Co-authored-by: Zhilin Wang <[email protected]> Co-authored-by: Igor Gitman <[email protected]> Co-authored-by: Sasha Meister <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: Taejin Park <[email protected]> Co-authored-by: Adi Renduchintala <[email protected]> Co-authored-by: Virginia Adams <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Vladimir Bataev <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Co-authored-by: Xuesong Yang <[email protected]>

* update branch Signed-off-by: ericharper <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Megatron Export Update (NVIDIA#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (NVIDIA#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 (NVIDIA#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA#5339) (NVIDIA#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * added set_start_method + function param bugfix (NVIDIA#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (NVIDIA#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Xuesong Yang <[email protected]>

* Merge r1.13.0 main (#5570) * update branch Signed-off-by: ericharper <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * update branch Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> * revert Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Optimized loop and bugfix in SDE (#5573) - Fixed bug with loading custom data attributes from JSON in Speech Data Explorer Signed-off-by: George Zelenfroynd <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update torchmetrics (#5566) * add task arg Signed-off-by: nithinraok <[email protected]> * update state Signed-off-by: nithinraok <[email protected]> Signed-off-by: nithinraok <[email protected]> Co-authored-by: Taejin Park <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove useless files. (#5580) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add initial NFA code Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make use of the specified device during viterbi decoding Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL notes Signed-off-by: Elena Rastorgueva <[email protected]> * Fix CodeQL warning Signed-off-by: Elena Rastorgueva <[email protected]> * Add an option to defer data setup from ``__init__`` to ``setup`` (#5569) * Add an option to defer dataloader setup from __init__ to setup Signed-off-by: Ante Jukić <[email protected]> * Updated doc Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make utt_id specified by number of parts of audio_filepath user wishes to use Signed-off-by: Elena Rastorgueva <[email protected]> * remove audio_sr TODO - reduce risk of silent bugs Signed-off-by: Elena Rastorgueva <[email protected]> * Add check that model is CTC Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Text generation improvement (UI client, data parallel support) (#5437) * Squashed commit of the following: commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 15:07:42 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 35b424044fe80c3081e7756ab21244f701716f7e Author: Yi Dong <[email protected]> Date: Thu Oct 13 08:04:49 2022 -0700 get rid of base Signed-off-by: Yi Dong <[email protected]> commit 2955210e2311791543538cfbb5ad26b79414c954 Merge: d52edef8c eaf6757ca Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:17:02 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit d52edef8cd7b36593838fb270047e80f8ccb652e Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:16:24 2022 +0000 align with main Signed-off-by: Yi Dong <[email protected]> commit eaf6757ca5be8e099492f57c81d984429b0ad49c Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Oct 13 13:12:11 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:10:58 2022 +0000 same as main Signed-off-by: Yi Dong <[email protected]> commit e335de51bcc0d681c58b568c3d8c238bc5687c3b Merge: c231086e0 4463a9fe9 Author: Yi Dong <[email protected]> Date: Thu Oct 13 13:08:09 2022 +0000 Merge branch 'main' into universal_prompt commit c231086e057f1efaa915f691d84664cb3d5aad85 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 19:59:12 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:56:17 2022 +0000 default to pad Signed-off-by: Yi Dong <[email protected]> commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:55:44 2022 +0000 add the option to pad the tokens Signed-off-by: Yi Dong <[email protected]> commit 876dc395b43fdeeaa2bcbbe13c76523633764c33 Merge: fbb0f4035 fe3c77ee9 Author: Yi Dong <[email protected]> Date: Wed Oct 12 19:20:47 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392 Author: Yi Dong <[email protected]> Date: Wed Oct 12 18:59:49 2022 +0000 fix import again Signed-off-by: Yi Dong <[email protected]> commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Wed Oct 12 16:00:24 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937 Author: Yi Dong <[email protected]> Date: Wed Oct 12 15:58:32 2022 +0000 enable server Signed-off-by: Yi Dong <[email protected]> commit cbe05d9fbc978f812cfbb671f45f147f300713c4 Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:07:28 2022 +0000 fix comment error Signed-off-by: Yi Dong <[email protected]> commit 1948048922e726ec6131e44b1a745389f18d4ef2 Merge: 232c2cce3 984f5c09a Author: Yi Dong <[email protected]> Date: Wed Oct 12 13:05:30 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 232c2cce34d7a8b902da406706f3dd9b39475091 Merge: 34c8a68df 658243fb6 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:50:00 2022 +0000 Merge branch 'fix_global_init' into universal_prompt commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f Merge: 658243fb6 3fda5de46 Author: Yi Dong <[email protected]> Date: Wed Oct 12 08:42:11 2022 -0400 Merge branch 'main' into fix_global_init commit 658243fb6580191b5d60edd30cde16dcc23cbb85 Author: Yi Dong <[email protected]> Date: Wed Oct 12 12:40:57 2022 +0000 fix import error Signed-off-by: Yi Dong <[email protected]> commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c Author: Yi Dong <[email protected]> Date: Tue Oct 11 22:44:12 2022 +0000 update the fused kernel Signed-off-by: Yi Dong <[email protected]> commit 536cf6bef9447b75843fad630729c47a2fba35f3 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:44:52 2022 -0700 add the missing file Signed-off-by: Yi Dong <[email protected]> commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:43:14 2022 -0700 fix fused softmax Signed-off-by: Yi Dong <[email protected]> commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263 Author: Yi Dong <[email protected]> Date: Tue Oct 11 14:16:48 2022 -0700 move global step to base Signed-off-by: Yi Dong <[email protected]> commit 34c8a68df084b18d377e84415d9f07b2cd6673dd Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:50:11 2022 +0000 fix pipeline for eval Signed-off-by: Yi Dong <[email protected]> commit eee5d38218f26660c3ffebe9f615c850c80a1f0d Author: Yi Dong <[email protected]> Date: Thu Oct 6 13:48:22 2022 +0000 fix for pipleline parallel Signed-off-by: Yi Dong <[email protected]> commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d Merge: 125e49947 e3b4c4d1f Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:13 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 125e4994760448ff75dd9328395813eda1c87547 Author: Yi Dong <[email protected]> Date: Wed Oct 5 19:29:04 2022 +0000 add share option Signed-off-by: Yi Dong <[email protected]> commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368 Author: Yi Dong <[email protected]> Date: Wed Oct 5 11:43:48 2022 -0700 make sure consolidation works Signed-off-by: Yi Dong <[email protected]> commit a5c833964ecf05dc460ca1da69275c4019742150 Merge: 2a07ab52d abcb74be2 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:29 2022 +0000 Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt commit 2a07ab52d95f15ba666823028c69e23825666c05 Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:40:23 2022 +0000 added requirement Signed-off-by: Yi Dong <[email protected]> commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c Author: Yi Dong <[email protected]> Date: Wed Oct 5 18:39:42 2022 +0000 added a simple web server Signed-off-by: Yi Dong <[email protected]> commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b Author: Yi Dong <[email protected]> Date: Wed Oct 5 06:54:12 2022 -0700 fix empty val loss Signed-off-by: Yi Dong <[email protected]> commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26 Author: Yi Dong <[email protected]> Date: Tue Oct 4 19:25:30 2022 -0700 text gen working Signed-off-by: Yi Dong <[email protected]> commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba Author: Yi Dong <[email protected]> Date: Tue Oct 4 16:08:40 2022 -0700 first change Signed-off-by: Yi Dong <[email protected]> commit 59d077585e6962a669b824af58f64e8a0bea6547 Author: Yi Dong <[email protected]> Date: Tue Oct 4 15:00:40 2022 -0700 revert Signed-off-by: Yi Dong <[email protected]> commit 12a0f3902d99e9179403644bd951c045df716ca7 Author: Yi Dong <[email protected]> Date: Tue Oct 4 21:26:23 2022 +0000 init imp Signed-off-by: Yi Dong <[email protected]> commit 62a15dfd943cc48be495ac61b9f2f00995775c5f Merge: 82c90d2cd e0cc6b767 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:58:26 2022 -0700 Merge branch 'main' into universal_prompt commit 82c90d2cd0fd156f16a4b899f8c741d598f33990 Author: Yi Dong <[email protected]> Date: Tue Oct 4 11:17:13 2022 -0700 add sync Signed-off-by: Yi Dong <[email protected]> commit 9819b703eef877d90cd1257bf3610c69de9b4d7e Author: Yi Dong <[email protected]> Date: Sun Oct 2 17:52:34 2022 -0700 fix save model Signed-off-by: root <[email protected]> commit e4937e2fc5fb7d70754c97668416e4a69c3079fe Author: Yi Dong <[email protected]> Date: Sat Oct 1 18:56:09 2022 +0000 working Signed-off-by: Yi Dong <[email protected]> commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b Author: Yi Dong <[email protected]> Date: Sat Oct 1 17:34:03 2022 +0000 calcuate the mask Signed-off-by: Yi Dong <[email protected]> commit 9db3bc13eb65a94a475b837603351da68e3745bc Author: Yi Dong <[email protected]> Date: Fri Sep 30 23:26:32 2022 +0000 fix bug in datasets Signed-off-by: Yi Dong <[email protected]> commit f289900375d4412f53f8110be00fec6587627550 Author: Yi Dong <[email protected]> Date: Fri Sep 30 22:29:40 2022 +0000 update the code Signed-off-by: Yi Dong <[email protected]> commit 8e28a1f208aabaab72dbe769e72756baada04d99 Author: Yi Dong <[email protected]> Date: Fri Sep 30 21:52:52 2022 +0000 added new ds Signed-off-by: Yi Dong <[email protected]> commit 8d41315bab7ce90e200a8a7d1023c34f8e046897 Author: Yi Dong <[email protected]> Date: Fri Sep 30 18:57:09 2022 +0000 added new files Signed-off-by: Yi Dong <[email protected]> commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f Merge: cbe8b7ab1 fa6cd8588 Author: Yi Dong <[email protected]> Date: Thu Sep 29 21:43:29 2022 +0000 Merge branch 'llm-prompt-learning-improvements' into universal_prompt commit fa6cd858839277939446afe7275976078d54c512 Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Date: Thu Sep 29 16:47:30 2022 +0000 [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0 Merge: 7d6d46742 8d670bc77 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 09:43:27 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 7d6d46742170a66758287a207d67e1b1bfd15613 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 16:42:43 2022 +0000 Removed inference step and added sentence peice check to predict step Signed-off-by: Virginia Adams <[email protected]> commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2 Author: Virginia Adams <[email protected]> Date: Thu Sep 29 15:26:32 2022 +0000 fixed first stage check for pipeline parallel T5 pt Signed-off-by: Virginia Adams <[email protected]> commit 3637be2b258c8d9028856f9971edb7da4a8121f0 Merge: a3ea722fd 986a76612 Author: Virginia Adams <[email protected]> Date: Wed Sep 28 10:23:30 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4 Merge: 770967a52 971485ce7 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 13:35:52 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 770967a5251a474b6dcc2d44bf9a2076adbcb604 Merge: d23bf6c30 e3ac280a8 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:17:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit d23bf6c30acc0e3f6af9b4e24547669866a34d62 Merge: de6a31651 333d2b749 Author: Virginia Adams <[email protected]> Date: Mon Sep 26 10:05:16 2022 -0700 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit de6a31651e63d88a42b971794d93f18ff5a3cdff Author: Virginia Adams <[email protected]> Date: Mon Sep 26 17:00:53 2022 +0000 Updated PP check to be on first stage pipeline only Signed-off-by: Virginia Adams <[email protected]> commit 333d2b7498e6742ce66436f733c980a74616900c Merge: 592c0986a a39fc925a Author: Virginia Adams <[email protected]> Date: Fri Sep 23 16:11:21 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 592c0986a476a91b57b8605d7b70830d7acfa021 Author: Virginia Adams <[email protected]> Date: Fri Sep 23 23:08:41 2022 +0000 Fixed unused import and CI test bug Signed-off-by: Virginia Adams <[email protected]> commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455 Merge: ce4b72c8c b566c2d0e Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:25 2022 +0000 Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements commit ce4b72c8c52f32be336e323dd78a38089edc3e7c Author: Virginia Adams <[email protected]> Date: Fri Sep 23 18:57:16 2022 +0000 Switch to import from base class Signed-off-by: Virginia Adams <[email protected]> commit b566c2d0e35a068f758fd1310bc620a47be4590b Merge: 6621f2854 e872061ac Author: Virginia Adams <[email protected]> Date: Fri Sep 23 10:09:03 2022 -0700 Merge branch 'main' into llm-prompt-learning-improvements commit 6621f28543828a48484a5637f6c9f3ccb23a5b02 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:47:35 2022 +0000 python format fix Signed-off-by: Virginia Adams <[email protected]> commit 8deafc8987b6af5f7b99a250310f57a40198c37f Author: Virginia Adams <[email protected]> Date: Wed Sep 14 20:28:02 2022 +0000 Save .nemo on new best val score Signed-off-by: Virginia Adams <[email protected]> commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41 Author: Virginia Adams <[email protected]> Date: Wed Sep 14 18:03:19 2022 +0000 Added automatic checkpoint to nemo file method Signed-off-by: Virginia Adams <[email protected]> commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b Author: Virginia Adams <[email protected]> Date: Wed Sep 14 02:11:56 2022 +0000 Make GPT use base prompt learning model class: Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yi Dong <[email protected]> * fix LGTM Signed-off-by: Yi Dong <[email protected]> * fix validation Signed-off-by: Yi Dong <[email protected]> * change for the lm eval Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * make text generation work in data parallel environment Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * implement the service with rest service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * surpress log Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * Update config Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Restore function needed for NMT Signed-off-by: MaximumEntropy <[email protected]> * handles no answer only Signed-off-by: Yi Dong <[email protected]> * Fix config Signed-off-by: MaximumEntropy <[email protected]> * added knn to web Signed-off-by: Yi Dong <[email protected]> * fix lgtm.com comments Signed-off-by: Yi Dong <[email protected]> * output the retrieved context Signed-off-by: Yi Dong <[email protected]> * allow no neighbor query Signed-off-by: Yi Dong <[email protected]> * remove the imports Signed-off-by: Yi Dong <[email protected]> * warn only once Signed-off-by: Yi Dong <[email protected]> * Change output file format from JSON to JSONL Signed-off-by: MaximumEntropy <[email protected]> * new t0 dataset Signed-off-by: Yi Dong <[email protected]> * Add T0 data preproc scripts Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Merge and multiprocessing Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix for is_correct Signed-off-by: MaximumEntropy <[email protected]> * fix epoch > 2 Signed-off-by: Yi Dong <[email protected]> * handles multiple dataloader Signed-off-by: Yi Dong <[email protected]> * remove template Signed-off-by: Yi Dong <[email protected]> * Refactor T0 dataset Signed-off-by: MaximumEntropy <[email protected]> * Add script to merge train folder into individual training files to minimize number of blends Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added on the fly service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add combo instance Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added combo service Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * send weights back to server Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix index store Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Minor changes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add reset button Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add add eos Signed-off-by: Yi Dong <[email protected]> * use a seperate bert service Signed-off-by: Yi Dong <[email protected]> * no loss of accuracy Signed-off-by: Yi Dong <[email protected]> * pin the gradio version Signed-off-by: Yi Dong <[email protected]> * Remove bin compat Signed-off-by: MaximumEntropy <[email protected]> * Fix header lines Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * evaluate based on text generation Signed-off-by: Yi Dong <[email protected]> * exact match result aggregation Signed-off-by: Yi Dong <[email protected]> * working SP and SA Signed-off-by: Yi Dong <[email protected]> * sync Signed-off-by: Yi Dong <[email protected]> * fix checkpoint Signed-off-by: Yi Dong <[email protected]> * fix eval Signed-off-by: Yi Dong <[email protected]> * backup states Signed-off-by: Yi Dong <[email protected]> * backup states reset Signed-off-by: Yi Dong <[email protected]> * fix the bug Signed-off-by: Yi Dong <[email protected]> * fix evaluation for sentence piece Signed-off-by: Yi Dong <[email protected]> * fix a bug Signed-off-by: Yi Dong <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * potential fix in the future Signed-off-by: Yi Dong <[email protected]> * remove the universal codes Signed-off-by: Yi Dong <[email protected]> * remove universal strategy Signed-off-by: Yi Dong <[email protected]> * address reviewer comment Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add align function docstrings and make most args optional Signed-off-by: Elena Rastorgueva <[email protected]> * Remove redundant returns of viterbi and log probs matrices Signed-off-by: Elena Rastorgueva <[email protected]> * Rename h# to <initial_silence> Signed-off-by: Elena Rastorgueva <[email protected]> * Update manifest format description in README Signed-off-by: Elena Rastorgueva <[email protected]> * always remove any spaces from utt_id Signed-off-by: Elena Rastorgueva <[email protected]> * Patch the hanging of threads on very large stderr (#5589) (#5590) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * O2 style amp for gpt3 ptuning (#5246) * enable amp o2 plugin Signed-off-by: Jimmy Zhang <[email protected]> * only create master param if param requires gradient Signed-off-by: Jimmy Zhang <[email protected]> * remove pytorch autocast Signed-off-by: Jimmy Zhang <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Jimmy Zhang <[email protected]> * Update optimizer_with_main_params.py Signed-off-by: JimmyZhang12 <[email protected]> * create master grad only if param group requires grad Signed-off-by: Jimmy Zhang <[email protected]> * fix grad scaler for pp > 1 Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: Jimmy Zhang <[email protected]> Signed-off-by: JimmyZhang12 <[email protected]> Co-authored-by: Jimmy Zhang <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Better patch hydra (#5591) (#5592) * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * Readd buffereing and thread drain to Hydra Launcher Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Yet another fix with hydra multirun (#5594) (#5595) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add RETRO model documentation (#5578) * added retro doc Signed-off-by: Yi Dong <[email protected]> * finish data part Signed-off-by: Yi Dong <[email protected]> * added the data format Signed-off-by: Yi Dong <[email protected]> * added training script Signed-off-by: Yi Dong <[email protected]> * added training and evaluation steps Signed-off-by: Yi Dong <[email protected]> * edit the text Signed-off-by: Yi Dong <[email protected]> * added the images Signed-off-by: Yi Dong <[email protected]> * fix beginning Signed-off-by: Yi Dong <[email protected]> * fix the grammar Signed-off-by: Yi Dong <[email protected]> * trim it down Signed-off-by: Yi Dong <[email protected]> * add wandb option Signed-off-by: Yi Dong <[email protected]> * add reference Signed-off-by: Yi Dong <[email protected]> * fix path Signed-off-by: Yi Dong <[email protected]> * added the parameters table Signed-off-by: Yi Dong <[email protected]> * fix section Signed-off-by: Yi Dong <[email protected]> Signed-off-by: Yi Dong <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix: setup_multiple validation/test data (#5585) Fix: setup_multiple validation/test data (#5585) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move to optimizer based EMA implementation (#5169) * Move to optimizer Signed-off-by: SeanNaren <[email protected]> * Fix replacing weights Signed-off-by: SeanNaren <[email protected]> * Allow swapping of weights be optional Signed-off-by: SeanNaren <[email protected]> * Save 2 models Signed-off-by: SeanNaren <[email protected]> * Use different hook Signed-off-by: SeanNaren <[email protected]> * Expose cpu device Signed-off-by: SeanNaren <[email protected]> * Add clause to see if this fixes issue with O2 optimizer Signed-off-by: SeanNaren <[email protected]> * Try to get O2 working Signed-off-by: SeanNaren <[email protected]> * WIP Signed-off-by: SeanNaren <[email protected]> * Fixes Signed-off-by: SeanNaren <[email protected]> * Fixes to tests Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Remove import Signed-off-by: SeanNaren <[email protected]> * Add guard Signed-off-by: SeanNaren <[email protected]> * Add comment Signed-off-by: SeanNaren <[email protected]> * Remove overwrite Signed-off-by: SeanNaren <[email protected]> * Add BatchNorm, currently tests fail Signed-off-by: SeanNaren <[email protected]> * Fix tests/functionality for batch norm Signed-off-by: SeanNaren <[email protected]> * Get rid of NLP changes Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * AIStore for ASR datasets (#5462) AIStore for ASR datasets Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add support for MHA adapters to ASR (#5396) * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * Temporary fixes to new signature of mixin Signed-off-by: smajumdar <[email protected]> * Add adapter util for constants, add all mha adapters. Signed-off-by: smajumdar <[email protected]> * Update name of function Signed-off-by: smajumdar <[email protected]> * Roll back changes to convASR Signed-off-by: smajumdar <[email protected]> * Convert AbstractAdapterModule to AbstractAdapterMixin Signed-off-by: smajumdar <[email protected]> * First draft of Conformer support for MHA attention Signed-off-by: smajumdar <[email protected]> * Add some preliminary tests Signed-off-by: smajumdar <[email protected]> * Add support for projection of the hidden dimension for attention Signed-off-by: smajumdar <[email protected]> * Add support for squeezeformer Signed-off-by: smajumdar <[email protected]> * Update train adapter config Signed-off-by: smajumdar <[email protected]> * Add tests for squeezeformer and unit tests for new modules Signed-off-by: smajumdar <[email protected]> * Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path Signed-off-by: smajumdar <[email protected]> * Update location of adapters Signed-off-by: smajumdar <[email protected]> * Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v Signed-off-by: smajumdar <[email protected]> * Update expmanager to clean up checkpoints Signed-off-by: smajumdar <[email protected]> * Fix style Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * Add docstrings and update tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update training scripts Signed-off-by: smajumdar <[email protected]> * Update config and docs Signed-off-by: smajumdar <[email protected]> * Expose nemo delete function Signed-off-by: smajumdar <[email protected]> * Correct adapter partial state saving Signed-off-by: smajumdar <[email protected]> * Correct a bug with state management of adapter tokens Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Pull down EMA test Signed-off-by: smajumdar <[email protected]> * Correct name of adapter module utility class Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Co-authored-by: Jocelyn <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Create separator parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Call align function with hydra config Signed-off-by: Elena Rastorgueva <[email protected]> * update usage example Signed-off-by: Elena Rastorgueva <[email protected]> * Update Dockerfile (#5614) (#5616) Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future. Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Signed-off-by: He Huang (Steve) <[email protected]> Co-authored-by: He Huang (Steve) <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate pretrained_name and model_path parameters Signed-off-by: Elena Rastorgueva <[email protected]> * make "optional" tags bold in markdown Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Move non-main functions to utils dir Signed-off-by: Elena Rastorgueva <[email protected]> * Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615) Signed-off-by: Ante Jukić <[email protected]> Co-authored-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] fix ranges of char set for accented letters. (#5607) * [TTS] fix ranges of char set for accented letters. * remove digits pattern and added unit tests for math operators. Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Change success message to reduce confusion (#5621) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Update documentation and tutorials for Adapters (#5610) * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * Improve docs for adapter and tests Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update test Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Rename test file Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602) * [TTS] add type hints and change variable names for tokenizers and g2p Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * 1. Added missing import for gather_objects. (#5627) Signed-off-by: Micha Livne <[email protected]> Signed-off-by: Micha Livne <[email protected]> Co-authored-by: Micha Livne <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Fixed RadTTS unit test (#5572) Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * remove tests (#5633) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Conformer local attention (#5525) * local attn and merge Signed-off-by: sam1373 <[email protected]> * optional Signed-off-by: sam1373 <[email protected]> * override Signed-off-by: sam1373 <[email protected]> * incorporate comments Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> * fix Signed-off-by: sam1373 <[email protected]> * comment Signed-off-by: sam1373 <[email protected]> * changes, test Signed-off-by: sam1373 <[email protected]> * changes Signed-off-by: sam1373 <[email protected]> * check att context Signed-off-by: sam1373 <[email protected]> * readme link Signed-off-by: sam1373 <[email protected]> * utils Signed-off-by: sam1373 <[email protected]> * update Signed-off-by: sam1373 <[email protected]> Signed-off-by: sam1373 <[email protected]> Signed-off-by: Samuel Kriman <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add core classes and functions for online clustering diarizer part 1 (#5526) * Add core classes and functions for online clustering diarizer Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add audio to labels code Signed-off-by: Taejin Park <[email protected]> * resolve type errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added unit=tests for very short audio Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Filled all missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved conflict and added missing docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixed unit-test errors Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix the wrongly added file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * Fix wrongly included file - megatron_gpt_model.py Signed-off-by: Taejin Park <[email protected]> * resolve code quality issue Signed-off-by: Taejin Park <[email protected]> * Fixed unit-test errors and bugs Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * changed total_sec for offline_clustering toy_data in unit-tests Signed-off-by: Taejin Park <[email protected]> * fixed merging index offset bug Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * only including part 1 files Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * removed unused function Signed-off-by: Taejin Park <[email protected]> * fixed unused imports Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * divided nmesc_clustering.py into two and reflected first-pass comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * adding offline/online_clustering.py Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix code QL autocomment Signed-off-by: Taejin Park <[email protected]> * Removed unused imports Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/online_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Reflected comments Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolved code scanning issue Signed-off-by: Taejin Park <[email protected]> * Update nemo/collections/asr/parts/utils/offline_clustering.py Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Sean Naren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641) * add stt_eo_conformer_ctc_large model * stt_eo_conformer_transducer_large Co-authored-by: Andrei Andrusenko <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Removed unused import Signed-off-by: Elena Rastorgueva <[email protected]> * Specify that filepaths need to be absolute Signed-off-by: Elena Rastorgueva <[email protected]> * replaces any spaces in utt_id with dashes Signed-off-by: Elena Rastorgueva <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Elena Rastorgueva <[email protected]> * Make hydra script callable by another script Signed-off-by: Elena Rastorgueva <[email protected]> * do not specify default model or model_downsample_factor Signed-off-by: Elena Rastorgueva <[email protected]> * [Dockerfile] Remove AIS archive from docker image (#5629) Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Measure audio_sr from audio instead of needing to specify Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541) * Chinese TTS replaces default pypinyin dict * Add jieba word segmenter as an option Signed-off-by: Yuekai Zhang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Make separate parameters for device of transcription and viterbi steps Signed-off-by: Elena Rastorgueva <[email protected]> * Add mention of gecko Signed-off-by: Elena Rastorgueva <[email protected]> * [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [Add] ASR+VAD Inference Pipeline (#5575) Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets. Signed-off-by: stevehuang52 <[email protected]> Co-authored-by: fayejf <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * rename separator to ctm_grouping_separator and refactor Signed-off-by: Elena Rastorgueva <[email protected]> * Bert interleaved (#5556) * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding SP and SAR support Bert * Adding SP and SAR support Bert * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Adding Sequence parallel support to Bert * Update bert_model.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * Adding tests * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * Adding interleaved pipeline parallelism * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Addressing Eric's comments * Fix bug fix sequence parallel and Interleaved * Fix bug fix sequence parallel and Interleaved Signed-off-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add duration padding support for RADTTS inference (#5650) * Added duration padding support for RADTTS inference * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Kevin Shih <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Add remove_blank_tokens_from_ctm parameter Signed-off-by: Elena Rastorgueva <[email protected]> * Dont save initial_silence line in CTM Signed-off-by: Elena Rastorgueva <[email protected]> * Add DLLogger support to exp_manager (#5658) * Add DLLogger support to exp_manager Signed-off-by: Alexandre Milesi <[email protected]> * Move dllogger to separate file and check import Signed-off-by: Alexandre Milesi <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Remove unused import Signed-off-by: Alexandre Milesi <[email protected]> Signed-off-by: Alexandre Milesi <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * add minimum_timestamp_duration parameter Signed-off-by: Elena Rastorgueva <[email protected]> * add suggestion about removing blanks to README Signed-off-by: Elena Rastorgueva <[email protected]> * reorder args Signed-off-by: Elena Rastorgueva <[email protected]> * clarify description of ctm_grouping_separator in README Signed-off-by: Elena Rastorgueva <[email protected]> * update docstring Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664) Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Port stateless timer to exp manager (#5584) * Port stateless timer to exp manager Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes and remove from all megatron code Signed-off-by: MaximumEntropy <[email protected]> * Fixes Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Change message Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Fix EMA restart by allowing device to be set by the class init (#5668) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP (moved to separate repo) - merge to main (#5630) * Remove sdp files from tools folder Signed-off-by: Elena Rastorgueva <[email protected]> * Add page to docs with new SDP location Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add interface for making amax reduction optional for FP8 (#5447) * add TE interface for making amax reduction optional Signed-off-by: Kirthi Shankar Sivamani <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Kirthi Shankar Sivamani <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Eric Harper <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [TTS] add tts dict cust notebook (#5662) * add tts dict cust notebook Signed-off-by: ekmb <[email protected]> * review Signed-off-by: ekmb <[email protected]> * fixed audio links Signed-off-by: ekmb <[email protected]> * remove old notebook Signed-off-by: ekmb <[email protected]> * fix typo Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * [ASR] Audio processing base, multi-channel enhancement models (#5356) * Audio processing base model, enc-mask-dec enhancement, tests and modules Signed-off-by: Ante Jukić <[email protected]> * Addressed review comments Signed-off-by: Ante Jukić <[email protected]> * Fixed CodeQL warnings Signed-off-by: Ante Jukić <[email protected]> * Addressed PR comments Signed-off-by: Ante Jukić <[email protected]> * Addressed PR comments: - renamed AudioProcessingModel to AudioToAudioModel - various small modifications - updated unit tests Signed-off-by: Ante Jukić <[email protected]> * Addressed comments - Moved spectrogram to audio_preprocessing - Renamed MultichannelFeatures - Updated config and unit tests Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Ante Jukić <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Expose ClusteringDiarizer device (#5681) * Expose device for users to set Signed-off-by: SeanNaren <[email protected]> * Expose device for users to set Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Add Beam Search support to ASR transcribe() (#5443) * Add support for beam decoding via high level API. Signed-off-by: smajumdar <[email protected]> * Add ctc decoding section Signed-off-by: smajumdar <[email protected]> * Update ctc transcribe API to return results from beam search Signed-off-by: smajumdar <[email protected]> * Add argument to preserve arpa file Signed-off-by: smajumdar <[email protected]> * Update script to use hydra config, add some support for future compute timesteps, add doc for ctc decoding Signed-off-by: smajumdar <[email protected]> * Update eval script and doc to use new API Signed-off-by: smajumdar <[email protected]> * Add tests for ctc greedy decoding Signed-off-by: smajumdar <[email protected]> * Address reviewer comments and add docstrings Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix changes and address comments Signed-off-by: smajumdar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: smajumdar <[email protected]> Co-authored-by: Samuel Kriman <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <[email protected]> * Propagate attention_dropout flag for GPT-3 (#5669) * Propagate attention_dropout flag for GPT-3 Signed-off-by: Mikołaj Błaż <[email protected]> * Add default to megatron_gpt_config Signed-off-by: Mikołaj Błaż <[email protected]> Signed-off-by: Mikołaj Błaż <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Eric Harper <complex451@gmail…

* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <[email protected]> * remove cmudict downloading Signed-off-by: Oktai Tatanov <[email protected]> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <[email protected]> * add libritts processing Signed-off-by: Oktai Tatanov <[email protected]> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <[email protected]> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <[email protected]> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <[email protected]> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <[email protected]> * fix optimizers Signed-off-by: Oktai Tatanov <[email protected]> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (NVIDIA#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <[email protected]> Co-authored-by: Ryan Hong <[email protected]> Co-authored-by: [email protected] <[email protected]> Co-authored-by: Jason <[email protected]> Signed-off-by: Jason <[email protected]> * make new commit Signed-off-by: Jason <[email protected]> * add copyright headers Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * rename README Signed-off-by: Oktai Tatanov <[email protected]> * fix style without vits_modules Signed-off-by: Oktai Tatanov <[email protected]> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <[email protected]> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <[email protected]> * update package info Signed-off-by: ericharper <[email protected]> * new exp * update branch Signed-off-by: ericharper <[email protected]> * Restored tests previously disabled for 22.03 base (NVIDIA#4109) Signed-off-by: Boris Fomitchev <[email protected]> * add augmentation to label models (NVIDIA#4113) * add augmentation to label models Signed-off-by: nithinraok <[email protected]> * duration fix Signed-off-by: nithinraok <[email protected]> * Call register_bert_model after assigning self.bert_model variable (NVIDIA#4116) Signed-off-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> * Tutorial on ITN with Thutmose tagger and small fixes (NVIDIA#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <[email protected]> * fixes for code review Signed-off-by: Alexandra Antonova <[email protected]> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> * cleaned up TN/ ITN doc (NVIDIA#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <[email protected]> * fix typo Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * Check implicit grad acc in GLUE dataset building (NVIDIA#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <[email protected]> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <[email protected]> * update the default (NVIDIA#4135) Signed-off-by: ekmb <[email protected]> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <[email protected]> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <[email protected]> * Fix typo Signed-off-by: PeganovAnton <[email protected]> * Fix typo Signed-off-by: PeganovAnton <[email protected]> * Do not create tmp directory Signed-off-by: PeganovAnton <[email protected]> * Fix parameter name Signed-off-by: PeganovAnton <[email protected]> * finish cherry-pick op Signed-off-by: PeganovAnton <[email protected]> * Fix labels errors Signed-off-by: PeganovAnton <[email protected]> * Remove duplicate stage Signed-off-by: PeganovAnton <[email protected]> * Change target branch Signed-off-by: PeganovAnton <[email protected]> * fix typo (NVIDIA#4140) Signed-off-by: Yang Zhang <[email protected]> * Fix/punctuation avoid overwritting tmp files (NVIDIA#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <[email protected]> * Remove accidental changes Signed-off-by: PeganovAnton <[email protected]> * Remove accidental changes Signed-off-by: PeganovAnton <[email protected]> * Use built-in tempfile library Signed-off-by: PeganovAnton <[email protected]> * Fix code style Signed-off-by: PeganovAnton <[email protected]> * bug_fix_diarization_manifest_creation (NVIDIA#4125) Signed-off-by: Yang Zhang <[email protected]> Co-authored-by: Nithin Rao <[email protected]> * fix doc (NVIDIA#4146) Signed-off-by: Yang Zhang <[email protected]> * Tacotron2 retrain (NVIDIA#4103) * fix yaml Signed-off-by: treacker <[email protected]> * Fix for new TTSDataset class Signed-off-by: treacker <[email protected]> * added wandb logging Signed-off-by: treacker <[email protected]> * added wandb logging Signed-off-by: treacker <[email protected]> * fix numpy version Signed-off-by: treacker <[email protected]> * fix numpy version Signed-off-by: treacker <[email protected]> * inference fix Signed-off-by: treacker <[email protected]> * removed old code Signed-off-by: treacker <[email protected]> * updated parser logic Signed-off-by: treacker <[email protected]> * reverted version update Signed-off-by: treacker <[email protected]> * refactored parser logic Signed-off-by: treacker <[email protected]> * Updated Jenkinsfile Signed-off-by: treacker <[email protected]> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <[email protected]> * Made backward compatibility Signed-off-by: treacker <[email protected]> * Made backward compatibility Signed-off-by: treacker <[email protected]> * Update Jenkinsfile Signed-off-by: treacker <[email protected]> * Update tacotron.yaml Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * cleaned up TN/ ITN doc (NVIDIA#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <[email protected]> * fix typo Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> * fix image Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: treacker <[email protected]> * Check implicit grad acc in GLUE dataset building (NVIDIA#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <[email protected]> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Fixed jenkins Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> * Refactoring Signed-off-by: treacker <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> * Multiprocess improvements (NVIDIA#4127) * initial commit Signed-off-by: nithinraok <[email protected]> * start fix Signed-off-by: nithinraok <[email protected]> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <[email protected]> * updated scp to filelist Signed-off-by: nithinraok <[email protected]> * WaveGlow input type fixes (NVIDIA#4151) Signed-off-by: Jocelyn Huang <[email protected]> * notebooks' link, typo and import fix (NVIDIA#4158) * redo missing pr 4007 Signed-off-by: fayejf <[email protected]> * remove extremely unreliable links Signed-off-by: fayejf <[email protected]> * Thutmose tagger bug fixes (NVIDIA#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <[email protected]> * fix model location Signed-off-by: Alexandra Antonova <[email protected]> * fix model location Signed-off-by: Alexandra Antonova <[email protected]> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <[email protected]> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <[email protected]> * changed model location to nemo Signed-off-by: Alexandra Antonova <[email protected]> * style fix Signed-off-by: Alexandra Antonova <[email protected]> * fix space Signed-off-by: Alexandra Antonova <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> * update speaker docs (NVIDIA#4164) * update speaker docs Signed-off-by: nithinraok <[email protected]> * chunks -> segments Signed-off-by: nithinraok <[email protected]> * Khz -> kHz Signed-off-by: nithinraok <[email protected]> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <[email protected]> * Fix link to inference notebook (NVIDIA#5247) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> * Update ASR scores table (NVIDIA#5254) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * Fix links to speaker identification notebook (NVIDIA#5260) Signed-off-by: SeanNaren <[email protected]> Signed-off-by: SeanNaren <[email protected]> * Minor typo fixes in TTS tutorial (NVIDIA#5266) Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> * Pcla tutorial fixes (NVIDIA#5271) * Fixed typos Signed-off-by: Matvei Novikov <[email protected]> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <[email protected]> * Fixed typo Signed-off-by: Matvei Novikov <[email protected]> * Fixed branch variable Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Fix bug into Dialogue tutorial (NVIDIA#5277) * Typo fix (NVIDIA#5288) Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Fix dialogue tutorial bug (NVIDIA#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (NVIDIA#5310) * typo fix Signed-off-by: fayejf <[email protected]> * udpate transcribe Signed-off-by: fayejf <[email protected]> Signed-off-by: fayejf <[email protected]> * Add italian model checkpoints (NVIDIA#5316) Signed-off-by: Igor Gitman <[email protected]> Signed-off-by: Igor Gitman <[email protected]> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (NVIDIA#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <[email protected]> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (NVIDIA#5313) * fixes Signed-off-by: Matvei Novikov <[email protected]> * fixes Signed-off-by: Matvei Novikov <[email protected]> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (NVIDIA#5359) * Add Gradio App to ASR Docs (NVIDIA#5270) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * tutorial fixes (NVIDIA#5354) Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> * Add SDP documentation (NVIDIA#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <[email protected]> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <[email protected]> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <[email protected]> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <[email protected]> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <[email protected]> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <[email protected]> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <[email protected]> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <[email protected]> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <[email protected]> * Make config example less confusing Signed-off-by: Elena Rastorgueva <[email protected]> * Fix typo Signed-off-by: Elena Rastorgueva <[email protected]> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <[email protected]> * Remove unused imports Signed-off-by: Elena Rastorgueva <[email protected]> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <[email protected]> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Co-authored-by: Igor Gitman <[email protected]> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (NVIDIA#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <[email protected]> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA#5378) Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> * fix for num worker 0 causing issues in losses after 1 epoch (NVIDIA#5379) * Fixed bug in notebook (NVIDIA#5382) Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Virginia Adams <[email protected]> * Force MHA QKV onto fp32 (NVIDIA#5391) Signed-off-by: smajumdar <[email protected]> Signed-off-by: smajumdar <[email protected]> * Added scheduling variety * ref * Fix for prompt table restore error (NVIDIA#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <[email protected]> * Added more saftey checks Signed-off-by: Virginia Adams <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA#5410) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421) Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> * Megatron Export Update (NVIDIA#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <[email protected]> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <[email protected]> * added comment Signed-off-by: David Mosallanezhad <[email protected]> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <[email protected]> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <[email protected]> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <[email protected]> * updated del onnx_model place Signed-off-by: David Mosallanezhad <[email protected]> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <[email protected]> * disable pc test (NVIDIA#5426) Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA#5413) * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Disable sync_batch_comm in validation_step for GPT (NVIDIA#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <[email protected]> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <[email protected]> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <[email protected]> * Empty Signed-off-by: MaximumEntropy <[email protected]> * Comment out test Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421)" (NVIDIA#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <[email protected]> * Fix in config Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (NVIDIA#5242) (NVIDIA#5446) Signed-off-by: Vahid <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> * Set sync_batch_comm in other places (NVIDIA#5448) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Radtts 1.13 (NVIDIA#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Radtts 1.13 plus (NVIDIA#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Add num layers check (NVIDIA#5470) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Change to kwargs (NVIDIA#5475) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA#5339) (NVIDIA#5478) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA#5480) * updated export_utils Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <[email protected]> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> * minor bug fix (NVIDIA#5521) Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> * added set_start_method + function param bugfix (NVIDIA#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: ericharper <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <[email protected]> * remove notebook (NVIDIA#5548) Signed-off-by: ericharper <[email protected]> Signed-off-by: ericharper <[email protected]> * Remove broadcast (NVIDIA#5558) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * cleaning * Fix all gather while writing to a file during T5 finetuning (NVIDIA#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> * update readme Signed-off-by: ericharper <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <[email protected]> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <[email protected]> * storing * updated version Signed-off-by: Evgeniy Shabalin <[email protected]> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <[email protected]> * fixed issues Signed-off-by: Evgeniy Shabalin <[email protected]> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <[email protected]> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <[email protected]> * remove old cython code Signed-off-by: Evgeniy Shabalin <[email protected]> * remove old cython code Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <[email protected]> * Enhancements Signed-off-by: Evgeniy Shabalin <[email protected]> * Enhancements Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <[email protected]> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <[email protected]> * typecheck fix Signed-off-by: Evgeniy Shabalin <[email protected]> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <[email protected]> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <[email protected]> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <[email protected]> Signed-off-by: Jason <[email protected]> Signed-off-by: ericharper <[email protected]> Signed-off-by: Boris Fomitchev <[email protected]> Signed-off-by: nithinraok <[email protected]> Signed-off-by: Yang Zhang <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: PeganovAnton <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]> Signed-off-by: fayejf <[email protected]> Signed-off-by: smajumdar <[email protected]> Signed-off-by: SeanNaren <[email protected]> Signed-off-by: Matvei Novikov <[email protected]> Signed-off-by: Igor Gitman <[email protected]> Signed-off-by: Sasha Meister <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Elena Rastorgueva <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Yu Yao <[email protected]> Signed-off-by: David Mosallanezhad <[email protected]> Signed-off-by: Markel Sanz Ausin <[email protected]> Signed-off-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Vahid <[email protected]> Signed-off-by: Vladimir Bataev <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: Evgeniy Shabalin <[email protected]> Co-authored-by: jasonjjl1999 <[email protected]> Co-authored-by: [email protected] <[email protected]> Co-authored-by: Oktai Tatanov <[email protected]> Co-authored-by: jasonjjl1999 <[email protected]> Co-authored-by: martynwei <[email protected]> Co-authored-by: Ryan Hong <[email protected]> Co-authored-by: Jason <[email protected]> Co-authored-by: ericharper <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Nithin Rao <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: Ramanathan Arunachalam <[email protected]> Co-authored-by: bene-ges <[email protected]> Co-authored-by: Alexandra Antonova <[email protected]> Co-authored-by: Yang Zhang <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Evelina <[email protected]> Co-authored-by: PeganovAnton <[email protected]> Co-authored-by: Jocelyn <[email protected]> Co-authored-by: fayejf <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Sean Naren <[email protected]> Co-authored-by: Matvei Novikov <[email protected]> Co-authored-by: Zhilin Wang <[email protected]> Co-authored-by: Igor Gitman <[email protected]> Co-authored-by: Sasha Meister <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <[email protected]> Co-authored-by: Taejin Park <[email protected]> Co-authored-by: Adi Renduchintala <[email protected]> Co-authored-by: Virginia Adams <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> Co-authored-by: David <[email protected]> Co-authored-by: David Mosallanezhad <[email protected]> Co-authored-by: Boris Fomitchev <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Markel Sanz Ausin <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Vladimir Bataev <[email protected]> Co-authored-by: Vahid Noroozi <[email protected]> Co-authored-by: Xuesong Yang <[email protected]>

Export fixes for Riva

c6f217e

Signed-off-by: Boris Fomitchev <[email protected]>

borisfom requested a review from titu1994 November 23, 2022 23:06

Cleaning up training_utils

dffeb03

Signed-off-by: Boris Fomitchev <[email protected]>

titu1994 approved these changes Nov 23, 2022

View reviewed changes

titu1994 merged commit 10966a1 into NVIDIA:r1.13.0 Nov 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Export fixes for Riva #5496

Export fixes for Riva #5496

borisfom commented Nov 23, 2022

Export fixes for Riva #5496

Export fixes for Riva #5496

Conversation

borisfom commented Nov 23, 2022

What does this PR do ?