Fix for concat map dataset #5133

1-800-BAD-CODE · 2022-10-10T22:26:31Z

What does this PR do ?

Fix for concat map dataset #5117

The only caveat I know of is that it will not reshuffle each epoch when using round-robin. I'm not sure whether anyone actually uses round-robin in practice, though.

Usage

from collections import defaultdict

import torch
from pytorch_lightning import seed_everything

from nemo.collections.common.data import ConcatMapDataset

seed_everything(42)


class IntegerDataset(torch.utils.data.Dataset):
    def __init__(self, n: int, dataset_id: int):
        self._ints = list(range(n))
        self._id = torch.tensor(dataset_id)

    def __len__(self):
        return len(self._ints)

    def __getitem__(self, idx):
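        worker_info = torch.utils.data.get_worker_info()
        # get_worker_info() returns None in the main process (num_workers=0)
        worker_id = worker_info.id if worker_info is not None else -1
        return self._id, worker_id, self._ints[idx]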


for sampling_technique in ["temperature", "random", "round-robin"]:
    probs = [0.1, 0.2, 0.3]
    concat_dataset = ConcatMapDataset(
        datasets=[
            IntegerDataset(5, 0),
            IntegerDataset(10, 1),
            IntegerDataset(20, 2),
        ],
        seed=123456,
        sampling_temperature=2,
        sampling_probabilities=probs,
        sampling_technique=sampling_technique
    )

    dataloader = torch.utils.data.DataLoader(
        dataset=concat_dataset,
        batch_size=1,
        num_workers=2,
        shuffle=False  # Should be False to preserve round robin
    )

    print(f"Item-by-item results for technique '{sampling_technique}':")
    collected = defaultdict(list)
    for batch in dataloader:
        batch_dataset_id, batch_worker_id, batch_value = batch
        print(
            f"Dataset {batch_dataset_id.item()} worker {batch_worker_id.item()} fetched {batch_value.item()}"
        )
        collected[batch_dataset_id.item()].append(batch_value.item())

    print("*" * 80)
    print(f"Summary of returned results for each dataset for technique '{sampling_technique}'")
    if sampling_technique == "random":
        print(f"\t{probs=}")
    for dataset_id, values in sorted(collected.items(), key=lambda x: x[0]):
        dataset_raw_length = len(concat_dataset.datasets[dataset_id])
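        # Format matches the summary lines shown in the output below
        print(
            f"\tDataset #{dataset_id}; raw length {dataset_raw_length}; "
            f"num drawn = {len(values)}; values = {values}"
        )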
    print("*" * 80)
    print()

Output:

Item-by-item results for technique 'temperature':
Global seed set to 42
Dataset 2 worker 0 fetched 0
Dataset 2 worker 1 fetched 12
Dataset 1 worker 0 fetched 3
Dataset 1 worker 1 fetched 1
Dataset 0 worker 0 fetched 0
Dataset 2 worker 1 fetched 18
Dataset 1 worker 0 fetched 5
Dataset 0 worker 1 fetched 4
Dataset 0 worker 0 fetched 3
Dataset 2 worker 1 fetched 1
Dataset 0 worker 0 fetched 2
Dataset 0 worker 1 fetched 1
Dataset 2 worker 0 fetched 3
Dataset 1 worker 1 fetched 6
Dataset 0 worker 0 fetched 1
Dataset 1 worker 1 fetched 2
Dataset 2 worker 0 fetched 5
Dataset 0 worker 1 fetched 3
Dataset 1 worker 0 fetched 9
Dataset 0 worker 1 fetched 0
Dataset 2 worker 0 fetched 9
Dataset 2 worker 1 fetched 8
Dataset 2 worker 0 fetched 13
Dataset 2 worker 1 fetched 7
Dataset 1 worker 0 fetched 4
Dataset 2 worker 1 fetched 19
Dataset 2 worker 0 fetched 4
Dataset 2 worker 1 fetched 16
Dataset 2 worker 0 fetched 10
Dataset 2 worker 1 fetched 17
Dataset 0 worker 0 fetched 2
Dataset 0 worker 1 fetched 4
Dataset 1 worker 0 fetched 7
Dataset 1 worker 1 fetched 0
Dataset 0 worker 0 fetched 1
Dataset 1 worker 1 fetched 8
Dataset 0 worker 0 fetched 2
Dataset 1 worker 1 fetched 5
Dataset 2 worker 0 fetched 14
Dataset 2 worker 1 fetched 2
Dataset 1 worker 0 fetched 0
Dataset 2 worker 1 fetched 6
Dataset 1 worker 0 fetched 7
Dataset 2 worker 1 fetched 15
Dataset 0 worker 0 fetched 3
Dataset 2 worker 1 fetched 11
********************************************************************************
Summary of returned results for each dataset for technique 'temperature'
	Dataset #0; raw length 5; num drawn = 13; values = [0, 4, 3, 2, 1, 1, 3, 0, 2, 4, 1, 2, 3]
	Dataset #1; raw length 10; num drawn = 13; values = [3, 1, 5, 6, 2, 9, 4, 7, 0, 8, 5, 0, 7]
	Dataset #2; raw length 20; num drawn = 20; values = [0, 12, 18, 1, 3, 5, 9, 8, 13, 7, 19, 4, 16, 10, 17, 14, 2, 6, 15, 11]
********************************************************************************

Item-by-item results for technique 'random':
Dataset 2 worker 0 fetched 0
Dataset 2 worker 1 fetched 12
Dataset 2 worker 0 fetched 18
Dataset 1 worker 1 fetched 3
Dataset 0 worker 0 fetched 0
Dataset 2 worker 1 fetched 1
Dataset 1 worker 0 fetched 1
Dataset 0 worker 1 fetched 4
Dataset 0 worker 0 fetched 3
Dataset 2 worker 1 fetched 3
Dataset 0 worker 0 fetched 2
Dataset 1 worker 1 fetched 5
Dataset 1 worker 0 fetched 6
Dataset 2 worker 1 fetched 5
Dataset 1 worker 0 fetched 2
Dataset 2 worker 1 fetched 9
Dataset 2 worker 0 fetched 8
Dataset 2 worker 1 fetched 13
Dataset 0 worker 0 fetched 1
Dataset 1 worker 1 fetched 9
Dataset 1 worker 0 fetched 4
Dataset 0 worker 1 fetched 0
Dataset 2 worker 0 fetched 7
Dataset 2 worker 1 fetched 19
Dataset 2 worker 0 fetched 4
Dataset 2 worker 1 fetched 16
Dataset 1 worker 0 fetched 7
Dataset 2 worker 1 fetched 10
Dataset 2 worker 0 fetched 17
Dataset 2 worker 1 fetched 14
Dataset 2 worker 0 fetched 2
Dataset 2 worker 1 fetched 6
Dataset 0 worker 0 fetched 3
Dataset 0 worker 1 fetched 2
Dataset 0 worker 0 fetched 1
Dataset 1 worker 1 fetched 0
Dataset 1 worker 0 fetched 8
Dataset 1 worker 1 fetched 8
Dataset 0 worker 0 fetched 4
Dataset 1 worker 1 fetched 7
Dataset 2 worker 0 fetched 15
Dataset 2 worker 1 fetched 11
********************************************************************************
Summary of returned results for each dataset for technique 'random'
	probs=[0.1, 0.2, 0.3]
	Dataset #0; raw length 5; num drawn = 10; values = [0, 4, 3, 2, 1, 0, 3, 2, 1, 4]
	Dataset #1; raw length 10; num drawn = 12; values = [3, 1, 5, 6, 2, 9, 4, 7, 0, 8, 8, 7]
	Dataset #2; raw length 20; num drawn = 20; values = [0, 12, 18, 1, 3, 5, 9, 8, 13, 7, 19, 4, 16, 10, 17, 14, 2, 6, 15, 11]
********************************************************************************

Item-by-item results for technique 'round-robin':
Dataset 0 worker 0 fetched 0
Dataset 1 worker 1 fetched 3
Dataset 2 worker 0 fetched 0
Dataset 0 worker 1 fetched 4
Dataset 1 worker 0 fetched 1
Dataset 2 worker 1 fetched 12
Dataset 0 worker 0 fetched 3
Dataset 1 worker 1 fetched 5
Dataset 2 worker 0 fetched 18
Dataset 0 worker 1 fetched 2
Dataset 1 worker 0 fetched 6
Dataset 2 worker 1 fetched 1
Dataset 0 worker 0 fetched 1
Dataset 1 worker 1 fetched 2
Dataset 2 worker 0 fetched 3
Dataset 0 worker 1 fetched 0
Dataset 1 worker 0 fetched 9
Dataset 2 worker 1 fetched 5
Dataset 0 worker 0 fetched 2
Dataset 1 worker 1 fetched 4
Dataset 2 worker 0 fetched 9
Dataset 0 worker 1 fetched 1
Dataset 1 worker 0 fetched 7
Dataset 2 worker 1 fetched 8
Dataset 0 worker 0 fetched 3
Dataset 1 worker 1 fetched 0
Dataset 2 worker 0 fetched 13
Dataset 0 worker 1 fetched 4
Dataset 1 worker 0 fetched 8
Dataset 2 worker 1 fetched 7
Dataset 0 worker 0 fetched 1
Dataset 1 worker 1 fetched 6
Dataset 2 worker 0 fetched 19
Dataset 0 worker 1 fetched 3
Dataset 1 worker 0 fetched 3
Dataset 2 worker 1 fetched 4
Dataset 0 worker 0 fetched 2
Dataset 1 worker 1 fetched 5
Dataset 2 worker 0 fetched 16
Dataset 0 worker 1 fetched 4
Dataset 1 worker 0 fetched 9
Dataset 2 worker 1 fetched 10
Dataset 0 worker 0 fetched 0
Dataset 1 worker 1 fetched 8
Dataset 2 worker 0 fetched 17
Dataset 0 worker 1 fetched 3
Dataset 1 worker 0 fetched 7
Dataset 2 worker 1 fetched 14
Dataset 0 worker 0 fetched 2
Dataset 1 worker 1 fetched 1
Dataset 2 worker 0 fetched 2
Dataset 0 worker 1 fetched 0
Dataset 1 worker 0 fetched 2
Dataset 2 worker 1 fetched 6
Dataset 0 worker 0 fetched 1
Dataset 1 worker 1 fetched 4
Dataset 2 worker 0 fetched 15
Dataset 0 worker 1 fetched 4
Dataset 1 worker 0 fetched 0
Dataset 2 worker 1 fetched 11
********************************************************************************
Summary of returned results for each dataset for technique 'round-robin'
	Dataset #0; raw length 5; num drawn = 20; values = [0, 4, 3, 2, 1, 0, 2, 1, 3, 4, 1, 3, 2, 4, 0, 3, 2, 0, 1, 4]
	Dataset #1; raw length 10; num drawn = 20; values = [3, 1, 5, 6, 2, 9, 4, 7, 0, 8, 6, 3, 5, 9, 8, 7, 1, 2, 4, 0]
	Dataset #2; raw length 20; num drawn = 20; values = [0, 12, 18, 1, 3, 5, 9, 8, 13, 7, 19, 4, 16, 10, 17, 14, 2, 6, 15, 11]
********************************************************************************
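
Reading the round-robin summary above: every dataset is drawn 20 times (the length of the longest dataset), the dataset ids cycle 0, 1, 2, and the shorter datasets appear to be repeated as successive permutations. The following is a minimal sketch of an index table with those properties. It is an illustration only, not the code in this PR; the function name `round_robin_table` and its parameters are hypothetical, and all dataset lengths are assumed to be positive.

```python
import random
from typing import List, Tuple


def round_robin_table(lengths: List[int], seed: int) -> List[Tuple[int, int]]:
    """Build (dataset_id, sample_id) pairs, cycling over the datasets
    until the longest dataset has been exhausted once.

    Hypothetical helper for illustration; assumes every length is > 0.
    """
    rng = random.Random(seed)
    longest = max(lengths)
    per_dataset: List[List[int]] = []
    for n in lengths:
        order: List[int] = []
        # Shorter datasets are repeated; each pass is a fresh permutation.
        while len(order) < longest:
            pass_order = list(range(n))
            rng.shuffle(pass_order)
            order.extend(pass_order)
        per_dataset.append(order[:longest])
    # Interleave the datasets: 0, 1, 2, 0, 1, 2, ...
    return [
        (d, per_dataset[d][step])
        for step in range(longest)
        for d in range(len(lengths))
    ]


table = round_robin_table([5, 10, 20], seed=123456)
print(len(table))  # 60
print(table[:6])
```

Because the table is built once from the seed, the same global index always maps to the same sample, which is also why this kind of construction does not reshuffle between epochs unless the table is rebuilt.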

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?
Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

New Feature
Bugfix
Documentation

github-actions · 2022-10-27T02:13:06Z

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

okuchaiev · 2022-10-27T17:14:04Z

@1-800-BAD-CODE is this still WIP or ready for review?

1-800-BAD-CODE · 2022-10-31T00:39:33Z

@okuchaiev ready for review; I previously changed the status but didn't update the title.

I'll add some of my own critical comments:

I believe that round-robin should shuffle every epoch, but this PR does not implement that.
For large datasets, the non-round-robin techniques don't re-sample from the dataset each epoch, and some samples from each dataset are used more than the others (the ones chosen during the initial oversampling).

These are slightly harder to solve for a map-style dataset, since when the data loader says "give me item 113" we should always return the same item "113", whereas with the iterable-style datasets we simply need to return an arbitrary unique item.
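
To make the map-style constraint concrete, here is a minimal sketch under stated assumptions. The class `EpochSeededConcat` and its `set_epoch` method are hypothetical and are not part of NeMo or of this PR. The idea is that the index-to-item mapping stays fixed within an epoch, so "item 113" is always the same item, and is rebuilt from `(seed, epoch)` between epochs so that every process deriving the table gets the same mapping.

```python
import random
from typing import List, Sequence, Tuple


class EpochSeededConcat:
    """Hypothetical map-style wrapper; not the NeMo ConcatMapDataset."""

    def __init__(self, datasets: Sequence[Sequence], seed: int = 0):
        self.datasets = list(datasets)
        self.seed = seed
        self._table: List[Tuple[int, int]] = []
        self.set_epoch(0)

    def set_epoch(self, epoch: int) -> None:
        # Derive the RNG from (seed, epoch) only, so the table is reproducible
        # in any process that knows both values.
        rng = random.Random(self.seed * 1_000_003 + epoch)
        table = [
            (d, i)
            for d, ds in enumerate(self.datasets)
            for i in range(len(ds))
        ]
        rng.shuffle(table)
        self._table = table

    def __len__(self) -> int:
        return len(self._table)

    def __getitem__(self, idx: int):
        # Within an epoch the same idx always resolves to the same sample.
        d, i = self._table[idx]
        return self.datasets[d][i]


concat = EpochSeededConcat([list(range(5)), list(range(100, 110))], seed=7)
epoch0 = [concat[i] for i in range(len(concat))]
concat.set_epoch(1)
epoch1 = [concat[i] for i in range(len(concat))]
```

This sketch leaves out oversampling entirely; it only shows where a per-epoch rebuild could hook in. With a `DataLoader`, `set_epoch` would need to be called before the workers for that epoch are created, since each worker operates on its own copy of the dataset object.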

MaximumEntropy

Thanks for the fixes!

* change for concat map dataset
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
* Exhaust longest dataset
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci

Co-authored-by: 1-800-BAD-CODE <>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: Sandeep Subramanian <[email protected]>

* first commit on eval_diar_with_asr.py Signed-off-by: Taejin Park <[email protected]> * Add a standalone diarization-ASR evaluation transcript Signed-off-by: Taejin Park <[email protected]> * Fixed examples in docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixed staticmethod error Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added description on eval modes Signed-off-by: Taejin Park <[email protected]> * adding diar_infer_general.yaml Signed-off-by: Taejin Park <[email protected]> * fix msdd_model in general yaml file Signed-off-by: Taejin Park <[email protected]> * fixed errors in yaml file Signed-off-by: Taejin Park <[email protected]> * combine into 1 commit Signed-off-by: Taejin Park <[email protected]> * Added description on eval modes Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Add MoE support for T5 model (w/o expert parallel) (#5409) * clean Signed-off-by: Abhinav Khattar <[email protected]> * kwarg ref Signed-off-by: Abhinav Khattar <[email protected]> * fix Signed-off-by: Abhinav Khattar <[email protected]> * fix Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * extra args Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * rm prints Signed-off-by: Abhinav Khattar <[email protected]> * style Signed-off-by: Abhinav Khattar 
<[email protected]> * review comments Signed-off-by: Abhinav Khattar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * review comments Signed-off-by: Abhinav Khattar <[email protected]> * review comments Signed-off-by: Abhinav Khattar <[email protected]> * fix Signed-off-by: Abhinav Khattar <[email protected]> Signed-off-by: Abhinav Khattar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) (#5416) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> * Fix for concat map dataset (#5133) * change for concat map dataset * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Exhaust longest dataset * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: 1-800-BAD-CODE <> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> * Add temporary fix for CUDA issue in Dockerfile (#5421) (#5422) Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> * Fix GPT generation when using sentencepiece tokenizer (#5413) (#5428) * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Yi Dong <[email 
protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) (#5432) This reverts commit 0718b17. 
Co-authored-by: yaoyu-33 <[email protected]> * [ITN] fix year date graph, cardinals extension for hundreds (#5435) * wip Signed-off-by: ekmb <[email protected]> * add lociko's hundreds extension for cardinals Signed-off-by: ekmb <[email protected]> * add optional end Signed-off-by: ekmb <[email protected]> * restart ci Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> * update doc in terms of get_label for lang id model (#5366) * reflect PR 5278 ion doc Signed-off-by: fayejf <[email protected]> * reflect comment Signed-off-by: fayejf <[email protected]> Signed-off-by: fayejf <[email protected]> * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) (#5433) * Revert workers workaround Signed-off-by: MaximumEntropy <[email protected]> * Fix in config Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Fixed bug in notebook (#5382) (#5394) Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Co-authored-by: Virginia Adams <[email protected]> * Fixing bug in Megatron BERT when loss mask is all zeros (#5424) * Fixing bug when loss mask is fully zero Signed-off-by: Shanmugam Ramasamy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update megatron_bert_model.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * Update dataset_utils.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update dataset_utils.py 
Signed-off-by: Shanmugam Ramasamy <[email protected]> * Update dataset_utils.py Signed-off-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <[email protected]> * Use updated API for overlapping grad sync with pipeline parallelism (#5236) Signed-off-by: Tim Moon <[email protected]> Signed-off-by: Tim Moon <[email protected]> * support to disable sequence length + 1 input tokens for each sample in MegatronGPT (#5363) * support to disable sequence length + 1 input tokens for MegatronGPT * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Anmol Gupta <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <[email protected]> * [TTS] Create script for processing TTS training audio (#5262) * Create script for processing TTS training audio * Update VAD trimming logic * Remove unused import Signed-off-by: Ryan <[email protected]> * [TTS] remove useless logic for set_tokenizer. 
(#5430) Signed-off-by: Xuesong Yang <[email protected]> * Fix setting up of `ReduceLROnPlateau` learning rate scheduler (#5444) * Fix tests Signed-off-by: PeganovAnton <[email protected]> * Add accidentally lost changes Signed-off-by: PeganovAnton <[email protected]> Signed-off-by: PeganovAnton <[email protected]> * Create codeql.yml (#5445) Signed-off-by: Somshubra Majumdar <[email protected]> Signed-off-by: Somshubra Majumdar <[email protected]> * Fix for getting tokenizer in character-based ASR models when using tarred dataset (#5442) Signed-off-by: Jonghwan Hyeon <[email protected]> Signed-off-by: Jonghwan Hyeon <[email protected]> * Combine 5 commits adding diar_infer_general.yaml Signed-off-by: Taejin Park <[email protected]> Update codeql.yml Signed-off-by: Somshubra Majumdar <[email protected]> Update codeql.yml Signed-off-by: Somshubra Majumdar <[email protected]> fix msdd_model in general yaml file Signed-off-by: Taejin Park <[email protected]> fixed errors in yaml file Signed-off-by: Taejin Park <[email protected]> * moved eval_der function and fixed tqdm options Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Changed minor error in docstrings Signed-off-by: Taejin Park <[email protected]> * removed score_labels and changed leave=True Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Abhinav Khattar <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Yu Yao <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: fayejf <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Tim Moon <[email protected]> Signed-off-by: Ryan <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: PeganovAnton <[email protected]> Signed-off-by: Somshubra 
Majumdar <[email protected]> Signed-off-by: Jonghwan Hyeon <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Abhinav Khattar <[email protected]> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Shane Carroll <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Evelina <[email protected]> Co-authored-by: fayejf <[email protected]> Co-authored-by: Virginia Adams <[email protected]> Co-authored-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: Tim Moon <[email protected]> Co-authored-by: anmolgupt <[email protected]> Co-authored-by: Anmol Gupta <[email protected]> Co-authored-by: Ryan Langman <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: PeganovAnton <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Jonghwan Hyeon <[email protected]>

* change for concat map dataset
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* Exhaust longest dataset
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
Co-authored-by: 1-800-BAD-CODE <>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: Sandeep Subramanian <[email protected]>
Signed-off-by: Hainan Xu <[email protected]>

* first commit on eval_diar_with_asr.py Signed-off-by: Taejin Park <[email protected]> * Add a standalone diarization-ASR evaluation transcript Signed-off-by: Taejin Park <[email protected]> * Fixed examples in docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixed staticmethod error Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added description on eval modes Signed-off-by: Taejin Park <[email protected]> * adding diar_infer_general.yaml Signed-off-by: Taejin Park <[email protected]> * fix msdd_model in general yaml file Signed-off-by: Taejin Park <[email protected]> * fixed errors in yaml file Signed-off-by: Taejin Park <[email protected]> * combine into 1 commit Signed-off-by: Taejin Park <[email protected]> * Added description on eval modes Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Add MoE support for T5 model (w/o expert parallel) (NVIDIA#5409) * clean Signed-off-by: Abhinav Khattar <[email protected]> * kwarg ref Signed-off-by: Abhinav Khattar <[email protected]> * fix Signed-off-by: Abhinav Khattar <[email protected]> * fix Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * extra args Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * rm prints Signed-off-by: Abhinav Khattar <[email protected]> * style Signed-off-by: Abhinav Khattar 
<[email protected]> * review comments Signed-off-by: Abhinav Khattar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * review comments Signed-off-by: Abhinav Khattar <[email protected]> * review comments Signed-off-by: Abhinav Khattar <[email protected]> * fix Signed-off-by: Abhinav Khattar <[email protected]> Signed-off-by: Abhinav Khattar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA#5410) (NVIDIA#5416) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> * Fix for concat map dataset (NVIDIA#5133) * change for concat map dataset * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Exhaust longest dataset * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: 1-800-BAD-CODE <> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> * Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421) (NVIDIA#5422) Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA#5413) (NVIDIA#5428) * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Sandeep Subramanian <[email 
protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA#5339) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421)" (NVIDIA#5431) (NVIDIA#5432) This reverts commit 0718b17. 
Co-authored-by: yaoyu-33 <[email protected]> * [ITN] fix year date graph, cardinals extension for hundreds (NVIDIA#5435) * wip Signed-off-by: ekmb <[email protected]> * add lociko's hundreds extension for cardinals Signed-off-by: ekmb <[email protected]> * add optional end Signed-off-by: ekmb <[email protected]> * restart ci Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> * update doc in terms of get_label for lang id model (NVIDIA#5366) * reflect PR 5278 ion doc Signed-off-by: fayejf <[email protected]> * reflect comment Signed-off-by: fayejf <[email protected]> Signed-off-by: fayejf <[email protected]> * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA#5420) (NVIDIA#5433) * Revert workers workaround Signed-off-by: MaximumEntropy <[email protected]> * Fix in config Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Fixed bug in notebook (NVIDIA#5382) (NVIDIA#5394) Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Co-authored-by: Virginia Adams <[email protected]> * Fixing bug in Megatron BERT when loss mask is all zeros (NVIDIA#5424) * Fixing bug when loss mask is fully zero Signed-off-by: Shanmugam Ramasamy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update megatron_bert_model.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * Update dataset_utils.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see 
https://pre-commit.ci * Update dataset_utils.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * Update dataset_utils.py Signed-off-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <[email protected]> * Use updated API for overlapping grad sync with pipeline parallelism (NVIDIA#5236) Signed-off-by: Tim Moon <[email protected]> Signed-off-by: Tim Moon <[email protected]> * support to disable sequence length + 1 input tokens for each sample in MegatronGPT (NVIDIA#5363) * support to disable sequence length + 1 input tokens for MegatronGPT * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Anmol Gupta <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <[email protected]> * [TTS] Create script for processing TTS training audio (NVIDIA#5262) * Create script for processing TTS training audio * Update VAD trimming logic * Remove unused import Signed-off-by: Ryan <[email protected]> * [TTS] remove useless logic for set_tokenizer. 
(NVIDIA#5430) Signed-off-by: Xuesong Yang <[email protected]> * Fix setting up of `ReduceLROnPlateau` learning rate scheduler (NVIDIA#5444) * Fix tests Signed-off-by: PeganovAnton <[email protected]> * Add accidentally lost changes Signed-off-by: PeganovAnton <[email protected]> Signed-off-by: PeganovAnton <[email protected]> * Create codeql.yml (NVIDIA#5445) Signed-off-by: Somshubra Majumdar <[email protected]> Signed-off-by: Somshubra Majumdar <[email protected]> * Fix for getting tokenizer in character-based ASR models when using tarred dataset (NVIDIA#5442) Signed-off-by: Jonghwan Hyeon <[email protected]> Signed-off-by: Jonghwan Hyeon <[email protected]> * Combine 5 commits adding diar_infer_general.yaml Signed-off-by: Taejin Park <[email protected]> Update codeql.yml Signed-off-by: Somshubra Majumdar <[email protected]> Update codeql.yml Signed-off-by: Somshubra Majumdar <[email protected]> fix msdd_model in general yaml file Signed-off-by: Taejin Park <[email protected]> fixed errors in yaml file Signed-off-by: Taejin Park <[email protected]> * moved eval_der function and fixed tqdm options Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Changed minor error in docstrings Signed-off-by: Taejin Park <[email protected]> * removed score_labels and changed leave=True Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Abhinav Khattar <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Yu Yao <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: fayejf <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Tim Moon <[email protected]> Signed-off-by: Ryan <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: PeganovAnton <[email protected]> 
Signed-off-by: Somshubra Majumdar <[email protected]> Signed-off-by: Jonghwan Hyeon <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Abhinav Khattar <[email protected]> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Shane Carroll <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Evelina <[email protected]> Co-authored-by: fayejf <[email protected]> Co-authored-by: Virginia Adams <[email protected]> Co-authored-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: Tim Moon <[email protected]> Co-authored-by: anmolgupt <[email protected]> Co-authored-by: Anmol Gupta <[email protected]> Co-authored-by: Ryan Langman <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: PeganovAnton <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Jonghwan Hyeon <[email protected]> Signed-off-by: Hainan Xu <[email protected]>

* change for concat map dataset
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* Exhaust longest dataset
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
Co-authored-by: 1-800-BAD-CODE <>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: Sandeep Subramanian <[email protected]>

* change for concat map dataset
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* Exhaust longest dataset
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
Co-authored-by: 1-800-BAD-CODE <>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <[email protected]>
Co-authored-by: Sandeep Subramanian <[email protected]>
Signed-off-by: andrusenkoau <[email protected]>

* first commit on eval_diar_with_asr.py Signed-off-by: Taejin Park <[email protected]> * Add a standalone diarization-ASR evaluation transcript Signed-off-by: Taejin Park <[email protected]> * Fixed examples in docstrings Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixed staticmethod error Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added description on eval modes Signed-off-by: Taejin Park <[email protected]> * adding diar_infer_general.yaml Signed-off-by: Taejin Park <[email protected]> * fix msdd_model in general yaml file Signed-off-by: Taejin Park <[email protected]> * fixed errors in yaml file Signed-off-by: Taejin Park <[email protected]> * combine into 1 commit Signed-off-by: Taejin Park <[email protected]> * Added description on eval modes Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Add MoE support for T5 model (w/o expert parallel) (NVIDIA#5409) * clean Signed-off-by: Abhinav Khattar <[email protected]> * kwarg ref Signed-off-by: Abhinav Khattar <[email protected]> * fix Signed-off-by: Abhinav Khattar <[email protected]> * fix Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * extra args Signed-off-by: Abhinav Khattar <[email protected]> * test Signed-off-by: Abhinav Khattar <[email protected]> * rm prints Signed-off-by: Abhinav Khattar <[email protected]> * style Signed-off-by: Abhinav Khattar 
<[email protected]> * review comments Signed-off-by: Abhinav Khattar <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * review comments Signed-off-by: Abhinav Khattar <[email protected]> * review comments Signed-off-by: Abhinav Khattar <[email protected]> * fix Signed-off-by: Abhinav Khattar <[email protected]> Signed-off-by: Abhinav Khattar <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA#5410) (NVIDIA#5416) Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> * Fix for concat map dataset (NVIDIA#5133) * change for concat map dataset * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Exhaust longest dataset * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: 1-800-BAD-CODE <> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> * Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421) (NVIDIA#5422) Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> Signed-off-by: Yu Yao <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA#5413) (NVIDIA#5428) * Fix Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Sandeep Subramanian <[email 
protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA#5339) * Initial refactor Signed-off-by: MaximumEntropy <[email protected]> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <[email protected]> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <[email protected]> * Fixes for eval Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <[email protected]> * Refactor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <[email protected]> * Remove comments Signed-off-by: MaximumEntropy <[email protected]> * Minor Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <[email protected]> * Remove old comment Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA#5421)" (NVIDIA#5431) (NVIDIA#5432) This reverts commit 0718b17. 
Co-authored-by: yaoyu-33 <[email protected]> * [ITN] fix year date graph, cardinals extension for hundreds (NVIDIA#5435) * wip Signed-off-by: ekmb <[email protected]> * add lociko's hundreds extension for cardinals Signed-off-by: ekmb <[email protected]> * add optional end Signed-off-by: ekmb <[email protected]> * restart ci Signed-off-by: ekmb <[email protected]> Signed-off-by: ekmb <[email protected]> * update doc in terms of get_label for lang id model (NVIDIA#5366) * reflect PR 5278 ion doc Signed-off-by: fayejf <[email protected]> * reflect comment Signed-off-by: fayejf <[email protected]> Signed-off-by: fayejf <[email protected]> * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA#5420) (NVIDIA#5433) * Revert workers workaround Signed-off-by: MaximumEntropy <[email protected]> * Fix in config Signed-off-by: MaximumEntropy <[email protected]> * Fix Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> * Fixed bug in notebook (NVIDIA#5382) (NVIDIA#5394) Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Co-authored-by: Virginia Adams <[email protected]> * Fixing bug in Megatron BERT when loss mask is all zeros (NVIDIA#5424) * Fixing bug when loss mask is fully zero Signed-off-by: Shanmugam Ramasamy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update megatron_bert_model.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * Update dataset_utils.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see 
https://pre-commit.ci * Update dataset_utils.py Signed-off-by: Shanmugam Ramasamy <[email protected]> * Update dataset_utils.py Signed-off-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <[email protected]> * Use updated API for overlapping grad sync with pipeline parallelism (NVIDIA#5236) Signed-off-by: Tim Moon <[email protected]> Signed-off-by: Tim Moon <[email protected]> * support to disable sequence length + 1 input tokens for each sample in MegatronGPT (NVIDIA#5363) * support to disable sequence length + 1 input tokens for MegatronGPT * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Anmol Gupta <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <[email protected]> * [TTS] Create script for processing TTS training audio (NVIDIA#5262) * Create script for processing TTS training audio * Update VAD trimming logic * Remove unused import Signed-off-by: Ryan <[email protected]> * [TTS] remove useless logic for set_tokenizer. 
(NVIDIA#5430) Signed-off-by: Xuesong Yang <[email protected]> * Fix setting up of `ReduceLROnPlateau` learning rate scheduler (NVIDIA#5444) * Fix tests Signed-off-by: PeganovAnton <[email protected]> * Add accidentally lost changes Signed-off-by: PeganovAnton <[email protected]> Signed-off-by: PeganovAnton <[email protected]> * Create codeql.yml (NVIDIA#5445) Signed-off-by: Somshubra Majumdar <[email protected]> Signed-off-by: Somshubra Majumdar <[email protected]> * Fix for getting tokenizer in character-based ASR models when using tarred dataset (NVIDIA#5442) Signed-off-by: Jonghwan Hyeon <[email protected]> Signed-off-by: Jonghwan Hyeon <[email protected]> * Combine 5 commits adding diar_infer_general.yaml Signed-off-by: Taejin Park <[email protected]> Update codeql.yml Signed-off-by: Somshubra Majumdar <[email protected]> Update codeql.yml Signed-off-by: Somshubra Majumdar <[email protected]> fix msdd_model in general yaml file Signed-off-by: Taejin Park <[email protected]> fixed errors in yaml file Signed-off-by: Taejin Park <[email protected]> * moved eval_der function and fixed tqdm options Signed-off-by: Taejin Park <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Changed minor error in docstrings Signed-off-by: Taejin Park <[email protected]> * removed score_labels and changed leave=True Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Taejin Park <[email protected]> Signed-off-by: Abhinav Khattar <[email protected]> Signed-off-by: MaximumEntropy <[email protected]> Signed-off-by: Yu Yao <[email protected]> Signed-off-by: ekmb <[email protected]> Signed-off-by: fayejf <[email protected]> Signed-off-by: Virginia Adams <[email protected]> Signed-off-by: Shanmugam Ramasamy <[email protected]> Signed-off-by: Tim Moon <[email protected]> Signed-off-by: Ryan <[email protected]> Signed-off-by: Xuesong Yang <[email protected]> Signed-off-by: PeganovAnton <[email protected]> 
Signed-off-by: Somshubra Majumdar <[email protected]> Signed-off-by: Jonghwan Hyeon <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Abhinav Khattar <[email protected]> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <[email protected]> Co-authored-by: Shane Carroll <[email protected]> Co-authored-by: Oleksii Kuchaiev <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> Co-authored-by: Yi Dong <[email protected]> Co-authored-by: Evelina <[email protected]> Co-authored-by: fayejf <[email protected]> Co-authored-by: Virginia Adams <[email protected]> Co-authored-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: Tim Moon <[email protected]> Co-authored-by: anmolgupt <[email protected]> Co-authored-by: Anmol Gupta <[email protected]> Co-authored-by: Ryan Langman <[email protected]> Co-authored-by: Xuesong Yang <[email protected]> Co-authored-by: PeganovAnton <[email protected]> Co-authored-by: Somshubra Majumdar <[email protected]> Co-authored-by: Jonghwan Hyeon <[email protected]> Signed-off-by: andrusenkoau <[email protected]>

1-800-BAD-CODE and others added 7 commits October 10, 2022 18:19

change for concat map dataset

4918ee4

[pre-commit.ci] auto fixes from pre-commit.com hooks (for more information, see https://pre-commit.ci)

7211fb7

Exhaust longest dataset

8d320ce

Merge remote-tracking branch 'upstream/main' into fix_concat_map_dataset

6475852

Merge branch 'NVIDIA:main' into fix_concat_map_dataset

20baba8

resolve upstream

d8b4b32

[pre-commit.ci] auto fixes from pre-commit.com hooks (for more information, see https://pre-commit.ci)

6d4be8e

1-800-BAD-CODE marked this pull request as ready for review October 11, 2022 00:02

MaximumEntropy requested review from aklife97 and MaximumEntropy October 12, 2022 17:22

github-actions bot added the stale label Oct 27, 2022

Merge branch 'main' into fix_concat_map_dataset

3a16148

github-actions bot removed the stale label Oct 28, 2022

1-800-BAD-CODE changed the title from "[WIP] fix for concat map dataset" to "Fix for concat map dataset" on Oct 31, 2022

Merge branch 'main' into fix_concat_map_dataset

623bd9c

MaximumEntropy approved these changes Nov 4, 2022

Merge branch 'main' into fix_concat_map_dataset

a1acd0b

okuchaiev merged commit c21074e into NVIDIA:main Nov 15, 2022

1-800-BAD-CODE commented Oct 10, 2022 (edited)

github-actions bot commented Oct 27, 2022

okuchaiev commented Oct 27, 2022

1-800-BAD-CODE commented Oct 31, 2022

MaximumEntropy left a comment
