TTS Collection #874

blisc · 2020-07-20T14:53:35Z

Adds TTS to candidate

Changes outside TTS collection

ASR collection

rename _AudioDataset to _AudioTextDataset
_AudioTextDataset's signature changed, it no longer accepts featurizer but accepts sample_rate, int_values, and augmentor so that it can instantiate WaveformFeaturizer itself.
AudioPreprocessor no longer has a get_seq_len function. get_features is now expected to return processed_signal and processed_length

Core collection

Adds LogEpochTimeCallback to nemo.collections.core.callbacks

Core

from_config_dict now accepts *args and **kwargs passthroughs to hydra.utils.instantiate
Tweaked error messages inside type checking so it is more informative

TODO

Signed-off-by: Jason <[email protected]>

lgtm-com · 2020-07-28T20:55:32Z

This pull request introduces 1 alert when merging e173943 into b883b31 - view on LGTM.com

new alerts:

1 for Unused import

Signed-off-by: Jason <[email protected]>

lgtm-com · 2020-07-28T21:58:46Z

This pull request introduces 1 alert when merging 3f48fac into 054cc02 - view on LGTM.com

new alerts:

1 for Unused import

Signed-off-by: Jason <[email protected]>

okuchaiev

Overall it looks good, but there are few small issues here and there - please see comments.

okuchaiev · 2020-07-29T17:35:21Z

nemo/collections/tts/models/tacotron2.py

+        return mel_loss + gate_loss
+
+
+@experimental  # TODO: Need to implement abstract methods: list_available_models, from_pretrained, export but how?


you'll only need to implement "list_available_model()" once you have cloud locations

Now I get TypeError: Can't instantiate abstract class Tacotron2Model with abstract methods export

And hydra.errors.HydraException: Error calling 'nemo.collections.tts.modules.tacotron2.Encoder' : Can't instantiate abstract class Encoder with abstract methods restore_from, save_to

"Can't instantiate abstract class Tacotron2Model with abstract method" this should not happen if you merge/rebase latest candidate into your branch

okuchaiev · 2020-07-29T17:39:14Z

nemo/collections/tts/models/tacotron2.py

+    validation_ds: Optional[Dict] = None
+
+
+class Tacotron2Loss(Loss):


this class should go to nemo/collections/tts/losses/* or to nemo/collections/common/losses/* and also get some docstring

Why move it? This loss is only ever going to be used in this model. It is so redundant that I have to define it as it's own separate class rather than a function of Tacotron2Model.

With the new function-level override of typecheck(), it doesn't even need to inherit from Typing to use NeMo's typing. Tacotron2Loss doesn't have an init nor does it use self, it's a function.

For consistency. We put all losses other losses under collections/collection_name/losses/* . why make exception here?

Moved.
I maintain that collections/collection_name/losses/* shouldn't exist apart from losses inside common. NLP's losses is empty and ASR only has CTC that is only used in CTC models so it should be moved there and losses folder should be removed.

Let me qualify my above statement. If a loss is only ever going to be used in a single model, it should not be in the losses folder. And if the losses folder is empty, it should be deleted.

Side note, It is incorrect that the class does not need to implement Typing. All of the actual tests lie inside the Typing class, and the decorator simply dispatches these checks. This is by design to support graph level checks. The decor will raise Runtime error if placed on a class that doesn't implement Typing.

okuchaiev · 2020-07-29T17:39:38Z

nemo/collections/tts/models/tacotron2.py

+
+@experimental  # TODO: Need to implement abstract methods: list_available_models, from_pretrained, export but how?
+class Tacotron2Model(ModelPT):
+    # TODO: tensorboard for training


what is the problem with TB for training?

nemo/collections/tts/models/tacotron2.py

nemo/collections/tts/modules/tacotron2.py

Signed-off-by: Jason <[email protected]>

okuchaiev · 2020-07-30T16:51:46Z

nemo/collections/tts/models/tacotron2.py

+        elif not isinstance(cfg, DictConfig):
+            raise ValueError(f"cfg was type: {type(cfg)}. Expected either a dict or a DictConfig")
+        # Ensure passed cfg is compliant with schema
+        OmegaConf.merge(cfg, schema)


@titu1994 @ericharper @tkornuta-nvidia - let's chat (outside of this PR) if we'd like to adapt something like this in general for Models

okuchaiev · 2020-07-30T16:52:57Z

nemo/core/classes/modelPT.py

@@ -77,7 +77,7 @@ def __init__(self, cfg: DictConfig, trainer: Trainer = None):
        self._scheduler = None
        self._trainer = trainer

-        if self._cfg is not None:
+        if self._cfg is not None:  # TODO: This check is redundant since we know cfg is an instance of DictConfig


true - feel free to remove

titu1994

Minor comments regarding possible refactor, but overall not necessary to merge. LGTM

titu1994 · 2020-07-30T17:17:36Z

nemo/collections/tts/models/waveglow.py

+            raise ValueError(f"No dataset for {name}")  # TODO
+        if "dataloader_params" not in cfg or not isinstance(cfg["dataloader_params"], (dict, DictConfig)):
+            raise ValueError(f"No dataloder_params for {name}")  # TODO
+        if shuffle_should_be:


How about "should_shuffle_train_val"

titu1994 · 2020-07-30T17:19:30Z

nemo/collections/tts/modules/waveglow.py

+from nemo.utils.decorators import experimental
+
+
+class OperationMode(Enum):


Mode should be unnecessary after the function override of typecheck if you would like to refactor, but this is fine too.

* add structure Signed-off-by: Jason <[email protected]> * add files Signed-off-by: Jason <[email protected]> * add init Signed-off-by: Jason <[email protected]> * fix init Signed-off-by: Jason <[email protected]> * update taco Signed-off-by: Jason <[email protected]> * format Signed-off-by: Jason <[email protected]> * typo Signed-off-by: Jason <[email protected]> * val change Signed-off-by: Jason <[email protected]> * update Signed-off-by: Jason <[email protected]> * update Signed-off-by: Jason <[email protected]> * update Signed-off-by: Jason <[email protected]> * add header Signed-off-by: Jason <[email protected]> * add waveglow Signed-off-by: Jason <[email protected]> * waveglow fix Signed-off-by: Jason <[email protected]> * waveglow fix Signed-off-by: Jason <[email protected]> * waveglow fix Signed-off-by: Jason <[email protected]> * add O1 Signed-off-by: Jason <[email protected]> * add todo Signed-off-by: Jason <[email protected]> * change batch sizes Signed-off-by: Jason <[email protected]> * move Signed-off-by: Jason <[email protected]> * move Signed-off-by: Jason <[email protected]> * V1 Signed-off-by: Jason <[email protected]> * V1 Signed-off-by: Jason <[email protected]> * structure Signed-off-by: Jason <[email protected]> * clean up Signed-off-by: Jason <[email protected]> * fix tacotron2 Signed-off-by: Jason <[email protected]> * add files Signed-off-by: Jason <[email protected]> * update yamls Signed-off-by: Jason <[email protected]> * fix waveglow 1/2 Signed-off-by: Jason <[email protected]> * fix waveglow 2/3 Signed-off-by: Jason <[email protected]> * fix waveglow 3/? Signed-off-by: Jason <[email protected]> * merge asr and tts Signed-off-by: Jason <[email protected]> * update waveglow Signed-off-by: Jason <[email protected]> * isort Signed-off-by: Jason <[email protected]> * update configs Signed-off-by: Jason <[email protected]> * update waveglow's dataloader Signed-off-by: Jason <[email protected]> * update and refactor Signed-off-by: Jason <[email protected]> * update t2 and waveglow Signed-off-by: Jason <[email protected]> * enforce dictconfig; style Signed-off-by: Jason <[email protected]> * add fastdevruns Signed-off-by: Jason <[email protected]> * name Signed-off-by: Jason <[email protected]> * import Signed-off-by: Jason <[email protected]> * update tests Signed-off-by: Jason <[email protected]> * yaml Signed-off-by: Jason <[email protected]> * use hydra rnner Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * move callback to common Signed-off-by: Jason <[email protected]> * fixed tacotron 2; add typing to all neuralmodules Signed-off-by: Jason <[email protected]> * flatten Signed-off-by: Jason <[email protected]> * Style Signed-off-by: Jason <[email protected]> * patch jenkins Signed-off-by: Jason <[email protected]> * change typeheck logic Signed-off-by: Jason <[email protected]> * fix t2 Signed-off-by: Jason <[email protected]> * fix wg Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * update configs Signed-off-by: Jason <[email protected]> * add num_workers Signed-off-by: Jason <[email protected]> * update config Signed-off-by: Jason <[email protected]> * update gitignore Signed-off-by: Jason <[email protected]> * config Signed-off-by: Jason <[email protected]> * enable fp16 on t2 Signed-off-by: Jason <[email protected]> * enable fp16 on t2 Signed-off-by: Jason <[email protected]> * enable fp16 on t2 Signed-off-by: Jason <[email protected]> * fix t2 Signed-off-by: Jason <[email protected]> * fixes Signed-off-by: Jason <[email protected]> * add typing to models; use nemo loss class Signed-off-by: Jason <[email protected]> * make wg work Signed-off-by: Jason <[email protected]> * make t2 work Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * fix wg Signed-off-by: Jason <[email protected]> * add better debug info to shape check Signed-off-by: Jason <[email protected]> * lgtm import error Signed-off-by: Jason <[email protected]> * lower batch size for testing Signed-off-by: Jason <[email protected]> * address comments Signed-off-by: Jason <[email protected]> * address comments and remove experimental Signed-off-by: Jason <[email protected]> * standardize Signed-off-by: Jason <[email protected]> * mark back as experimental Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * experimental Signed-off-by: Jason <[email protected]> Signed-off-by: Jocelyn Huang <[email protected]>

* add structure Signed-off-by: Jason <[email protected]> * add files Signed-off-by: Jason <[email protected]> * add init Signed-off-by: Jason <[email protected]> * fix init Signed-off-by: Jason <[email protected]> * update taco Signed-off-by: Jason <[email protected]> * format Signed-off-by: Jason <[email protected]> * typo Signed-off-by: Jason <[email protected]> * val change Signed-off-by: Jason <[email protected]> * update Signed-off-by: Jason <[email protected]> * update Signed-off-by: Jason <[email protected]> * update Signed-off-by: Jason <[email protected]> * add header Signed-off-by: Jason <[email protected]> * add waveglow Signed-off-by: Jason <[email protected]> * waveglow fix Signed-off-by: Jason <[email protected]> * waveglow fix Signed-off-by: Jason <[email protected]> * waveglow fix Signed-off-by: Jason <[email protected]> * add O1 Signed-off-by: Jason <[email protected]> * add todo Signed-off-by: Jason <[email protected]> * change batch sizes Signed-off-by: Jason <[email protected]> * move Signed-off-by: Jason <[email protected]> * move Signed-off-by: Jason <[email protected]> * V1 Signed-off-by: Jason <[email protected]> * V1 Signed-off-by: Jason <[email protected]> * structure Signed-off-by: Jason <[email protected]> * clean up Signed-off-by: Jason <[email protected]> * fix tacotron2 Signed-off-by: Jason <[email protected]> * add files Signed-off-by: Jason <[email protected]> * update yamls Signed-off-by: Jason <[email protected]> * fix waveglow 1/2 Signed-off-by: Jason <[email protected]> * fix waveglow 2/3 Signed-off-by: Jason <[email protected]> * fix waveglow 3/? Signed-off-by: Jason <[email protected]> * merge asr and tts Signed-off-by: Jason <[email protected]> * update waveglow Signed-off-by: Jason <[email protected]> * isort Signed-off-by: Jason <[email protected]> * update configs Signed-off-by: Jason <[email protected]> * update waveglow's dataloader Signed-off-by: Jason <[email protected]> * update and refactor Signed-off-by: Jason <[email protected]> * update t2 and waveglow Signed-off-by: Jason <[email protected]> * enforce dictconfig; style Signed-off-by: Jason <[email protected]> * add fastdevruns Signed-off-by: Jason <[email protected]> * name Signed-off-by: Jason <[email protected]> * import Signed-off-by: Jason <[email protected]> * update tests Signed-off-by: Jason <[email protected]> * yaml Signed-off-by: Jason <[email protected]> * use hydra rnner Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * move callback to common Signed-off-by: Jason <[email protected]> * fixed tacotron 2; add typing to all neuralmodules Signed-off-by: Jason <[email protected]> * flatten Signed-off-by: Jason <[email protected]> * Style Signed-off-by: Jason <[email protected]> * patch jenkins Signed-off-by: Jason <[email protected]> * change typeheck logic Signed-off-by: Jason <[email protected]> * fix t2 Signed-off-by: Jason <[email protected]> * fix wg Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * update configs Signed-off-by: Jason <[email protected]> * add num_workers Signed-off-by: Jason <[email protected]> * update config Signed-off-by: Jason <[email protected]> * update gitignore Signed-off-by: Jason <[email protected]> * config Signed-off-by: Jason <[email protected]> * enable fp16 on t2 Signed-off-by: Jason <[email protected]> * enable fp16 on t2 Signed-off-by: Jason <[email protected]> * enable fp16 on t2 Signed-off-by: Jason <[email protected]> * fix t2 Signed-off-by: Jason <[email protected]> * fixes Signed-off-by: Jason <[email protected]> * add typing to models; use nemo loss class Signed-off-by: Jason <[email protected]> * make wg work Signed-off-by: Jason <[email protected]> * make t2 work Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * fix wg Signed-off-by: Jason <[email protected]> * add better debug info to shape check Signed-off-by: Jason <[email protected]> * lgtm import error Signed-off-by: Jason <[email protected]> * lower batch size for testing Signed-off-by: Jason <[email protected]> * address comments Signed-off-by: Jason <[email protected]> * address comments and remove experimental Signed-off-by: Jason <[email protected]> * standardize Signed-off-by: Jason <[email protected]> * mark back as experimental Signed-off-by: Jason <[email protected]> * style Signed-off-by: Jason <[email protected]> * experimental Signed-off-by: Jason <[email protected]> Signed-off-by: Samuel Kriman <[email protected]>

blisc added 30 commits July 1, 2020 12:06

add structure

89f6fa9

Signed-off-by: Jason <[email protected]>

add files

8008f86

Signed-off-by: Jason <[email protected]>

add init

d098a6b

Signed-off-by: Jason <[email protected]>

fix init

e0298ea

Signed-off-by: Jason <[email protected]>

Merge remote-tracking branch 'nvidia/candidate' into candidate_tts_jason

61e084a

Signed-off-by: Jason <[email protected]>

update taco

51eb125

Signed-off-by: Jason <[email protected]>

format

641d122

Signed-off-by: Jason <[email protected]>

typo

0bbe9e4

Signed-off-by: Jason <[email protected]>

val change

491a171

Signed-off-by: Jason <[email protected]>

update

8ee9091

Signed-off-by: Jason <[email protected]>

update

872c11a

Signed-off-by: Jason <[email protected]>

update

f519dca

Signed-off-by: Jason <[email protected]>

add header

7a5a862

Signed-off-by: Jason <[email protected]>

Merge remote-tracking branch 'nvidia/candidate' into candidate_tts_jason

95e9d92

Signed-off-by: Jason <[email protected]>

add waveglow

5e2aef1

Signed-off-by: Jason <[email protected]>

waveglow fix

c427ae0

Signed-off-by: Jason <[email protected]>

waveglow fix

9196b62

Signed-off-by: Jason <[email protected]>

waveglow fix

6a8156a

Signed-off-by: Jason <[email protected]>

add O1

2475136

Signed-off-by: Jason <[email protected]>

add todo

602f82f

Signed-off-by: Jason <[email protected]>

change batch sizes

c47b3a4

Signed-off-by: Jason <[email protected]>

Merge branch 'candidate_tts_jason' into candidate_tts_v2

9d525a5

Signed-off-by: Jason <[email protected]>

move

ef39f6d

Signed-off-by: Jason <[email protected]>

move

ea35ec7

Signed-off-by: Jason <[email protected]>

V1

c099f87

Signed-off-by: Jason <[email protected]>

V1

7384204

Signed-off-by: Jason <[email protected]>

structure

05789a7

Signed-off-by: Jason <[email protected]>

clean up

ebc1868

Signed-off-by: Jason <[email protected]>

fix tacotron2

24f1078

Signed-off-by: Jason <[email protected]>

add files

2887dc0

Signed-off-by: Jason <[email protected]>

blisc added 3 commits July 28, 2020 13:33

make wg work

615184f

Signed-off-by: Jason <[email protected]>

make t2 work

aaa0761

Signed-off-by: Jason <[email protected]>

style

1f73b7c

Signed-off-by: Jason <[email protected]>

blisc dismissed titu1994’s stale review via 1f73b7c July 28, 2020 20:43

blisc requested review from okuchaiev and titu1994 July 28, 2020 20:43

merge

e173943

Signed-off-by: Jason <[email protected]>

fix wg

3f48fac

Signed-off-by: Jason <[email protected]>

blisc added 3 commits July 28, 2020 15:01

add better debug info to shape check

28cc1b5

Signed-off-by: Jason <[email protected]>

lgtm import error

01c0dbf

Signed-off-by: Jason <[email protected]>

lower batch size for testing

a566b65

Signed-off-by: Jason <[email protected]>

okuchaiev requested changes Jul 29, 2020

View reviewed changes

blisc added 10 commits July 29, 2020 11:22

address comments

2961c39

Signed-off-by: Jason <[email protected]>

address comments and remove experimental

6b404cc

Signed-off-by: Jason <[email protected]>

Merge remote-tracking branch 'nvidia/candidate' into candidate_tts_v2

19519e1

Signed-off-by: Jason <[email protected]>

standardize

6a9ef48

Signed-off-by: Jason <[email protected]>

Merge remote-tracking branch 'nvidia/candidate' into candidate_tts_v2

17617b6

Signed-off-by: Jason <[email protected]>

Merge remote-tracking branch 'nvidia/candidate' into candidate_tts_v2

ace6626

Signed-off-by: Jason <[email protected]>

mark back as experimental

e9a7689

Signed-off-by: Jason <[email protected]>

style

26afd7b

Signed-off-by: Jason <[email protected]>

experimental

2934284

Signed-off-by: Jason <[email protected]>

add losses

7bb88ec

Signed-off-by: Jason <[email protected]>

okuchaiev approved these changes Jul 30, 2020

View reviewed changes

titu1994 approved these changes Jul 30, 2020

View reviewed changes

blisc merged commit 50cbeec into NVIDIA:candidate Jul 30, 2020

blisc deleted the candidate_tts_v2 branch July 30, 2020 17:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TTS Collection #874

TTS Collection #874

blisc commented Jul 20, 2020 •

edited

Loading

lgtm-com bot commented Jul 28, 2020

lgtm-com bot commented Jul 28, 2020

okuchaiev left a comment

okuchaiev Jul 29, 2020

blisc Jul 29, 2020

blisc Jul 29, 2020

okuchaiev Jul 29, 2020

okuchaiev Jul 29, 2020

blisc Jul 29, 2020

blisc Jul 29, 2020 •

edited

Loading

okuchaiev Jul 29, 2020

blisc Jul 30, 2020

blisc Jul 30, 2020

titu1994 Jul 30, 2020 •

edited

Loading

okuchaiev Jul 29, 2020

okuchaiev Jul 30, 2020

okuchaiev Jul 30, 2020

titu1994 left a comment

titu1994 Jul 30, 2020

titu1994 Jul 30, 2020 •

edited

Loading

		return mel_loss + gate_loss


		@experimental # TODO: Need to implement abstract methods: list_available_models, from_pretrained, export but how?

		validation_ds: Optional[Dict] = None


		class Tacotron2Loss(Loss):

		from nemo.utils.decorators import experimental


		class OperationMode(Enum):

TTS Collection #874

TTS Collection #874

Conversation

blisc commented Jul 20, 2020 • edited Loading

Changes outside TTS collection

ASR collection

Core collection

Core

TODO

lgtm-com bot commented Jul 28, 2020

lgtm-com bot commented Jul 28, 2020

okuchaiev left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

blisc Jul 29, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

titu1994 Jul 30, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

titu1994 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

titu1994 Jul 30, 2020 • edited Loading

Choose a reason for hiding this comment

blisc commented Jul 20, 2020 •

edited

Loading

blisc Jul 29, 2020 •

edited

Loading

titu1994 Jul 30, 2020 •

edited

Loading

titu1994 Jul 30, 2020 •

edited

Loading