
bug report: empty batch when using --max-tokens < 128 #347

Closed
robertodessi opened this issue Nov 5, 2018 · 0 comments
I noticed that when setting --max-tokens < 128, an error occurs during the dummy step that runs before the actual training starts.
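For reference, a command along these lines triggers it (the dataset path and architecture here are placeholders; any --max-tokens value below 128 should reproduce it):

python train.py data-bin/iwslt14.tokenized.de-en --arch lstm --max-tokens 25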

| training on 1 GPUs
| max tokens per GPU = 25 and max sentences per GPU = None
Traceback (most recent call last):
  File "train.py", line 358, in <module>
    main(args)
  File "train.py", line 78, in main
    trainer.dummy_train_step([dummy_batch])
  File "/home/roberto.dessi/new_fairseq/fairseq/trainer.py", line 326, in dummy_train_step
    self.train_step(dummy_batch, dummy_batch=True)
  File "/home/roberto.dessi/new_fairseq/fairseq/trainer.py", line 176, in train_step
    ignore_grad
  File "/home/roberto.dessi/new_fairseq/fairseq/tasks/fairseq_task.py", line 169, in train_step
    loss, sample_size, logging_output = criterion(model, sample)
  File "/home/roberto.dessi/.virtualenvs/work/lib/python3.6/site-packages/torch/nn/modules/module.py", line 477, in __call__
    result = self.forward(*input, **kwargs)
  File "/home/roberto.dessi/new_fairseq/fairseq/criterions/cross_entropy.py", line 30, in forward
    net_output = model(**sample['net_input'])
TypeError: 'NoneType' object is not subscriptable

The error is caused by https://github.com/robertodessi/fairseq/blob/master/fairseq/data/language_pair_dataset.py#L195

It seems to me that a small value of max_tokens causes that division to evaluate to 0, which creates an empty dummy batch and raises the above error.

It works when I replace that line with:

bsz = max(num_tokens // max(src_len, tgt_len), 1)
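A minimal sketch of what goes wrong, using the variable names from language_pair_dataset.py (the concrete numbers below are illustrative):

# get_dummy_batch computes the dummy batch size roughly like this:
num_tokens = 25                # the value passed via --max-tokens
src_len, tgt_len = 128, 128    # default dummy sequence lengths

# original line: integer division yields 0 whenever num_tokens < max(src_len, tgt_len)
bsz = num_tokens // max(src_len, tgt_len)   # 25 // 128 == 0 -> empty dummy batch

# proposed fix: clamp the dummy batch to at least one sentence
bsz = max(num_tokens // max(src_len, tgt_len), 1)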