Migrate transducer input checks to C++ #1391

carolineechen · 2021-03-15T15:07:52Z

Move input checks for RNNT from Python to C++

RNN Transducer Loss Issue: #1240

torchaudio/csrc/transducer.cpp

vincentqb

Thanks for working on this! I've noted a few small changes (and some things we'll think about in separate PRs)

torchaudio/csrc/transducer.cpp

vincentqb · 2021-03-15T19:57:14Z

torchaudio/csrc/transducer.cpp

+  TORCH_CHECK(
+      input_lengths.size(0) == acts.size(0),
+      "each output sequence must have a length");
+  TORCH_CHECK(
+      label_lengths.size(0) == acts.size(0),
+      "each example must have a label length");


Let's be explicit and consistent with python naming:

"batch dimension mismatch between acts and act_lens: each example must have a length" "batch dimension mismatch between acts and label_lens: each example must have a label length"

(As follow-up to this PR: we'll change the names of the variables in the C++ function to match those of the python function:

""" Args: acts (Tensor): Tensor of dimension (batch, time, label, class) containing output from network before applying ``torch.nn.functional.log_softmax``. labels (Tensor): Tensor of dimension (batch, max label length) containing the labels padded by zero act_lens (Tensor): Tensor of dimension (batch) containing the length of each output sequence label_lens (Tensor): Tensor of dimension (batch) containing the length of each output sequence """

and, also, I just realized the two last descriptions are the same in the python documentation :)

""" Args: acts (Tensor): Tensor of dimension (batch, time, label, class) containing output *sequence* from network before applying ``torch.nn.functional.log_softmax``. labels (Tensor): Tensor of dimension (batch, max label length) containing the labels padded by zero act_lens (Tensor): Tensor of dimension (batch) containing the length of each output sequence label_lens (Tensor): Tensor of dimension (batch) containing the length of each *label* """

added * for suggested change.)

torchaudio/csrc/transducer.cpp

vincentqb · 2021-03-15T20:18:14Z

torchaudio/csrc/transducer.cpp

  int maxT = acts.size(1);
  int maxU = acts.size(2);
  int minibatch_size = acts.size(0);
  int alphabet_size = acts.size(3);

+  TORCH_CHECK(
+      at::max(input_lengths).item().toInt() == maxT, "input length mismatch");


(Follow-up beyond this PR: let's improve the readability here: "The maximum length of a sequence in acts must be equal to the maximal value given in act_lens" ?)

vincentqb · 2021-03-15T21:32:18Z

torchaudio/csrc/transducer.cpp

+      at::max(input_lengths).item().toInt() == maxT, "input length mismatch");
+  TORCH_CHECK(
+      at::max(label_lengths).item().toInt() + 1 == maxU,
+      "output length mismatch");


(Follow-up beyond this PR: we'll want to improve this message too)

* Update index.rst * Update layout.html

facebook-github-bot added the CLA Signed label Mar 15, 2021

carolineechen requested review from vincentqb and mthrok March 15, 2021 15:09

migrate transducer input checks to c++

ce61ffe

carolineechen force-pushed the migrate_transducer_input_checks branch from 2d15d0f to ce61ffe Compare March 15, 2021 15:16

mthrok reviewed Mar 15, 2021

View reviewed changes

torchaudio/csrc/transducer.cpp Outdated Show resolved Hide resolved

torchaudio/csrc/transducer.cpp Outdated Show resolved Hide resolved

mthrok approved these changes Mar 15, 2021

View reviewed changes

vincentqb reviewed Mar 15, 2021

View reviewed changes

carolineechen force-pushed the migrate_transducer_input_checks branch from dbaf97c to 21b5abf Compare March 15, 2021 23:12

improve error messages

9b72d80

carolineechen force-pushed the migrate_transducer_input_checks branch from 21b5abf to 9b72d80 Compare March 15, 2021 23:14

carolineechen merged commit f06074a into pytorch:master Mar 16, 2021

carolineechen deleted the migrate_transducer_input_checks branch March 16, 2021 13:45

mthrok pushed a commit to mthrok/audio that referenced this pull request Dec 13, 2022

Brianjo expand fix (pytorch#1391)

4d8e788

* Update index.rst * Update layout.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Migrate transducer input checks to C++ #1391

Migrate transducer input checks to C++ #1391

carolineechen commented Mar 15, 2021

vincentqb left a comment

vincentqb Mar 15, 2021

vincentqb Mar 15, 2021

vincentqb Mar 15, 2021

vincentqb Mar 15, 2021

Migrate transducer input checks to C++ #1391

Migrate transducer input checks to C++ #1391

Conversation

carolineechen commented Mar 15, 2021

vincentqb left a comment

Choose a reason for hiding this comment

vincentqb Mar 15, 2021

Choose a reason for hiding this comment

vincentqb Mar 15, 2021

Choose a reason for hiding this comment

vincentqb Mar 15, 2021

Choose a reason for hiding this comment

vincentqb Mar 15, 2021

Choose a reason for hiding this comment