Use padding_idx=None for nn.Embedding() in the decoder model #1206

csukuangfj · 2023-08-08T13:23:55Z

We need to change sherpa/sherpa-onnx/sherpa-ncnn to use [-1, 0] as the initial tokens during decoding instead of [0, 0].
The consequence is that if we need to re-export the model, otherwise it will cause runtime error saying that

IndexError: index out of range in self

If we don't change sherpa/sherpa-onnx/sherpa-ncnn, a user has reported that the WER becomes worse.

The text was updated successfully, but these errors were encountered:

See also k2-fsa/icefall#1206 and k2-fsa/icefall#1208

kamirdin · 2023-12-17T02:22:00Z

I don't understand that we train the model use [0,0], why we decode by [-1,0]？

        blank_id = self.decoder.blank_id
        sos_y = add_sos(y, sos_id=blank_id)

        # sos_y_padded: [B, S + 1], start with SOS.
        sos_y_padded = sos_y.pad(mode="constant", padding_value=blank_id)

icefall/egs/librispeech/ASR/zipformer/model.py

Line 208 in 10a2347

sos_y_padded = sos_y.pad(mode="constant", padding_value=blank_id)

csukuangfj · 2023-12-17T04:55:18Z

I don't understand that we train the model use [0,0], why we decode by [-1,0]？
        blank_id = self.decoder.blank_id
        sos_y = add_sos(y, sos_id=blank_id)

        # sos_y_padded: [B, S + 1], start with SOS.
        sos_y_padded = sos_y.pad(mode="constant", padding_value=blank_id)
icefall/egs/librispeech/ASR/zipformer/model.py

Line 208 in 10a2347

sos_y_padded = sos_y.pad(mode="constant", padding_value=blank_id)

please think about the input of the conv module in the decoder model.

csukuangfj mentioned this issue Aug 9, 2023

Fix initial tokens for decoding k2-fsa/sherpa-onnx#246

Merged

ezerhouni mentioned this issue Aug 14, 2023

Update padding modified beam search #1217

Merged

csukuangfj added a commit to csukuangfj/sherpa that referenced this issue Aug 25, 2023

Fix initial tokens for decoding.

b403b33

See also k2-fsa/icefall#1206 and k2-fsa/icefall#1208

csukuangfj mentioned this issue Aug 25, 2023

Fix initial tokens for decoding. k2-fsa/sherpa#464

Merged

csukuangfj added a commit to k2-fsa/sherpa that referenced this issue Aug 25, 2023

Fix initial tokens for decoding. (#464)

1ad6943

See also k2-fsa/icefall#1206 and k2-fsa/icefall#1208

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use padding_idx=None for nn.Embedding() in the decoder model #1206

Use padding_idx=None for nn.Embedding() in the decoder model #1206

csukuangfj commented Aug 8, 2023

kamirdin commented Dec 17, 2023

csukuangfj commented Dec 17, 2023

Use padding_idx=None for nn.Embedding() in the decoder model #1206

Use padding_idx=None for nn.Embedding() in the decoder model #1206

Comments

csukuangfj commented Aug 8, 2023

kamirdin commented Dec 17, 2023

csukuangfj commented Dec 17, 2023