You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Oct 13, 2022. It is now read-only.
See the code below:
snowfall/snowfall/models/transformer.py
Line 162 in 3502531
snowfall/snowfall/models/transformer.py
Line 167 in 3502531
snowfall/snowfall/models/transformer.py
Line 179 in 3502531
snowfall/snowfall/models/transformer.py
Line 709 in 3502531
snowfall/snowfall/models/transformer.py
Lines 720 to 721 in 3502531
You can see that
ys_in_pad
is padded with eos_id, which is a positive word piece ID.However, it is using -1 to compute the mask for
ys_in_pad
.This bug may explain why the WERs differ with respect to batch size. It also affects the training, I guess.
The text was updated successfully, but these errors were encountered: