
Fix preprocessing for WMT14 En-De to replicate Scaling NMT paper #203

Merged
merged 1 commit into master from preprocess_wmt_en_de on Jun 28, 2018

Conversation

myleott
Contributor

@myleott myleott commented Jun 28, 2018

  • use newstest2013 for validation instead of splitting the training set
  • apply length filtering before BPE
  • final dataset is ~4.5M sentence pairs
  • confirmed this new dataset gives results on par with the Scaling NMT paper
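The key ordering change above is applying length filtering before BPE. A minimal, hypothetical sketch of that kind of filter (in the spirit of Moses' clean-corpus-n script; the function name, thresholds, and example sentences are illustrative, not taken from this PR):

```python
def filter_pair(src, tgt, min_len=1, max_len=250, max_ratio=1.5):
    """Return True if the (src, tgt) sentence pair passes length filtering.

    Filtering on whitespace tokens *before* BPE avoids dropping pairs
    merely because subword segmentation inflated their token counts.
    """
    slen, tlen = len(src.split()), len(tgt.split())
    # Reject pairs with a sentence outside the allowed length range.
    if not (min_len <= slen <= max_len and min_len <= tlen <= max_len):
        return False
    # Reject pairs whose source/target lengths differ by too large a ratio.
    return slen / tlen <= max_ratio and tlen / slen <= max_ratio

pairs = [
    ("a small example sentence", "ein kleiner Beispielsatz"),
    ("word", "dies ist eine viel zu lange Uebersetzung fuer ein Wort"),
]
# Keep only pairs that pass the filter; the second pair's 1:10 length
# ratio exceeds max_ratio, so it is dropped.
kept = [p for p in pairs if filter_pair(*p)]
```

Filtering would then be followed by learning and applying BPE on the surviving pairs only.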

@myleott myleott requested a review from edunov June 28, 2018 17:04
@myleott myleott merged commit a75c309 into master Jun 28, 2018
@myleott myleott deleted the preprocess_wmt_en_de branch June 28, 2018 18:19
myleott pushed a commit that referenced this pull request Aug 28, 2018
moussaKam pushed a commit to moussaKam/language-adaptive-pretraining that referenced this pull request Sep 29, 2020