Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TODO list for Transformer. #7355

Closed
22 tasks done
lcy-seso opened this issue Jan 9, 2018 · 2 comments
Closed
22 tasks done

TODO list for Transformer. #7355

lcy-seso opened this issue Jan 9, 2018 · 2 comments
Assignees
Labels

Comments

@lcy-seso
Copy link
Contributor

lcy-seso commented Jan 9, 2018

  • implement they layer normalization operator and the Python wrapper.
    • CPU implementation.
    • GPU implementation.
    • python wrapper.
  • enhance the matmul operator to support 4-D tensor as its inputs Does it need to enhance matmul_op to support 4-D inputs #7319. fixed by PR: Enhance matmul_op to support 4-D inputs #7656
  • prepare the dataset.
    fixed by PR: Add WMT16 into dataset. #7661
  • wrap the masked positional embedding.
  • enhance the lookup_table operator to support the special token: padding index. Support padding_idx in the lookup_table_op. #7309.
  • wrap the multi-head dot product attention. This is different to ConvS2S.
  • wrap the positional-wise feed-forward network.
  • wrap the basic computation block.
  • build the entire model.
  • enhance the documentation of operators used in Transformer.
  • add beam search for Transformer.
  • clean codes and merge the entire project into the models repo (merge the work part by part).
  • Learning Rate Scheduler
  • Residual Dropout
  • Label Smoothing
    • label smooth operator.
    • python wrapper.
  • Scaled Dot Product Attention
  • Weight sharing between embedding and pre-softmax linear transformation layers
@lcy-seso lcy-seso added the NMT label Jan 9, 2018
@lcy-seso
Copy link
Contributor Author

lcy-seso commented Jan 9, 2018

Hi, @abhinavarora and @kavyasrinet I just add this TODO list according to my understanding. Please feel free to make any change to this list, and arrange the work as needed.

Some steps are shared between ConvS2S and Transformer. All the works listed above have not been assigned yet. We can share the workload.

P.S The orders above do not mean the work can only be done by following the same order. We can arrange it as needed.

@kavyasrinet
Copy link

Thank you so much @lcy-seso for compiling the list above. We were planning to work on this compilation on our end as well. We will update this soon, and add the additional items to the list above.

@lcy-seso lcy-seso self-assigned this Jan 19, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants