Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New apex compatible squad #19

Merged
merged 2 commits into from
Apr 30, 2020
Merged

New apex compatible squad #19

merged 2 commits into from
Apr 30, 2020

Conversation

jeffra
Copy link
Contributor

@jeffra jeffra commented Apr 29, 2020

This update is required before we can merge microsoft/DeepSpeed#205

@jeffra jeffra merged commit 9e2c735 into master Apr 30, 2020
@jeffra jeffra deleted the jeffra/new_amp_squad branch April 30, 2020 21:49
jeffra pushed a commit that referenced this pull request Sep 2, 2020
* update bing_bert example to use sparse transformer

* Updated teh BertSparseSelfAttention example based on the ST updates

* updated bing_bert example based on final updates for Sparse Attention; also added un/pad of Bert layer input

* updated based on Tunji's comment: added a separate script for SA

* fixed a typo

* added an exception when both transformer kernel and SA are set together.
jeffra added a commit that referenced this pull request Sep 2, 2020
* update bing_bert example to use sparse attention (#19)

* update bing_bert example to use sparse transformer

* Updated teh BertSparseSelfAttention example based on the ST updates

* updated bing_bert example based on final updates for Sparse Attention; also added un/pad of Bert layer input

* updated based on Tunji's comment: added a separate script for SA

* fixed a typo

* added an exception when both transformer kernel and SA are set together.

* fixed an issue with last PR: removed keyword self for function call as it was moved out of class

Co-authored-by: Arash Ashari <[email protected]>
Co-authored-by: arashashari <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants