Nezha Pytorch implementation #17776
Conversation
…into vqa_pipeline
Co-authored-by: NielsRogge <[email protected]>
The documentation is not available anymore as the PR was closed or merged.
Ready for review! I'll upload the rest of the pre-trained models later today.
Very nice addition! This PR is in great shape, I just have two comments:

- Remove the capital Z in all `NeZhaXxx` classes (so `NezhaConfig`, `NezhaModel`, etc.).
- Make sure the classes that are perfect duplicates of BERT classes have a `# Copied from` statement like this one for RoBERTa.

Also, if the model's default tokenizer is `BertTokenizer`, consider adding a mapping from the model config to the BERT tokenizers in `tokenization_auto.py`.
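For context, a `# Copied from` statement marks a class as an exact copy of another, and the repo consistency check can apply a name substitution when verifying it. A rough sketch of what one of the renamed, BERT-derived classes could look like (the class body shown is BERT's; treat it as illustrative rather than the PR's exact code):

```python
import torch.nn as nn

# Copied from transformers.models.bert.modeling_bert.BertSelfOutput with Bert->Nezha
class NezhaSelfOutput(nn.Module):
    def __init__(self, config):
        super().__init__()
        self.dense = nn.Linear(config.hidden_size, config.hidden_size)
        self.LayerNorm = nn.LayerNorm(config.hidden_size, eps=config.layer_norm_eps)
        self.dropout = nn.Dropout(config.hidden_dropout_prob)

    def forward(self, hidden_states, input_tensor):
        hidden_states = self.dense(hidden_states)
        hidden_states = self.dropout(hidden_states)
        hidden_states = self.LayerNorm(hidden_states + input_tensor)
        return hidden_states
```

And the `tokenization_auto.py` mapping could look roughly like this, reusing the BERT tokenizers (a hypothetical entry, assuming the config's model type is registered as `nezha`):

```python
# Hypothetical entry in the tokenizer mapping, pointing NEZHA at the BERT tokenizers
("nezha", ("BertTokenizer", "BertTokenizerFast" if is_tokenizers_available() else None)),
```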
# [to be uploaded] "sijunhe/nezha-large-wwm",
# [to be uploaded] "sijunhe/nezha-cn-large",
Flagging those so we don't forget to wait for them to be uploaded before merging :-)
Addressed all the comments from @sgugger and uploaded the two remaining models. Ready for a final round of review.
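With the checkpoints up, a minimal smoke test might look like this (a sketch assuming the final class names from the review above and that the NEZHA checkpoints pair with the stock BERT tokenizer):

```python
import torch
from transformers import BertTokenizer, NezhaModel

# NEZHA reuses the BERT tokenizer, so no dedicated NezhaTokenizer is needed
tokenizer = BertTokenizer.from_pretrained("sijunhe/nezha-cn-large")
model = NezhaModel.from_pretrained("sijunhe/nezha-cn-large")

inputs = tokenizer("我爱北京天安门", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # e.g. torch.Size([1, seq_len, 1024]) for a large model
```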
Great work! The PR looks more or less ready to be merged already. Left some nits, and it'd be nice to add as many `# Copied from` statements from BERT as we can.
Thanks again for your contribution!
* wip
* rebase
* all tests pass
* rebase
* ready for PR
* address comments
* fix styles
* add require_torch to pipeline test
* remove remote image to improve CI consistency
* address comments; fix tf/flax tests
* address comments; fix tf/flax tests
* fix tests; add alias
* repo consistency tests
* Update src/transformers/pipelines/visual_question_answering.py
  Co-authored-by: NielsRogge <[email protected]>
* address comments
* Update src/transformers/pipelines/visual_question_answering.py
  Co-authored-by: NielsRogge <[email protected]>
* merge
* wip
* wip
* wip
* most basic tests passes
* all tests pass now
* relative embedding
* wip
* running make fixup
* remove bert changes
* fix doc
* fix doc
* fix issues
* fix doc
* address comments
* fix CI
* remove redundant copied from
* address comments
* fix broken test

Co-authored-by: Sijun He <[email protected]>
Co-authored-by: NielsRogge <[email protected]>
What does this PR do?
This PR adds a PyTorch implementation of the NEZHA model to Transformers. NEZHA was introduced by Huawei Noah's Ark Lab in late 2019 and is widely used in the Chinese NLP community. This implementation is based on the official PyTorch implementation of NEZHA and the current BERT PyTorch implementation in this repository. The model checkpoints also come from the official implementation.
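NEZHA's main architectural departure from BERT is functional relative positional encoding: attention incorporates fixed sinusoidal embeddings of the clipped relative distance between token positions instead of learned absolute position embeddings. A minimal sketch of how such a table can be built (the function name and the clipping distance of 64 are illustrative, not the PR's exact code):

```python
import math
import torch

def relative_position_table(length: int, depth: int, max_distance: int = 64) -> torch.Tensor:
    """Sinusoidal embeddings of the clipped relative distance (j - i) between positions."""
    positions = torch.arange(length)
    distance = positions[None, :] - positions[:, None]  # (length, length), entry [i, j] = j - i
    distance = distance.clamp(-max_distance, max_distance) + max_distance  # shift into [0, 2*max_distance]

    vocab = 2 * max_distance + 1
    table = torch.zeros(vocab, depth)
    pos = torch.arange(vocab, dtype=torch.float).unsqueeze(1)
    div = torch.exp(torch.arange(0, depth, 2, dtype=torch.float) * -(math.log(10000.0) / depth))
    table[:, 0::2] = torch.sin(pos * div)  # depth is assumed even here
    table[:, 1::2] = torch.cos(pos * div)
    return table[distance]  # (length, length, depth)
```

Each attention layer can then add scores derived from this table to the usual query-key scores, which is what lets relative-position models generalize to sequence positions beyond those seen in pretraining.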
Fixes # (issue)
Before submitting
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
Since the model is quite similar to BERT, maybe @LysandreJik?