Fix tf shared embedding #17730
Conversation
All the TF weights of OPT will need to be updated if this is approved. I think I can handle that along with #17713.
Thanks for fixing!
@@ -281,14 +281,14 @@ def test_inference_no_head(self):
 @require_tf
-@slow
+# @slow
Will need to be uncommented before merging.
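For reference, both decorators in that hunk come from `transformers.testing_utils`; a minimal sketch of how the test is gated (the test body here is a placeholder, not the actual test):

```python
from transformers.testing_utils import require_tf, slow

@require_tf  # skipped unless TensorFlow is installed
@slow        # skipped unless RUN_SLOW=1; temporarily commented out in this PR
def test_inference_no_head(self):
    ...
```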
I think it's already fixed in the new commit.
The documentation is not available anymore as the PR was closed or merged.
* fix the naming
* from pt in test for now
* make style
* slow test and removed from_pt
What does this PR do?
A hack was used to properly import the shared embedding weights, but it can now be removed (removing it is also convenient for the sharding PR).
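For context, here is a sketch of what such a scope hack looks like. This is reconstructed from memory of the TF BART pattern mentioned below, not the exact code removed in this PR, and the dotted scope string is illustrative; note that it is exactly this kind of dotted name that breaks the "/"-based renaming quoted in the next paragraph.

```python
import tensorflow as tf

# Sketch of a BART-style scope hack (assumption: reconstructed pattern, not
# the actual transformers code). The wrapper re-enters a scope captured at
# model-construction time, so the shared embedding variable keeps a single
# absolute name no matter which layer calls it.
class WrappedSharedEmbedding:
    def __init__(self, layer, abs_scope_name=None):
        self._layer = layer
        self._abs_scope_name = abs_scope_name

    def __call__(self, inputs):
        if self._abs_scope_name is None:
            return self._layer(inputs)
        # Re-enter the captured variable scope without pushing a new auxiliary
        # name scope, so the weight is created/resolved under the absolute name.
        with tf.compat.v1.variable_scope(
            self._abs_scope_name, auxiliary_name_scope=False
        ) as scope:
            with tf.name_scope(scope.original_name_scope):
                return self._layer(inputs)

# Capture the absolute scope once, when the model is built. The dotted string
# here is illustrative of how names like 'decoder.embed_tokens/...' arise.
with tf.compat.v1.variable_scope("decoder.embed_tokens") as shared_abs_scope_name:
    pass

shared = tf.keras.layers.Embedding(10, 4)  # stands in for the shared token embedding
embed_tokens = WrappedSharedEmbedding(shared, abs_scope_name=shared_abs_scope_name)
```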
Found this while testing #17713. In HF's `save_pretrained` and `from_pretrained`, the layer name is changed using `name = "/".join(weight_name.split("/")[1:])`. This was breaking for OPT, as the layer name was `'decoder.embed_tokens/model.decoder.embed_tokens/weight:0'` instead of `'tfopt_model/model/decoder/embed_tokens/weight:0'`. The naming is strange, and a scope hack had to be used to work around it. The hack comes from BART.
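To make the failure mode concrete, here is a minimal sketch of that renaming step applied to the two names above (`strip_model_scope` is a hypothetical helper name, not the actual transformers function):

```python
# The renaming logic quoted above: drop the first "/"-separated component,
# i.e. the top-level model scope.
def strip_model_scope(weight_name: str) -> str:
    return "/".join(weight_name.split("/")[1:])

# Well-formed TF name: every scope is separated by "/", so stripping the
# first component yields the layer-relative name HF expects.
print(strip_model_scope("tfopt_model/model/decoder/embed_tokens/weight:0"))
# -> 'model/decoder/embed_tokens/weight:0'

# Broken OPT name: the scopes are joined with "." instead of "/", so the
# result still contains dotted components and never matches the expected
# 'model/decoder/embed_tokens/weight:0'.
print(strip_model_scope("decoder.embed_tokens/model.decoder.embed_tokens/weight:0"))
# -> 'model.decoder.embed_tokens/weight:0'
```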