Improve encoder decoder model docs #17815

Threepointone4 · 2022-06-22T04:50:48Z

What does this PR do?

This PR improves the documentation of the encoder-decoder model.

Fixes # (issue)

Before submitting

- This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case. Issues link

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@patrickvonplaten

HuggingFaceDocBuilderDev · 2022-06-22T05:00:15Z

The documentation is not available anymore as the PR was closed or merged.

NielsRogge · 2022-06-22T08:34:30Z

docs/source/en/model_doc/encoder-decoder.mdx

+>>> model = EncoderDecoderModel(config=config)
+```
+
+## Initialising [`EncoderDecoderModel`] from a pretrained encoder and a pretrained decoder.


Above you use "initializing"; here you use "initialising".
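For readers without the full diff: the truncated hunk above ends a snippet that builds the model from configurations rather than from checkpoints. Below is a minimal sketch of that pattern, assuming `transformers` and PyTorch are installed. The tiny config sizes are made-up values chosen so the randomly initialized model builds quickly; they are not taken from the PR.

```python
from transformers import BertConfig, EncoderDecoderConfig, EncoderDecoderModel

# Illustrative (assumed) sizes; real BERT configs are much larger.
tiny = dict(
    vocab_size=100,
    hidden_size=32,
    num_hidden_layers=1,
    num_attention_heads=2,
    intermediate_size=64,
)
config_encoder = BertConfig(**tiny)
config_decoder = BertConfig(**tiny)

# Combine both configs; the decoder config is switched to decoder mode
# with cross-attention enabled.
config = EncoderDecoderConfig.from_encoder_decoder_configs(config_encoder, config_decoder)

# Weights are randomly initialized because no checkpoint is loaded.
model = EncoderDecoderModel(config=config)
```

A model built this way has to be trained before it produces useful output.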

NielsRogge

Thanks for improving this! Would be great to also improve the docs of VisionEncoderDecoderModel and SpeechEncoderDecoderModel.

NielsRogge · 2022-06-22T09:04:05Z

docs/source/en/model_doc/encoder-decoder.mdx

+>>> # the forward function automatically creates the correct decoder_input_ids
+>>> loss = model(input_ids=input_ids, labels=labels).loss
+```
+Detailed [colab](https://colab.research.google.com/drive/1WIk2bxglElfZewOHboPFNj8H44_VAyKE?usp=sharing#scrollTo=ZwQIEhKOrJpl) for training.


This notebook might already be outdated, cc @patrickvonplaten.

Also cc @ydshieh as we were planning on writing a blog post about them.

@NielsRogge should I remove the Colab link for now?

Think ok to leave it for now :-)
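As background for the `decoder_input_ids` comment in the hunk above: when only `labels` are passed, the decoder inputs are derived by shifting the labels one position to the right. The following is a pure-Python sketch of that idea, not the library's code; the function body, the token ids, and the choice of `0` and `101` as pad and start ids are illustrative assumptions.

```python
def shift_tokens_right(labels, pad_token_id, decoder_start_token_id):
    """Build decoder inputs from labels: prepend the start token, drop the last label."""
    shifted = []
    for row in labels:
        new_row = [decoder_start_token_id] + list(row[:-1])
        # -100 marks label positions ignored by the loss; it is not a valid
        # token id, so it is replaced by the pad token in the decoder input.
        new_row = [pad_token_id if tok == -100 else tok for tok in new_row]
        shifted.append(new_row)
    return shifted


# Made-up token ids; the second row is padded with -100.
labels = [[7, 8, 9, 102], [5, 102, -100, -100]]
decoder_input_ids = shift_tokens_right(labels, pad_token_id=0, decoder_start_token_id=101)
```

Each decoder position therefore sees the previous target token and is trained to predict the current one.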

ydshieh

Thank you for the improvement, @Threepointone4 !

ydshieh · 2022-06-22T09:31:08Z

docs/source/en/model_doc/encoder-decoder.mdx

+## Initialising [`EncoderDecoderModel`] from a pretrained encoder and a pretrained decoder.
+
+[`EncoderDecoderModel`] can be initialized from a pretrained encoder checkpoint and a pretrained decoder checkpoint. Note that any pretrained auto-encoding model, *e.g.* BERT, can serve as the encoder and both pretrained auto-encoding models, *e.g.* BERT, pretrained causal language models, *e.g.* GPT2, as well as the pretrained decoder part of sequence-to-sequence models, *e.g.* decoder of BART, can be used as the decoder.
+Depending on which architecture you choose as the decoder, the cross-attention layers might be randomly initialized.


I would be more careful here and say that only the auto-encoding models that provide a causal LM implementation can be used as the decoder.

Also, the sentence is very long; it might be a good idea to split it.

ydshieh · 2022-06-22T09:46:35Z

docs/source/en/model_doc/encoder-decoder.mdx

+...     return_tensors="pt",
+... ).input_ids
+
+>>> labels = tokenizer(


(nit) It might be a good idea to add `>>> # Let's use a summary of the above text as the target`.

patrickvonplaten · 2022-06-23T10:58:13Z

Agree with the reviews of @ydshieh and @NielsRogge! @Threepointone4, do you want to apply them? Think we can merge after :-)

patrickvonplaten · 2022-06-24T12:47:59Z

docs/source/en/model_doc/encoder-decoder.mdx

+>>> # the forward function automatically creates the correct decoder_input_ids
+>>> loss = model(input_ids=input_ids, labels=labels).loss
+```
+Detailed [colab](https://colab.research.google.com/drive/1WIk2bxglElfZewOHboPFNj8H44_VAyKE?usp=sharing#scrollTo=ZwQIEhKOrJpl) for training.


Think ok to leave it for now :-)

patrickvonplaten · 2022-06-24T12:48:15Z

Great job @Threepointone4 ! Merging :-)

* Copied all the changes from the last PR
* added in documentation_tests.txt
* Update docs/source/en/model_doc/encoder-decoder.mdx
  Co-authored-by: NielsRogge <[email protected]>
* Update docs/source/en/model_doc/encoder-decoder.mdx
  Co-authored-by: NielsRogge <[email protected]>
* Update docs/source/en/model_doc/encoder-decoder.mdx
  Co-authored-by: Yih-Dar <[email protected]>
* Update docs/source/en/model_doc/encoder-decoder.mdx
  Co-authored-by: NielsRogge <[email protected]>
* Update docs/source/en/model_doc/encoder-decoder.mdx
  Co-authored-by: NielsRogge <[email protected]>
* Update docs/source/en/model_doc/encoder-decoder.mdx
  Co-authored-by: NielsRogge <[email protected]>
* Update docs/source/en/model_doc/encoder-decoder.mdx
  Co-authored-by: NielsRogge <[email protected]>

Co-authored-by: vishwaspai <[email protected]>
Co-authored-by: NielsRogge <[email protected]>
Co-authored-by: Yih-Dar <[email protected]>

vishwaspai added 3 commits June 21, 2022 22:41

Copied all the changes from the last PR

ea7a8ff

added in documentation_tests.txt

b9e49be

Merge remote-tracking branch 'origin/main' into Improve_EncoderDecode…

2a404cb

…rModel_docs

Threepointone4 mentioned this pull request Jun 22, 2022

Improved Documentation for Encoder Decoder models #17287

Closed

3 tasks

NielsRogge reviewed Jun 22, 2022

NielsRogge approved these changes Jun 22, 2022

NielsRogge assigned patrickvonplaten Jun 22, 2022

ydshieh reviewed Jun 22, 2022

ydshieh approved these changes Jun 22, 2022

Threepointone4 and others added 7 commits June 23, 2022 18:35

Update docs/source/en/model_doc/encoder-decoder.mdx

ba02b96

Co-authored-by: NielsRogge <[email protected]>

Update docs/source/en/model_doc/encoder-decoder.mdx

fb2500d

Co-authored-by: NielsRogge <[email protected]>

Update docs/source/en/model_doc/encoder-decoder.mdx

759412b

Co-authored-by: Yih-Dar <[email protected]>

Update docs/source/en/model_doc/encoder-decoder.mdx

a47c932

Co-authored-by: NielsRogge <[email protected]>

Update docs/source/en/model_doc/encoder-decoder.mdx

a966aa6

Co-authored-by: NielsRogge <[email protected]>

Update docs/source/en/model_doc/encoder-decoder.mdx

3233205

Co-authored-by: NielsRogge <[email protected]>

Update docs/source/en/model_doc/encoder-decoder.mdx

1997490

Co-authored-by: NielsRogge <[email protected]>

patrickvonplaten approved these changes Jun 24, 2022

patrickvonplaten merged commit c2c0d9d into huggingface:main Jun 24, 2022

NielsRogge mentioned this pull request Jul 23, 2022

[EncoderDecoder] Improve docs #18271

Merged

lappemic mentioned this pull request May 13, 2024

Improve EncoderDecoderModel docs #16135

Open
