-
Notifications
You must be signed in to change notification settings - Fork 27.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding resource section to GPT-J docs #21270
Conversation
The documentation is not available anymore as the PR was closed or merged. |
Hello, I have been currently working on finding resources for GPT-J, and mainly I have been using the links mentioned in #20055 and searching GPT-J in each of the links. I found a few links, but I feel this is not the best way to find the resources. Can you share some tips for how you were able to find more resources? @stevhliu What I have so far: GPT-J Description: Blog Posts: NielsRogge's Transformers Tutorials: |
Thanks for your work, that's a great start and I think you have most of them! You can also add:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice, thanks for adding these! Left a few comments on formatting and using the right GPT model, after which we should be good :)
docs/source/en/model_doc/gptj.mdx
Outdated
- A blog post introducing GPT-J [GPT-J-6B: 6B JAX-Based Transformer](https://arankomatsuzaki.wordpress.com/2021/06/04/gpt-j/) 🌎 | ||
- A notebook for [GPT-J-6B Inference Demo](https://colab.research.google.com/github/kingoflolz/mesh-transformer-jax/blob/master/colab_demo.ipynb) | ||
- [Causal language modeling](https://huggingface.co/course/en/chapter7/6?fw=pt#training-a-causal-language-model-from-scratch) chapter of the 🤗 Hugging Face Course. | ||
- [`GPT2LMHeadModel`] is supported by this [causal language modeling example script](https://github.com/huggingface/transformers/tree/main/examples/pytorch/language-modeling#gpt-2gpt-and-causal-language-modeling), [text generation example script](https://github.com/huggingface/transformers/tree/main/examples/pytorch/text-generation), and [notebook](https://colab.research.google.com/github/huggingface/notebooks/blob/main/examples/language_modeling.ipynb). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Replace GPT2 with GPTJ, same for the TF and Flax implementations
- [`GPT2LMHeadModel`] is supported by this [causal language modeling example script](https://github.com/huggingface/transformers/tree/main/examples/pytorch/language-modeling#gpt-2gpt-and-causal-language-modeling), [text generation example script](https://github.com/huggingface/transformers/tree/main/examples/pytorch/text-generation), and [notebook](https://colab.research.google.com/github/huggingface/notebooks/blob/main/examples/language_modeling.ipynb). | |
- [`GPTJCausalLM`] is supported by this [causal language modeling example script](https://github.com/huggingface/transformers/tree/main/examples/pytorch/language-modeling#gpt-2gpt-and-causal-language-modeling), [text generation example script](https://github.com/huggingface/transformers/tree/main/examples/pytorch/text-generation), and [notebook](https://colab.research.google.com/github/huggingface/notebooks/blob/main/examples/language_modeling.ipynb). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have updated the labels. However, looking at the links, they mention GPT-2 but GPT-J is not mentioned as being supported. For example the script mentioned here: https://github.com/huggingface/transformers/tree/main/examples/pytorch/language-modeling#gpt-2gpt-and-causal-language-modeling.
My questions are:
(1) I am a bit confused about what these links are. Are these just scripts that make the modeling process easier? Clarification on what these are would be appreciated.
(2) Are there any scripts present which support GPT-J? I had a look but couldn't find anything.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- These are example scripts for how to finetune a model for a certain task if you prefer to just run a script instead of a notebook.
- The script above for causal language modeling (CLM) should support GPT-J and all models in general with a CLM pretraining objective (so anything in the GPT family).
It looks like the formatting for the docs is still not correct..the bulletpoints are all jumbled up. Looking into this... |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One last typo, and the formatting looks good on my end! 👍
Co-authored-by: Steven Liu <[email protected]>
I have marked the pull request as ready to review 👍 @stevhliu |
…transformers into Add_resources_for_GPT-J Fixed one of the labels
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for adding those!
* Added resource section to GPT-J docs * Added most of the links found * Addressing review comments * Fixing formatting * Update docs/source/en/model_doc/gptj.mdx Co-authored-by: Steven Liu <[email protected]> * Fixing one of the labels --------- Co-authored-by: Steven Liu <[email protected]>
* Added resource section to GPT-J docs * Added most of the links found * Addressing review comments * Fixing formatting * Update docs/source/en/model_doc/gptj.mdx Co-authored-by: Steven Liu <[email protected]> * Fixing one of the labels --------- Co-authored-by: Steven Liu <[email protected]>
What does this PR do?
Adds resources section to the GPT-J documents.
Fixes #20055 (issue)
Before submitting
Pull Request section?
to it if that's the case.
documentation guidelines, and
here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@sgugger @stevhliu @MKhalusova