
Add functional transformer demo #971

Open · wants to merge 18 commits into main
Conversation

@kiya00 (Collaborator) commented Aug 15, 2024

This notebook demonstrates the acceleration of a transformer model, implemented in a functional style, using Thunder. Key highlights:

  • Illustrates Thunder-compatible PyTorch code (this does not mean that more complex, object-oriented code cannot be handled)
  • Showcases successful execution of basic prompts using pre-trained weights
  • Provides a clear example of performance gains achieved through Thunder optimization (the only transformations involved here are the initial trace construction and `transform_for_execution`)

The primary objective is to explain the characteristics of Thunder-friendly code and verify functionality with loaded pre-trained weights.

The code used in the notebook is adapted from https://gist.github.com/nreHieW/a4ae05d216c5326c9fb9a70fcdda3274
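
For reference, here is a minimal sketch of what "functional style" means in this context: the weights are passed as explicit arguments rather than living on an nn.Module. The shapes and names are hypothetical, not the notebook's actual code.

```python
import torch
import thunder

# Minimal functional-style block: weights are explicit arguments rather
# than attributes on an nn.Module. (Hypothetical example, not the
# notebook's actual model code.)
def mlp(x, w1, b1, w2, b2):
    h = torch.nn.functional.linear(x, w1, b1)
    h = torch.nn.functional.gelu(h)
    return torch.nn.functional.linear(h, w2, b2)

w1, b1 = torch.randn(256, 64), torch.randn(256)
w2, b2 = torch.randn(64, 256), torch.randn(64)
x = torch.randn(8, 64)

jitted = thunder.jit(mlp)        # thunder.jit accepts plain functions
out = jitted(x, w1, b1, w2, b2)
```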


@t-vi (Collaborator) commented Aug 15, 2024

Hi @kiya00, thank you for writing thunder tutorials!

I would be very keen to not give the impression that users need to convert their code to functional in order to run it through thunder. Is there a particular reason for starting at a functional transformer here?
I would venture that the same code with the computation in forward and weights as modules would work as well?

@IvanYashchuk (Collaborator) commented:

> I would be very keen to not give the impression that users need to convert their code to functional in order to run it through thunder.

That is certainly not part of the plan. Do you have suggestions on what should be changed to avoid making this impression?

> Is there a particular reason for starting at a functional transformer here?

It's the simplest form of PyTorch code apart from the imperative style without any functions.

> I would venture that the same code with the computation in forward and weights as modules would work as well?

Of course, LitGPT is an example of that.
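
For illustration, a minimal sketch of that module-style path, using a hypothetical toy block rather than LitGPT itself:

```python
import torch
import thunder

# Hypothetical toy module: computation in forward(), weights on the module.
class TinyBlock(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = torch.nn.Linear(64, 256)
        self.fc2 = torch.nn.Linear(256, 64)

    def forward(self, x):
        return self.fc2(torch.nn.functional.gelu(self.fc1(x)))

tm = thunder.jit(TinyBlock())    # no functional rewrite required
out = tm(torch.randn(8, 64))
```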

@t-vi (Collaborator) commented Aug 16, 2024

> That is certainly not part of the plan. Do you have suggestions on what should be changed to avoid making this impression?

I think it's mainly a wording thing. The initial wording looked a bit like the functional part was the key to getting it to run with thunder, e.g.

> This will give us some insight into how to convert a PyTorch module into a simple "functional" Python function, allowing for seamless integration with Thunder.

seems quite odd to me.

At the other end of the spectrum would be something like: "Usually, you can just apply thunder.jit to any PyTorch Module and this is the recommended way, but today we want to use thunder.jit with a transformer that is implemented as a function. Along the way we highlight a couple of things that are not supported by thunder (yet?) and change them...."

The other part is that jitting a module and grabbing the thunder.last_traces(tm) would also give you a fully functional transformer, and indeed there are cases where this is very useful.
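
A minimal sketch of that trace-grabbing path, with a toy module standing in for a transformer:

```python
import torch
import thunder

model = torch.nn.Sequential(torch.nn.Linear(4, 4), torch.nn.Tanh())
tm = thunder.jit(model)
tm(torch.randn(2, 4))             # run once so traces are recorded

traces = thunder.last_traces(tm)  # traces from the most recent execution
print(traces[-1])                 # the final trace: a purely functional
                                  # version of the module's computation
```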

@kiya00 (Collaborator, Author) commented Aug 26, 2024

Hi @t-vi @IvanYashchuk, I rephrased it a bit. The main purpose of this notebook is to give an example of writing a simple functional Python function for a PyTorch module and to show that Thunder applies to this version as well; there is no implication that the function needs to be converted in any specific way to be compatible with Thunder. Sorry for the confusion in the initial draft. I hope this revision conveys my intention more accurately; please take a look and check whether it expresses what I meant.

@kiya00 kiya00 marked this pull request as ready for review August 27, 2024 08:08
@kiya00 (Collaborator, Author) commented Aug 27, 2024

If we run this notebook in CI using the Hugging Face weights, HF_TOKEN is needed, and the weights need to be downloaded to Meta-Llama-3-8B/consolidated.00.pth in the same folder as the notebook.
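
A minimal sketch of how the notebook's cells could be guarded (the flag name is hypothetical; the path follows the comment above):

```python
import os
from pathlib import Path

# Hypothetical guard: only run the pre-trained-weight cells when the token
# and the checkpoint are both available (so plain CI runs skip them).
ckpt = Path("Meta-Llama-3-8B/consolidated.00.pth")
run_pretrained_demo = bool(os.environ.get("HF_TOKEN")) and ckpt.exists()
if not run_pretrained_demo:
    print("HF_TOKEN or checkpoint missing; skipping the pre-trained demo.")
```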

@t-vi (Collaborator) commented Aug 27, 2024

I think it would be OK to skip it in the CI. (We have not been running full models in it.)

@IvanYashchuk (Collaborator) commented:

> If we run this notebook in CI using the Hugging Face weights, HF_TOKEN is needed, and the weights need to be downloaded to Meta-Llama-3-8B/consolidated.00.pth in the same folder as the notebook.

Is there any other popular model that is not behind a registration wall?

@IvanYashchuk (Collaborator) commented:

@lantiga, @t-vi could you please review this new tutorial?

@t-vi (Collaborator) left a review comment:

So first, it is a great tutorial.

I'm still not thrilled about the introduction. These two things:

> easily understood and optimized by both developers and compilers.

> By the end of this notebook, you'll have a clear understanding of how functional programming principles can be leveraged to create more efficient and compiler-friendly transformer models.

are relatively dubious to me. How would a developer more easily understand and optimize a functional model? If that is so, why does PyTorch still use the modular style?

Maybe we can put it more like:

As part of compiling models such as LitGPT, Thunder produces a functional version - the computation trace and function - of the model. Here we implement such a functional model directly to understand what is usually done behind the scenes.

The other question I'd have is whether our use of the code is OK here (did we ask the gist author, and do we think the notebook is affected by the gist's copyright)?

@kiya00 (Collaborator, Author) commented Sep 25, 2024

> The other question I'd have is whether our use of the code is OK here (did we ask the gist author, and do we think the notebook is affected by the gist's copyright)?

I've left a message for the author on the gist; hopefully we'll get some feedback soon.

@t-vi (Collaborator) commented Sep 26, 2024

[image attachment]

Supergood!

@t-vi (Collaborator) commented Sep 26, 2024

In a v2 (absolutely not required in this PR), it might be interesting to compare the functional version built here to the computation trace from jitting LitGPT.

Commit:
* Update the type signature of `rope`
* Update the docstring of `rotate`
Co-authored-by: beverlylytle <[email protected]>