TF: generate without `tf.TensorArray` #17801

gante · 2022-06-21T15:02:28Z

What does this PR do?

Some models, like XLNet, need more than just the previous token when past is used. This PR solves this problem with the help of some refactoring -- we no longer use TensorArray, instead we scatter updates into a fixed-size tensor. This refactor simplifies generate, especially beam_search, which may prove to be helpful in enabling XLA.

Slow tests have been run for the usual generate models (gpt2, t5, rag, speech_to_text, encoder_decoder, vision_encoder_decoder, bart).

Why was this refactor needed?

As it can be read in this issue, TensorArray is meant to be used as a write-once array, anything else falls in the unexpected behavior domain -- in other words, our use was dangerous. The original solution to the XLNet problem was to read all existing tokens from the TensorArray, using the same logic as in this PR, but it failed with XLA -- and the behavior depended on what was written into the variable on its first write. Since we use fixed-size tensors, a normal tensor works just fine, and with simpler code (assuming the reader is familiar with how scatter works :D ).

HuggingFaceDocBuilderDev · 2022-06-21T15:12:55Z

The documentation is not available anymore as the PR was closed or merged.

gante · 2022-06-21T18:15:31Z

cc @ydshieh -- this PR fixes the XLNet generate error we have been seeing :)

patrickvonplaten · 2022-06-21T23:25:11Z

Cool!

Rocketknight1

LGTM! (And agree about TensorArray being cursed). Did you see any performance changes from doing it this way?

gante · 2022-06-23T11:27:22Z

@Rocketknight1 no differences in terms of execution speed 👍

GPT2 sample on a 3090, average of 10 runs (excluding compilation time)

Eager: 884 ms -> 888 ms
XLA: 29.2 ms -> 29.3 ms
JAX: 19.7 ms

gante added 5 commits June 21, 2022 18:08

playing around with scatter updates

fd49326

working xlnet and gpt2 on greedy search

5bfe8c0

propagate changes to sample

c28f081

update beam search

6171c30

handle missing pad token

f2b09d5

gante force-pushed the xlnet_generate branch from 6dd8625 to f2b09d5 Compare June 21, 2022 18:08

gante marked this pull request as ready for review June 21, 2022 18:08

gante changed the title ~~TF: generate without Tensor Array~~ TF: generate without TensorArray Jun 21, 2022

gante changed the title ~~TF: generate without TensorArray~~ TF: generate without tf.TensorArray Jun 21, 2022

gante requested review from Rocketknight1 and patrickvonplaten June 21, 2022 18:14

patrickvonplaten approved these changes Jun 21, 2022

View reviewed changes

Rocketknight1 approved these changes Jun 22, 2022

View reviewed changes

gante merged commit 5cce307 into huggingface:main Jun 23, 2022

gante deleted the xlnet_generate branch June 23, 2022 11:28

younesbelkada pushed a commit to younesbelkada/transformers that referenced this pull request Jun 25, 2022

TF: generate without tf.TensorArray (huggingface#17801)

481a878

younesbelkada pushed a commit to younesbelkada/transformers that referenced this pull request Jun 29, 2022

TF: generate without tf.TensorArray (huggingface#17801)

66c4b0c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TF: generate without `tf.TensorArray` #17801

TF: generate without `tf.TensorArray` #17801

gante commented Jun 21, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Jun 21, 2022 •

edited

Loading

gante commented Jun 21, 2022

patrickvonplaten commented Jun 21, 2022

Rocketknight1 left a comment

gante commented Jun 23, 2022

TF: generate without tf.TensorArray #17801

TF: generate without tf.TensorArray #17801

Conversation

gante commented Jun 21, 2022 • edited Loading

What does this PR do?

Why was this refactor needed?

HuggingFaceDocBuilderDev commented Jun 21, 2022 • edited Loading

gante commented Jun 21, 2022

patrickvonplaten commented Jun 21, 2022

Rocketknight1 left a comment

Choose a reason for hiding this comment

gante commented Jun 23, 2022

TF: generate without `tf.TensorArray` #17801

TF: generate without `tf.TensorArray` #17801

gante commented Jun 21, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Jun 21, 2022 •

edited

Loading