
[T5] Unused n_positions and max_position_embeddings. #8047

Closed
patrickvonplaten opened this issue Oct 26, 2020 · 0 comments · Fixed by #8518
patrickvonplaten commented Oct 26, 2020

The T5Config has the parameter n_positions set to 512, with max_position_embeddings defined as an alias for n_positions. However, neither max_position_embeddings nor n_positions is used in T5Model, and T5 is not limited to max_position_embeddings. E.g.:

import torch
from transformers import T5Model

model = T5Model.from_pretrained("t5-small")

model.config.max_position_embeddings  # shows 512

input_ids = torch.tensor([600 * [0]])  # input of size > 512, i.e. longer than max_position_embeddings

model(input_ids, decoder_input_ids=input_ids)  # works fine

I think we should delete these parameters.

@thomwolf - do you remember why we added max_position_embeddings and n_positions to T5? The model does not seem to use these params, and it also should not be limited to 512 because it uses relative position embeddings.
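
For illustration, here is a rough sketch of T5-style relative position bucketing (not the exact code in modeling_t5.py; the function name, defaults, and simplified bidirectional-only handling are assumptions). Relative distances beyond max_exact fall into logarithmically spaced buckets that are capped at the last bucket, so the attention bias is defined for any sequence length and no absolute position table is needed:

import math
import torch

def relative_position_bucket(relative_position, num_buckets=32, max_distance=128):
    # Bidirectional case: half the buckets for positive offsets, half for negative.
    num_buckets //= 2
    buckets = (relative_position > 0).long() * num_buckets
    rel = relative_position.abs()
    max_exact = num_buckets // 2
    is_small = rel < max_exact
    # Distances >= max_exact map to log-spaced buckets, capped at the last bucket.
    rel_if_large = max_exact + (
        torch.log(rel.float().clamp(min=1) / max_exact)
        / math.log(max_distance / max_exact)
        * (num_buckets - max_exact)
    ).long()
    rel_if_large = torch.min(rel_if_large, torch.full_like(rel_if_large, num_buckets - 1))
    return buckets + torch.where(is_small, rel, rel_if_large)

positions = torch.arange(600)                      # longer than 512
rel_pos = positions[None, :] - positions[:, None]  # (600, 600) relative offsets
print(relative_position_bucket(rel_pos).max())     # still a valid bucket index < 32

Because the bucket index saturates for large distances, sequences longer than 512 simply reuse the outermost buckets rather than indexing past a fixed embedding table.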
