Is your feature request related to a problem? Please describe.
SentenceTransformer models have a `max_seq_length` attribute.
In theory, we could set it via `model_max_length` in `tokenizer_kwargs`, which should eventually set that attribute here.
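For reference, a minimal sketch of that intended path through one of the Haystack embedders (the model choice and the 512 cap are arbitrary example values):

```python
from haystack.components.embedders import SentenceTransformersDocumentEmbedder

# Intended (but, as described below, unreliable) way to cap the sequence length:
# model_max_length in tokenizer_kwargs should propagate to the underlying
# SentenceTransformer's max_seq_length once the model is loaded.
embedder = SentenceTransformersDocumentEmbedder(
    model="BAAI/bge-m3",
    tokenizer_kwargs={"model_max_length": 512},
)
embedder.warm_up()
```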
However, this seems to be unreliable. We saw it with bge-m3, but it could also be the case for other models (Colab).
This can result in out-of-memory (OOM) errors when embedding.
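Roughly what the linked Colab shows, reproduced directly at the sentence-transformers level (assuming bge-m3 ships a large `max_seq_length` default in its config, e.g. 8192):

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer(
    "BAAI/bge-m3",
    tokenizer_kwargs={"model_max_length": 512},
)
# Expected: 512. Observed: the value from the model's own config JSON
# (e.g. 8192 for bge-m3), because model_max_length is effectively ignored.
print(model.max_seq_length)
```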
Update: found the "issue". `max_seq_length` is read into the kwargs from this config JSON at this point in `_load_sbert_model`, so `max_seq_length` is not `None` here and `model_max_length` from `tokenizer_kwargs` is never used.
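In other words, the loading logic effectively behaves like this (a simplified paraphrase of the control flow described above, not the actual `_load_sbert_model` source; variable names are illustrative):

```python
# max_seq_length is populated from the module's config JSON first...
max_seq_length = module_config.get("max_seq_length")  # not None for bge-m3

if max_seq_length is None:
    # ...and only this branch would ever consult the tokenizer, so a
    # model_max_length passed via tokenizer_kwargs never takes effect
    # when the config JSON already provides a value.
    max_seq_length = getattr(tokenizer, "model_max_length", None)
```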
Describe the solution you'd like
It would be good to have the ability to set `max_seq_length` in the (currently three) SentenceTransformers components, as in v1.
We could possibly also intercept `tokenizer_kwargs` and use `model_max_length` from it if it is set, as sketched below.
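A hypothetical sketch of that interception inside a component's `warm_up()` (names and placement are illustrative, not the actual implementation):

```python
from sentence_transformers import SentenceTransformer

def warm_up(self):
    # Hypothetical names: self.model is the model id, self._backend the loaded model.
    self._backend = SentenceTransformer(self.model, tokenizer_kwargs=self.tokenizer_kwargs or {})
    # Override whatever max_seq_length was read from the config JSON.
    model_max_length = (self.tokenizer_kwargs or {}).get("model_max_length")
    if model_max_length is not None:
        self._backend.max_seq_length = model_max_length
```

This works because SentenceTransformer exposes `max_seq_length` as a settable property, so assigning to it after loading is also a viable user-side workaround in the meantime.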