
Fix to avoid overflow and get rid of model_max_length #319

Merged
I8dNLo merged 9 commits into main from 208-max-length on Aug 14, 2024
Conversation

@I8dNLo (Contributor) commented on Aug 6, 2024

Got rid of the hard-coded max_length=512 parameter and replaced it with maxsize to handle such situations; check out model_max_length in the tokenizer config.
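
A minimal sketch of what the described change could look like, assuming "maxsize" refers to sys.maxsize; resolve_max_length is a hypothetical helper for illustration, not fastembed's actual function:

```python
# Sketch only: fall back to an effectively unbounded value instead of a
# hard-coded 512 when the tokenizer config provides no usable limit.
import json
import sys

def resolve_max_length(tokenizer_config_path: str) -> int:
    """Hypothetical helper: read model_max_length, defaulting to sys.maxsize."""
    with open(tokenizer_config_path) as f:
        config = json.load(f)
    return int(config.get("model_max_length", sys.maxsize))
```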

@I8dNLo requested review from generall and joein on August 6, 2024 at 12:13
@joein added the bug (Something isn't working) label on Aug 6, 2024
@joein (Member) left a comment


This does not fix the issue with the mentioned model

The correct value is actually under another field:
"max_length": 512

Also setting a parameter this high might be dangerous (e.g. if we ever set padding to max_length)

I would rather check both fields, take the minimum one, and if both are absent, set some default value (see the sketch at the end of this comment).
But tbh, a default value also does not seem like the greatest idea; I think we should just be more careful with the models we are adding.

(traceback regarding "does not fix the issue")

2024-08-06 19:45:13.789327 [E:onnxruntime:, sequential_executor.cc:516 ExecuteKernel] Non-zero status code returned while running Add node. Name:'/embeddings/Add_1' Status Message: /Users/runner/work/1/s/onnxruntime/core/providers/cpu/math/element_wise_ops.h:560 void onnxruntime::BroadcastIterator::Append(ptrdiff_t, ptrdiff_t) axis == 1 || axis == largest was false. Attempting to broadcast an axis by a dimension other than 1. 512 by 10002

Update:
Some models' tokenizer configs have different values for max_length and model_max_length.
In this particular example it is possible to use the model with RPE as written in the card.
We do not support such modifications, but we should investigate the existing models to choose the right way to fix the issue.
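
A rough sketch of the "check both fields, take the minimum, fall back to a default" idea from this review; pick_max_length, the plausibility cutoff, and the default value are assumptions for illustration, not fastembed code:

```python
# Sketch of the proposed approach: consider both "max_length" and
# "model_max_length", take the smaller sane value, and fall back to a
# conservative default only if neither is usable.
import json

DEFAULT_MAX_LENGTH = 512  # assumed fallback; the thread questions whether any default is wise

def pick_max_length(tokenizer_config_path: str) -> int:
    with open(tokenizer_config_path) as f:
        config = json.load(f)
    candidates = []
    for key in ("max_length", "model_max_length"):
        value = config.get(key)
        # Some configs use a huge sentinel value to mean "unbounded";
        # treat anything implausibly large as absent.
        if isinstance(value, int) and 0 < value < 1_000_000:
            candidates.append(value)
    return min(candidates) if candidates else DEFAULT_MAX_LENGTH
```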

@I8dNLo requested a review from joein on August 13, 2024 at 15:25
Review thread on fastembed/common/preprocessor_utils.py (outdated, resolved)
@@ -148,7 +148,7 @@ def decompress_to_cache(cls, targz_path: str, cache_dir: str):
# Open the tar.gz file
with tarfile.open(targz_path, "r:gz") as tar:
# Extract all files into the cache directory
-        tar.extractall(path=cache_dir)
+        tar.extractall(path=cache_dir, filter="fully_trusted")
A Member left a comment

Suggested change
-        tar.extractall(path=cache_dir, filter="fully_trusted")
+        tar.extractall(path=cache_dir)
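
For context on the suggestion above, a hedged sketch of tarfile's extraction filters: the filter argument exists in Python 3.12+ (and in security backports of earlier versions); "fully_trusted" keeps the legacy extract-everything behaviour, while "data" rejects dangerous members such as absolute paths or links escaping the destination. The archive path and destination below are placeholders:

```python
# Sketch only, assuming a Python version that supports the extraction filter.
import tarfile

with tarfile.open("archive.tar.gz", "r:gz") as tar:
    tar.extractall(path="cache_dir", filter="data")
```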

@I8dNLo merged commit 62607c2 into main on Aug 14, 2024
17 checks passed
@I8dNLo deleted the 208-max-length branch on August 14, 2024 at 12:59
Labels: bug (Something isn't working)
Projects: None yet
2 participants