support for jinaai/jina-reranker-v2-base-multilingual model #2004

bash99 · 2024-08-30T10:48:18Z

System Info

optimum                           1.21.4
Python 3.11.9
Ubuntu 22.04.4 LTS
NVIDIA-SMI 550.90.07              Driver Version: 550.90.07      CUDA Version: 12.4
GPU 2080 ti

Who can help?

@JingyaHuang @michaelbenayoun

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
My own task or dataset (give details below)

Reproduction (minimal, reproducible, runnable)

When I run jinaai/jina-reranker-v2-base-multilingual with the infinity-emb server:

pip install infinity-emb[all]
infinity_emb v2 --model-id jinaai/jina-reranker-v2-base-multilingual --engine optimum

The start-up logs:

2024-08-30 18:21:25.407568117 [W:onnxruntime:, session_state.cc:1166 VerifyEachNodeIsAssignedToAnEp] Some nodes were not assigned to the preferred execution providers which may or may not have an negative impact on performance. e.g. ORT explicitly assigns shape related ops to CPU to improve perf.
2024-08-30 18:21:25.407597785 [W:onnxruntime:, session_state.cc:1168 VerifyEachNodeIsAssignedToAnEp] Rerunning with verbose output on a non-minimal build will show node assignments.
INFO     2024-08-30 18:21:26,115 infinity_emb INFO: Optimizing model                              utils_optimum.py:139
WARNING  2024-08-30 18:21:26,118 infinity_emb WARNING: Optimization failed with Tried to use ORTOptimizer for the model type , but it is not available yet. Please open an issue or submit a PR at https://github.com/huggingface/optimum.. Going to use the unoptimized model.      utils_optimum.py:168

With the unoptimized model, the optimum engine is even slower than the default torch engine of infinity-emb, which is started as:

infinity_emb v2 --model-id jinaai/jina-reranker-v2-base-multilingual
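To put a number on "slower", the same rerank request could be timed once per engine. The following is a minimal sketch, not part of infinity-emb or optimum: `time_rerank` and its `rerank` argument are hypothetical names, and `rerank` is assumed to be any caller-supplied function that sends one query and a list of documents to whichever server instance is currently running.

```python
import statistics
import time
from typing import Callable, Sequence


def time_rerank(
    rerank: Callable[[str, Sequence[str]], object],  # hypothetical caller-supplied function
    query: str,
    documents: Sequence[str],
    warmup: int = 2,
    repeats: int = 10,
) -> dict:
    """Measure wall-clock latency of `rerank` over several runs."""
    # Warm-up calls are not timed, so one-off start-up costs do not skew the result.
    for _ in range(warmup):
        rerank(query, documents)

    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        rerank(query, documents)
        samples.append(time.perf_counter() - start)

    return {
        "runs": len(samples),
        "median_s": statistics.median(samples),
        "mean_s": statistics.fmean(samples),
    }
```

Running this with identical inputs against the server started with `--engine optimum` and then against the one started with the default engine would give two medians to compare.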

Expected behavior

The optimum engine should be faster than the torch engine when the model is supported, as it is for example with "netease-youdao/Rerank".

bash99 added the "bug" label (Something isn't working) on Aug 30, 2024

dacorvo added the "onnxruntime" label (Related to ONNX Runtime) on Oct 8, 2024
