BUG: fix embedding model gte-Qwen2 dimensions #2479

JinCheng666 · 2024-10-24T05:27:40Z

Fixes #2471
fix embedding model gte-Qwen2 dimensions

wrong dimensions：

https://github.com/xorbitsai/inference/blob/48a07e8af3d015908c8a55f91c7baea3afbbb809/xinference/model/embedding/model_spec.json#L236C1-L237C1

correct dimensions：

inference/xinference/model/embedding/model_spec_modelscope.json

Line 238 in 48a07e8

"dimensions": 4096,

base on the right message：
https://huggingface.co/Alibaba-NLP/gte-Qwen1.5-7B-instruct
https://modelscope.cn/models/iic/gte_Qwen1.5-7B-instruct/summary

fix embedding model gte-Qwen2 dimensions

JinCheng666

right

JinCheng666 · 2024-10-24T05:34:46Z

I don't know if this PR conforms to the project rules, please point out if there is anything wrong

JinCheng666 · 2024-10-24T07:04:20Z

i dont know why the python3.11 check fail. it seems like not problems with the code.
how to rerun the checks? @qinxuye

qinxuye · 2024-10-24T07:14:29Z

i dont know why the python3.11 check fail. it seems like not problems with the code. how to rerun the checks? @qinxuye

That's OK, can you run python doc/source/gen_docs.py?

This will change many files, you can revert others but the gte-qwen2.rst then commit.

https://github.com/xorbitsai/inference/blob/main/doc/source/models/builtin/embedding/gte-qwen2.rst

BUG: fix embedding model gte-Qwen2 dimensions

JinCheng666 · 2024-10-24T09:51:45Z

i dont know why the python3.11 check fail. it seems like not problems with the code. how to rerun the checks? @qinxuye

That's OK, can you run python doc/source/gen_docs.py?

This will change many files, you can revert others but the gte-qwen2.rst then commit.

https://github.com/xorbitsai/inference/blob/main/doc/source/models/builtin/embedding/gte-qwen2.rst

I ran the code locally and got an error, so I manually adjusted the contents of the file, not sure if it was appropriate？

window pc，The error information is as follows：

PS D:\work\code\inference\doc\source> python .\gen_docs.py
Traceback (most recent call last):
  File "D:\work\code\inference\doc\source\gen_docs.py", line 284, in <module>
    main()
  File "D:\work\code\inference\doc\source\gen_docs.py", line 54, in main
    models = json.load(model_file)
  File "C:\Users\liujincheng\.conda\envs\openaitest\lib\json\__init__.py", line 293, in load
    return loads(fp.read(),
UnicodeDecodeError: 'gbk' codec can't decode byte 0x80 in position 7227: illegal multibyte sequence

qinxuye · 2024-10-24T10:35:50Z

i dont know why the python3.11 check fail. it seems like not problems with the code. how to rerun the checks? @qinxuye

That's OK, can you run python doc/source/gen_docs.py?
This will change many files, you can revert others but the gte-qwen2.rst then commit.
https://github.com/xorbitsai/inference/blob/main/doc/source/models/builtin/embedding/gte-qwen2.rst

I ran the code locally and got an error, so I manually adjusted the contents of the file, not sure if it was appropriate？

window pc，The error information is as follows：
PS D:\work\code\inference\doc\source> python .\gen_docs.py
Traceback (most recent call last):
  File "D:\work\code\inference\doc\source\gen_docs.py", line 284, in <module>
    main()
  File "D:\work\code\inference\doc\source\gen_docs.py", line 54, in main
    models = json.load(model_file)
  File "C:\Users\liujincheng\.conda\envs\openaitest\lib\json\__init__.py", line 293, in load
    return loads(fp.read(),
UnicodeDecodeError: 'gbk' codec can't decode byte 0x80 in position 7227: illegal multibyte sequence

Should be OK, let's wait for CI to complete.

JinCheng666 · 2024-10-24T10:40:26Z

i dont know why the python3.11 check fail. it seems like not problems with the code. how to rerun the checks? @qinxuye

That's OK, can you run python doc/source/gen_docs.py?
This will change many files, you can revert others but the gte-qwen2.rst then commit.
https://github.com/xorbitsai/inference/blob/main/doc/source/models/builtin/embedding/gte-qwen2.rst

I ran the code locally and got an error, so I manually adjusted the contents of the file, not sure if it was appropriate？
window pc，The error information is as follows：
PS D:\work\code\inference\doc\source> python .\gen_docs.py
Traceback (most recent call last):
  File "D:\work\code\inference\doc\source\gen_docs.py", line 284, in <module>
    main()
  File "D:\work\code\inference\doc\source\gen_docs.py", line 54, in main
    models = json.load(model_file)
  File "C:\Users\liujincheng\.conda\envs\openaitest\lib\json\__init__.py", line 293, in load
    return loads(fp.read(),
UnicodeDecodeError: 'gbk' codec can't decode byte 0x80 in position 7227: illegal multibyte sequence
Should be OK, let's wait for CI to complete.

so cool, my first pr is coming soon...

qinxuye

LGTM, thanks for your contribution!

JinCheng666 · 2024-10-24T12:49:45Z

LGTM, thanks for your contribution!

thank you vary much ! happy 1024 day！ @qinxuye

fix embedding model gte-Qwen2 dimensions

0ff0a1f

fix embedding model gte-Qwen2 dimensions

XprobeBot added this to the v0.15 milestone Oct 24, 2024

JinCheng666 commented Oct 24, 2024

View reviewed changes

JinCheng666 mentioned this pull request Oct 24, 2024

gte-Qwen2 Incorrect model information / 模型信息有误 #2471

Closed

3 tasks

qinxuye changed the title ~~fix embedding model gte-Qwen2 dimensions~~ BUG: fix embedding model gte-Qwen2 dimensions Oct 24, 2024

XprobeBot added the bug Something isn't working label Oct 24, 2024

JinCheng666 added 2 commits October 24, 2024 17:44

Update gte-qwen2.rst

f3a55ff

BUG: fix embedding model gte-Qwen2 dimensions

Merge branch 'xorbitsai:main' into main

14c3782

qinxuye approved these changes Oct 24, 2024

View reviewed changes

qinxuye merged commit 6fd879c into xorbitsai:main Oct 24, 2024
10 of 13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: fix embedding model gte-Qwen2 dimensions #2479

BUG: fix embedding model gte-Qwen2 dimensions #2479

JinCheng666 commented Oct 24, 2024 •

edited by qinxuye

Loading

JinCheng666 left a comment

JinCheng666 commented Oct 24, 2024

JinCheng666 commented Oct 24, 2024

qinxuye commented Oct 24, 2024 •

edited

Loading

JinCheng666 commented Oct 24, 2024 •

edited

Loading

qinxuye commented Oct 24, 2024

JinCheng666 commented Oct 24, 2024

qinxuye left a comment

JinCheng666 commented Oct 24, 2024

BUG: fix embedding model gte-Qwen2 dimensions #2479

BUG: fix embedding model gte-Qwen2 dimensions #2479

Conversation

JinCheng666 commented Oct 24, 2024 • edited by qinxuye Loading

JinCheng666 left a comment

Choose a reason for hiding this comment

JinCheng666 commented Oct 24, 2024

JinCheng666 commented Oct 24, 2024

qinxuye commented Oct 24, 2024 • edited Loading

JinCheng666 commented Oct 24, 2024 • edited Loading

qinxuye commented Oct 24, 2024

JinCheng666 commented Oct 24, 2024

qinxuye left a comment

Choose a reason for hiding this comment

JinCheng666 commented Oct 24, 2024

JinCheng666 commented Oct 24, 2024 •

edited by qinxuye

Loading

qinxuye commented Oct 24, 2024 •

edited

Loading

JinCheng666 commented Oct 24, 2024 •

edited

Loading