Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

修正Hugging Face链接引用 #7

Merged
merged 1 commit into from
Jun 15, 2023
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@

### 2023.06.08

第一期的模型 [M3E models]([moka-ai/m3e-base](https://huggingface.co/moka-ai/m3e-base)) 已经开源在了 HuggingFace 上,在中文文本分类和文本检索上都优于 `openai text-embedding-ada-002`,详请请参考 [M3E models README]([moka-ai/m3e-base](https://huggingface.co/moka-ai/m3e-base))。
第一期的模型 [M3E models](https://huggingface.co/moka-ai/m3e-base) 已经开源在了 HuggingFace 上,在中文文本分类和文本检索上都优于 `openai text-embedding-ada-002`,详请请参考 [M3E models README](https://huggingface.co/moka-ai/m3e-base/blob/main/README.md)。

## 使用 M3E

Expand Down Expand Up @@ -136,4 +136,4 @@ python scripts/train_medi.py --help
```bash
python scripts/train_medi.py <model_path_or_name> <data_file>
```
其中 `<model_path_or_name>` 可以是 HuggingFace 上的模型名称,比如 `sentence-transformers/gtr-t5-large` ,也可以是本地的模型路径, `<data_file>` 是训练数据 json 的路径,比如 `medi.json`。 `medi.json` 可以在 [instructor-embedding](https://instructor-embedding.github.io/) 的项目中下载。
其中 `<model_path_or_name>` 可以是 HuggingFace 上的模型名称,比如 `sentence-transformers/gtr-t5-large` ,也可以是本地的模型路径, `<data_file>` 是训练数据 json 的路径,比如 `medi.json`。 `medi.json` 可以在 [instructor-embedding](https://instructor-embedding.github.io/) 的项目中下载。