Add Jina Embedding model #52

philipkiely-baseten · 2023-10-26T17:51:18Z

Adds a Truss for the Jina embedding model. I intend to add it to the model library today and publish a blog post on it.

bolasim · 2023-10-26T17:52:57Z

jina-embeddings/jina-embeddings-v2-base-en/model/model.py

+        self._model = None
+
+    def load(self):
+        self._model = AutoModel.from_pretrained(


Bonus: you can use hf_cache in the config with these settings so the cold start is fast out of the box.

bolasim · 2023-10-26T17:53:17Z

jina-embeddings/jina-embeddings-v2-base-en/config.yaml

+environment_variables: {}
+external_package_dirs: []
+model_metadata:
+  example_model_input: {"text": ["I want to eat pasta", "I want to eat pizza"]}


maybe add the max length to the sample so it's obvious that users can edit it.
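To make this suggestion concrete, here is a minimal Python sketch of a handler that reads an optional `max_length` next to `text` in the request. The `max_length` key, the default of 8192, the `embed` helper, and the `FakeEncoder` stand-in are all assumptions for illustration and not the code in this PR; it also assumes the loaded model exposes `encode(texts, max_length=...)`.

```python
from typing import Any, Dict, List

DEFAULT_MAX_LENGTH = 8192  # assumed default; not taken from this PR


def embed(model: Any, model_input: Dict[str, Any]) -> List[List[float]]:
    """Encode model_input["text"], honoring an optional "max_length" override."""
    texts = model_input["text"]
    max_length = int(model_input.get("max_length", DEFAULT_MAX_LENGTH))
    if max_length <= 0:
        raise ValueError("max_length must be a positive integer")
    # Assumption: the loaded model exposes encode(texts, max_length=...).
    vectors = model.encode(texts, max_length=max_length)
    return [[float(x) for x in vector] for vector in vectors]


class FakeEncoder:
    """Hypothetical stand-in for the real model so the sketch runs offline."""

    def __init__(self) -> None:
        self.seen_max_length = None

    def encode(self, texts, max_length):
        self.seen_max_length = max_length
        # One tiny "embedding" per input: its character count, capped at max_length.
        return [[float(min(len(text), max_length))] for text in texts]


sample_input = {
    "text": ["I want to eat pasta", "I want to eat pizza"],
    "max_length": 512,  # present in the sample so users can see it is editable
}
fake = FakeEncoder()
result = embed(fake, sample_input)
```

Putting `max_length` in the sample input, rather than only in the code, is what makes the knob visible to someone copying the example request.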

squidarth

awesome! could you also please add this to the ci.yaml? lmk if you need help getting the lint to pass

bolasim · 2023-10-26T19:48:41Z

jina-embeddings/jina-embeddings-v2-base-en/config.yaml

+environment_variables: {}
+external_package_dirs: []
+hf_cache:
+  - repo_id: jinaai/jina-embeddings-v2-base-en


you probably wanna pin the revision here just like you're doing in the load.

Ah, makes sense, do you know how to do that? Would it just be revision: abcd1234?

https://truss.baseten.co/reference/config#hf-cache-list-item-revision
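For readers following this thread, a small Python sketch of one way to check that the revision pinned under `hf_cache` matches the one passed in `load()`. It works on the parsed config as a plain dict; the `revision` key is inferred from the docs link above, `abcd1234` is the placeholder from the question rather than a real commit hash, and `cached_revision` and `revisions_match` are hypothetical helpers, not part of Truss.

```python
from typing import Any, Dict, Optional

REPO_ID = "jinaai/jina-embeddings-v2-base-en"
REVISION = "abcd1234"  # placeholder from the thread, not a real commit hash


def cached_revision(config: Dict[str, Any], repo_id: str) -> Optional[str]:
    """Return the revision pinned for repo_id under hf_cache, or None if absent."""
    for entry in config.get("hf_cache") or []:
        if entry.get("repo_id") == repo_id:
            return entry.get("revision")
    return None


def revisions_match(config: Dict[str, Any], repo_id: str, load_revision: str) -> bool:
    """True when the hf_cache entry pins the same revision that load() uses."""
    return cached_revision(config, repo_id) == load_revision


# Parsed form of two possible config.yaml fragments (illustrative only).
pinned = {"hf_cache": [{"repo_id": REPO_ID, "revision": REVISION}]}
unpinned = {"hf_cache": [{"repo_id": REPO_ID}]}

print(revisions_match(pinned, REPO_ID, REVISION))
print(revisions_match(unpinned, REPO_ID, REVISION))
```

Keeping the repo id and revision in one place and checking both call sites against it avoids the cache and the loader silently drifting to different model versions.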

philipkiely-baseten · 2023-10-27T17:00:09Z

@bolasim I was having trouble getting the hf_cache working with the revision, and I want to get this live, so I took hf_cache out entirely when I updated to the latest version.

The model is a quarter of a gig, so the download actually only takes about a second (log line below):

Downloading model.safetensors: 275MB [00:01, 264MB/s]
...
Completed model.load() execution in 8419 ms

The whole model.load() takes about 8.4 seconds, which is not great but not terrible. The model is also running faster with the new revision: I'm getting 40-45 seconds for my Shakespeare dataset.
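A quick back-of-the-envelope check of the numbers in the log, as a Python sketch. It assumes the download happens inside model.load() and that the reported 264MB/s is an average over the whole transfer; neither is confirmed by the log itself.

```python
# Figures copied from the log lines quoted above.
model_size_mb = 275    # "Downloading model.safetensors: 275MB"
throughput_mb_s = 264  # "264MB/s"
load_ms = 8419         # "Completed model.load() execution in 8419 ms"

download_s = model_size_mb / throughput_mb_s  # estimated download time
load_s = load_ms / 1000                       # total load time in seconds
download_share = download_s / load_s          # fraction of load spent downloading
remaining_s = load_s - download_s             # time left for everything else

print(f"download: {download_s:.2f} s")
print(f"share of load: {download_share:.0%}")
print(f"rest of load: {remaining_s:.2f} s")
```

Under those assumptions the download is only a small slice of the load, so caching the weights would trim roughly a second and leave most of the cold start untouched.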

bolasim · 2023-10-27T17:09:11Z

Whatever you think is best!

philipkiely-baseten added 2 commits October 26, 2023 12:49

Add Jina Embedding model

e14f0aa

Add Jina Embedding model

f63f7dc

philipkiely-baseten requested review from bolasim and aspctu October 26, 2023 17:51

bolasim approved these changes Oct 26, 2023

lint

598b254

squidarth approved these changes Oct 26, 2023

philipkiely-baseten added 5 commits October 26, 2023 13:00

add to ci job

21ca4a9

add to ci job

e7b6493

add to ci job

44b70ab

Add sample input and benchmark

2ead6f1

linters should be illegal

b6dea63

bolasim reviewed Oct 26, 2023

remove hf cache and update pinned version

6d49cc1

philipkiely-baseten merged commit b8d902e into main Oct 27, 2023
1 check passed

philipkiely-baseten deleted the philip/jina-embed branch December 12, 2023 23:03
