voyage-multimodal-3
is a state-of-the-art multimodal embedding model and a big step towards seamless RAG and semantic search for documents rich with both visuals and text. Unlike existing multimodal embedding models, voyage-multimodal-3
can vectorize interleaved texts + images and capture key visual features from screenshots of PDFs, slides, tables, figures, and more, thereby eliminating the need for complex document parsing. When evaluated against seven competing models across 54 datasets, voyage-multimodal-3
consistently achieves the highest retrieval accuracy.
-
Notifications
You must be signed in to change notification settings - Fork 0
voyage-ai/voyage-multimodal-3
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published