voyage-multimodal-3

voyage-multimodal-3 is a state-of-the-art multimodal embedding model and a major step toward seamless RAG and semantic search over documents rich in both visuals and text. Unlike existing multimodal embedding models, voyage-multimodal-3 can vectorize interleaved text and images and capture key visual features from screenshots of PDFs, slides, tables, figures, and more, which eliminates the need for complex document parsing. When evaluated against seven competing models across 54 datasets, voyage-multimodal-3 consistently achieves the highest retrieval accuracy.
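To make the "interleaved text and images" idea concrete, here is a minimal retrieval sketch. It assumes the `voyageai` Python package, its `Client.multimodal_embed` method, and an API key in the `VOYAGE_API_KEY` environment variable; none of these are described in this README, so check them against the official Voyage AI documentation. The helper names (`embed`, `cosine`, `rank`) and the file name in the usage comment are hypothetical and only for illustration.

```python
import math


def embed(inputs, input_type):
    """Embed multimodal inputs with voyage-multimodal-3 (assumed API, see lead-in).

    Each input is a list that interleaves strings and PIL images in reading
    order, e.g. ["Quarterly revenue chart:", PIL.Image.open("slide.png")].
    """
    # Imported lazily so the ranking helpers below work without the package.
    import voyageai

    client = voyageai.Client()  # assumption: picks up VOYAGE_API_KEY
    result = client.multimodal_embed(
        inputs=inputs,
        model="voyage-multimodal-3",
        input_type=input_type,  # "query" or "document"
    )
    return result.embeddings


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank(query_vec, doc_vecs):
    """Return document indices ordered from most to least similar to the query."""
    scores = [cosine(query_vec, d) for d in doc_vecs]
    return sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)


# Hypothetical usage (requires network access, an API key, and Pillow):
#   from PIL import Image
#   docs = [["Slide 3: revenue by region", Image.open("slide3.png")]]
#   doc_vecs = embed(docs, input_type="document")
#   query_vec = embed([["Which region grew fastest?"]], input_type="query")[0]
#   print(rank(query_vec, doc_vecs))
```

Because a screenshot is embedded directly alongside its surrounding text, this sketch contains no PDF or slide parsing step; the ranking itself is plain cosine similarity over the returned vectors.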
