
llava-cli: improve llava-cli and the API for using LLaVA #6027

Open
phymbert opened this issue Mar 12, 2024 · 4 comments
Labels
enhancement (New feature or request) · good first issue (Good for newcomers) · help wanted (Extra attention is needed) · llava (LLaVa and multimodal)

Comments

@phymbert
Collaborator

From:

  1. cleaning up the clip/llava libs and improving the API (see the sketch below)
  2. in the old implementation, many internal objects were exposed to the server and the memory management was dubious
  3. there was no obvious path for supporting parallel multimodal slots
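
To make point 1 more concrete, below is a rough sketch of what a cleaned-up interface could look like. These names are hypothetical and do not exist in the current llava.h/clip.h; the sketch only illustrates the direction: opaque handles instead of internal objects leaking into the server, explicit ownership with matching free functions, and a per-sequence decode call that leaves room for parallel multimodal slots.

```c
// Hypothetical sketch only -- none of these symbols exist in the current headers.
#include <stdbool.h>
#include <stdint.h>
#include "llama.h"

struct llava_context;       // opaque: wraps the CLIP encoder + multimodal projector
struct llava_image_tokens;  // opaque: the projected embeddings of a single image

// load / free the multimodal projector; mmap-friendly loading would live behind this call
struct llava_context * llava_init_from_file(const char * mmproj_path, int n_threads);
void                   llava_free(struct llava_context * ctx);

// encode one RGB image into embeddings; the caller owns the result and must free it
struct llava_image_tokens * llava_encode_image(struct llava_context * ctx,
                                               const uint8_t * rgb, int nx, int ny);
void                        llava_image_tokens_free(struct llava_image_tokens * toks);

// append the image embeddings to the given sequence of a llama_context, advancing *n_past;
// one call per slot/sequence is what would make parallel multimodal slots possible
bool llava_decode_image(struct llama_context            * ctx_llama,
                        const struct llava_image_tokens * toks,
                        llama_seq_id                      seq_id,
                        int                               n_batch,
                        int                             * n_past);
```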
phymbert added the "enhancement" label Mar 12, 2024
@phymbert
Collaborator Author

@ggerganov please tell me how I can help with this.

@phymbert
Collaborator Author

Ping @damian0815, as you originally started llava.h.

phymbert mentioned this issue Mar 22, 2024
phymbert added the "llava", "help wanted", and "good first issue" labels Mar 22, 2024
@JoanFM
Contributor

JoanFM commented Jun 11, 2024

Hello,

Is there any progress here? I wonder if I could be of any help.

I think it would be nice to make multimodality much more of a first-class citizen in llama.cpp. I would be interested in supporting the jina-clip-v1 model after the refactoring.

@ngxson
Collaborator

ngxson commented Jun 19, 2024

I've recently been playing around with the current llava implementation.

Currently, a clip model has its own clip_model_load, which does not use mmap. And while clip_image_batch_encode exists and could be used to process parallel slots, it is not used by llava.cpp. One idea I have in mind is to somehow reuse llama_load_model_from_file to load the model and llama_decode to decode batches of patches/images.

But that's only a very rough idea, and probably too complicated to implement at the moment. @ggerganov, what do you think about this?
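
For reference, here is roughly what the current single-image path looks like when the functions mentioned above are stitched together. This is only a sketch: the signatures are the ones in examples/llava/clip.h and llava.h at the time of writing and should be verified against the headers. Note that clip_image_batch_encode is the batch counterpart of the per-image encode, but llava.cpp never calls it, which is part of why there is no obvious path to parallel slots today.

```c
// Sketch of the existing flow: clip_model_load() loads the mmproj (no mmap),
// llava_image_embed_make_with_filename() runs the CLIP encoder + projector, and
// llava_eval_image_embed() pushes the embeddings through llama_decode() in
// n_batch-sized chunks. Signatures per examples/llava at the time -- verify locally.
#include "clip.h"
#include "llava.h"
#include "llama.h"

static bool eval_one_image(struct llama_context * ctx_llama,
                           const char * mmproj_path, const char * image_path,
                           int n_threads, int n_batch, int * n_past) {
    struct clip_ctx * ctx_clip = clip_model_load(mmproj_path, /*verbosity*/ 1);
    if (!ctx_clip) {
        return false;
    }

    struct llava_image_embed * embed =
        llava_image_embed_make_with_filename(ctx_clip, n_threads, image_path);
    if (!embed) {
        clip_free(ctx_clip);
        return false;
    }

    // internally this builds a llama_batch with the `embd` field set and calls
    // llama_decode, which is why reusing llama_decode directly for batches of
    // images/patches does not seem far-fetched
    const bool ok = llava_eval_image_embed(ctx_llama, embed, n_batch, n_past);

    llava_image_embed_free(embed);
    clip_free(ctx_clip);
    return ok;
}
```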
