colpali-document-retrieval-vision-language-models-cloud is buggy #977

suhjohn · 2024-11-20T22:05:36Z

suhjohn · 2024-11-20T22:08:47Z


for pdf in sample_pdfs:
    page_embeddings = []
    dataloader = DataLoader(
        pdf["images"],
        batch_size=2,
        shuffle=False,
        collate_fn=lambda x: process_images(processor, x),
    )
    for batch_doc in tqdm(dataloader):
        batch_doc = {k: v.to(device) for k, v in batch_doc.items()}
        with torch.no_grad():
            with torch.cuda.amp.autocast():
                embeddings_doc = model(**batch_doc)
        page_embeddings.extend(list(torch.unbind(embeddings_doc.cpu())))
    pdf["embeddings"] = page_embeddings

fix

thomasht86 · 2024-11-21T13:40:04Z

Thanks for reporting!
Sorry about that - our tests run without GPU, so we didn't catch that one.
I will add a PR with fix.

thomasht86 self-assigned this Nov 21, 2024

thomasht86 mentioned this issue Nov 21, 2024

Thomasht86/update colpali notebooks #979

Merged

3 tasks

thomasht86 closed this as completed Nov 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

colpali-document-retrieval-vision-language-models-cloud is buggy #977

colpali-document-retrieval-vision-language-models-cloud is buggy #977

suhjohn commented Nov 20, 2024

suhjohn commented Nov 20, 2024

thomasht86 commented Nov 21, 2024

colpali-document-retrieval-vision-language-models-cloud is buggy #977

colpali-document-retrieval-vision-language-models-cloud is buggy #977

Comments

suhjohn commented Nov 20, 2024

suhjohn commented Nov 20, 2024

thomasht86 commented Nov 21, 2024