Skip to content

Latest commit

 

History

History
24 lines (18 loc) · 1.31 KB

readme.md

File metadata and controls

24 lines (18 loc) · 1.31 KB

FashionLOGO

1. Introduction

This repo is an implementation of our paper: "FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings".

2. Data Preparation

Related Datasets

  1. In "datasets/[dataset]" directory, we have prepared query and gallery files for each [dataset].
  2. Download related datasets in datasets collection, for FlickrLogos-32 dataset, you need permission from link.

3. Download Pretrained Model

Our model is released in model, download and put it into "checkpoints" folder.

4. Inference

python3 -m torch.distributed.launch predict.py --model_path checkpoints/your_model_checkpoint