-
Notifications
You must be signed in to change notification settings - Fork 9.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CodeCamp2023-683]Support grounding dino #10907
[CodeCamp2023-683]Support grounding dino #10907
Conversation
projects/GroundingDINO/configs/groundingdino/groundingdino_swin-t.py
Outdated
Show resolved
Hide resolved
projects/GroundingDINO/configs/groundingdino/groundingdino_swin-t.py
Outdated
Show resolved
Hide resolved
projects/GroundingDINO/configs/groundingdino/groundingdino_swin-t.py
Outdated
Show resolved
Hide resolved
projects/GroundingDINO/configs/groundingdino/groundingdino_swin-t.py
Outdated
Show resolved
Hide resolved
projects/GroundingDINO/groundingdino/detectors/grounding_dino.py
Outdated
Show resolved
Hide resolved
projects/GroundingDINO/groundingdino/layers/grounding_dino_layers.py
Outdated
Show resolved
Hide resolved
projects/GroundingDINO/configs/groundingdino/groundingdino_swin-t.py
Outdated
Show resolved
Hide resolved
projects/GroundingDINO/configs/groundingdino/groundingdino_swin-t.py
Outdated
Show resolved
Hide resolved
configs/grounding_dino/grounding_dino_swin-t_pretrain_obj365_goldg_cap4m.py
Outdated
Show resolved
Hide resolved
configs/grounding_dino/grounding_dino_swin-t_pretrain_obj365_goldg_cap4m.py
Outdated
Show resolved
Hide resolved
configs/grounding_dino/grounding_dino_swin-t_pretrain_obj365_goldg_cap4m.py
Outdated
Show resolved
Hide resolved
Does the grouding DINO support finetune? |
We did not test the training phase as the original code was not open for training related content. If you want to try to fine-tune it, you may need to modify some files. There is a pull request you can refer to: #10954. Thank you for your interest. |
Co-authored-by: YanxingLiu <[email protected]>
Co-authored-by: YanxingLiu <[email protected]>
What are the minimum equipment requirements of fine-tunning ground DINO with coco dataset?(FP32&total parameters&batch-size≥32) |
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
GLIP: Grounded Language-Image Pre-training
Abstract
In this paper, we present an open-set object detector, called Grounding DINO, by marrying Transformer-based detector DINO with grounded pre-training, which can detect arbitrary objects with human inputs such as category names or referring expressions. The key solution of open-set object detection is introducing language to a closed-set detector for open-set concept generalization. To effectively fuse language and vision modalities, we conceptually divide a closed-set detector into three phases and propose a tight fusion solution, which includes a feature enhancer, a language-guided query selection, and a cross-modality decoder for cross-modality fusion. While previous works mainly evaluate open-set object detection on novel categories, we propose to also perform evaluations on referring expression comprehension for objects specified with attributes. Grounding DINO performs remarkably well on all three settings, including benchmarks on COCO, LVIS, ODinW, and RefCOCO/+/g. Grounding DINO achieves a 52.5 AP on the COCO detection zero-shot transfer benchmark, i.e., without any training data from COCO. It sets a new record on the ODinW zero-shot benchmark with a mean 26.1 AP.
Installation
Results and Models
Note: