[CodeCamp2023-683]Support grounding dino #10907

YanxingLiu · 2023-09-10T12:09:23Z

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

GLIP: Grounded Language-Image Pre-training

Abstract

In this paper, we present an open-set object detector, called Grounding DINO, by marrying Transformer-based detector DINO with grounded pre-training, which can detect arbitrary objects with human inputs such as category names or referring expressions. The key solution of open-set object detection is introducing language to a closed-set detector for open-set concept generalization. To effectively fuse language and vision modalities, we conceptually divide a closed-set detector into three phases and propose a tight fusion solution, which includes a feature enhancer, a language-guided query selection, and a cross-modality decoder for cross-modality fusion. While previous works mainly evaluate open-set object detection on novel categories, we propose to also perform evaluations on referring expression comprehension for objects specified with attributes. Grounding DINO performs remarkably well on all three settings, including benchmarks on COCO, LVIS, ODinW, and RefCOCO/+/g. Grounding DINO achieves a 52.5 AP on the COCO detection zero-shot transfer benchmark, i.e., without any training data from COCO. It sets a new record on the ODinW zero-shot benchmark with a mean 26.1 AP.

Installation

cd $MMDETROOT

# source installation
pip install -r requirements/multimodal.txt

# or mim installation
mim install mmdet[multimodal]

cd $MMDETROOT

wget https://github.com/IDEA-Research/GroundingDINO/releases/download/v0.1.0-alpha/groundingdino_swint_ogc.pth

python projects/GroundingDINO/tools/model_converters/groundingdino_to_mmdet.py \
		groundingdino_swint_ogc.pth \
		weights/groundingdino_swint_ogc_mmdet.pth
# this script will generate a model $WEIGHT_FILE in $MMDETROOT

python demo/image_demo.py \
	demo/demo.jpg \
	projects/GroundingDINO/configs/groundingdino/groundingdino_swin-t.py \
	--weights  $WEIGHT_FILE \
	--texts 'bench . car .'

Results and Models

Model	backbone	COCO mAP	Pre-Train Data	Config	Download
Grounding DINO-T	Swin-T	48.5	O365,GoldG,Cap4M	config	model
Grounding DINO-B	Swin-B	56.9	COCO,O365,GoldG,Cap4M,OpenImage,ODinW-35,RefCOCO	config	model

Note:

The weights corresponding to the zero-shot model are adopted from the official weights and converted using the script. We have not retrained the model for the time being.

projects/GroundingDINO/README.md

projects/GroundingDINO/configs/groundingdino/groundingdino_swin-t.py

projects/GroundingDINO/groundingdino/detectors/grounding_dino.py

projects/GroundingDINO/groundingdino/layers/fuse_modules.py

projects/GroundingDINO/groundingdino/layers/grounding_dino_layers.py

projects/GroundingDINO/configs/groundingdino/groundingdino_swin-t.py

configs/grounding_dino/README.md

configs/grounding_dino/grounding_dino_swin-t_pretrain_obj365_goldg_cap4m.py

mmdet/models/detectors/grounding_dino.py

mmdet/models/utils/vlfuse_helper.py

configs/grounding_dino/README.md

mmdet/models/detectors/grounding_dino.py

mmdet/models/utils/vlfuse_helper.py

CDchenlin · 2023-09-23T11:59:50Z

Does the grouding DINO support finetune?

YanxingLiu · 2023-09-25T06:45:46Z

We did not test the training phase as the original code was not open for training related content. If you want to try to fine-tune it, you may need to modify some files. There is a pull request you can refer to: #10954. Thank you for your interest.

Co-authored-by: YanxingLiu <[email protected]>

SoulProficiency · 2024-02-27T02:45:46Z

What are the minimum equipment requirements of fine-tunning ground DINO with coco dataset？（FP32&total parameters&batch-size≥32）

YanxingLiu added 4 commits September 4, 2023 16:13

add model converter for grounding dino

890f769

support more layers

363c3c9

fix some typf

f6426d5

support grounding dino

b654aab

mm-assistant bot assigned ZwwWayne Sep 10, 2023

hhaAndroid reviewed Sep 11, 2023

View reviewed changes

YanxingLiu and others added 11 commits September 14, 2023 11:43

modify file names

e59ae0a

update some variable names

df3f630

add some docstrings

68f2090

fix some typo

66919bc

modify vlfuse_helper to support Grounding DINO

c815ca4

fix some typo

fff0377

fix some typo

fb136d2

move grounding dino from projects to mmdet package

f20f0c2

fix pre-commit error

42489b9

merge GroundingDinoBertModel and BertModel

d1a9fa4

fix some typo

6253f80

hhaAndroid reviewed Sep 16, 2023

View reviewed changes

YanxingLiu added 3 commits September 16, 2023 16:41

fix some typo

3448675

fix a bug of grounding dino

b112ac7

add metafile.yml

0e24ee3

hhaAndroid reviewed Sep 18, 2023

View reviewed changes

mmdet/models/detectors/grounding_dino.py Show resolved Hide resolved

hhaAndroid reviewed Sep 18, 2023

View reviewed changes

mmdet/models/utils/vlfuse_helper.py Outdated Show resolved Hide resolved

YanxingLiu added 3 commits September 18, 2023 10:35

merge latest dev-3.x

9b53b11

fix some bugs

a41dcc1

add a docstring

8ea0da4

hhaAndroid approved these changes Sep 18, 2023

View reviewed changes

hhaAndroid merged commit 073626f into open-mmlab:dev-3.x Sep 18, 2023
1 of 2 checks passed

YanxingLiu changed the title ~~Support grounding dino~~ [CodeCamp2023-683]Support grounding dino Sep 18, 2023

yumion pushed a commit to yumion/mmdetection that referenced this pull request Jan 31, 2024

Support grounding dino (open-mmlab#10907)

70ba167

Co-authored-by: YanxingLiu <[email protected]>

yumion pushed a commit to yumion/mmdetection that referenced this pull request Jan 31, 2024

Support grounding dino (open-mmlab#10907)

0240152

Co-authored-by: YanxingLiu <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CodeCamp2023-683]Support grounding dino #10907

[CodeCamp2023-683]Support grounding dino #10907

YanxingLiu commented Sep 10, 2023

CDchenlin commented Sep 23, 2023 •

edited

Loading

YanxingLiu commented Sep 25, 2023 •

edited

Loading

SoulProficiency commented Feb 27, 2024

[CodeCamp2023-683]Support grounding dino #10907

[CodeCamp2023-683]Support grounding dino #10907

Conversation

YanxingLiu commented Sep 10, 2023

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

Abstract

Installation

Results and Models

CDchenlin commented Sep 23, 2023 • edited Loading

YanxingLiu commented Sep 25, 2023 • edited Loading

SoulProficiency commented Feb 27, 2024

CDchenlin commented Sep 23, 2023 •

edited

Loading

YanxingLiu commented Sep 25, 2023 •

edited

Loading