Code for ANetQA baselines

This repository contains code for our baselines, namely HCRN, ClipBERT, and All-in-one, which is migrated from their original implementations to fit our data structure.

Dataset Preparation

see dataset for details.

Model Setup, Training, and Testing

see the HCRN, ClipBERT, and All-in-one folders for details.

Results

The above baseline models are trained on the train set and evaluated on the val,test-dev and test sets, respectively.

model	val set	test-tiny	test-dev	test	weights
HCRN	41.69	41.57	41.18	41.13	ckpt
ClipBERT	44.34	43.90	44.00	43.91	ckpt
All-in-one	45.44	44.27	44.57	44.53	ckpt

License

This project is licensed under the Apache License 2.0.

Citation

If you use ANetQA in your research, we appreciate it if you cite our paper in the following.

@inproceedings{yu2023anetqa,
title={ANetQA: A Large-scale Benchmark for Fine-grained Compositional Reasoning over Untrimmed Videos},
   author={Yu, Zhou and Zheng, Lixiang and Zhao, Zhou and Wu, Fei and Fan, Jianping and Ren, Kui and Yu, Jun},
   booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
   year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 107 Commits
all-in-one		all-in-one
clipbert		clipbert
dataset		dataset
hcrn		hcrn
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Code for ANetQA baselines

Dataset Preparation

Model Setup, Training, and Testing

Results

License

Citation

About

Releases

Packages

Contributors 3

Languages

License

MILVLG/anetqa-code

Folders and files

Latest commit

History

Repository files navigation

Code for ANetQA baselines

Dataset Preparation

Model Setup, Training, and Testing

Results

License

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages