SIGVID

The reading list for the Special Interest Group on Visual Information Description

Image Captioning

Level 0

Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)[ICLR 2015 Oral]
Sequence to Sequence -- Video to Text[ICCV 2015]
What value do explicit high level concepts have in vision to language problems?[CVPR 2016]

Level 1

Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images[ICCV 2015]
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention[ICML 2015]
DenseCap: Fully Convolutional Localization Networks for Dense Captioning[CVPR 2016 Oral]
Image Captioning with Deep Bidirectional LSTMs[ACMMM 2016 Oral]

Video Captioning

Early Embedding and Late Reranking for Video Captioning[ACMMM 2016 Grand Challenge Award]
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks[CVPR 2016 Oral]
Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption Generation[best in MSR Video to Language Challenge]

Yugnaynehc/SIGVID