The reading list for Special Interest Group on Visual Information Description
- Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)[ICLR 2015 Oral]
- Sequence to Sequence -- Video to Text[ICCV 2015]
- What value do explicit high level concepts have in vision to language problems?[CVPR 2016]
- Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images[ICCV 2015]
- Show, Attend and Tell: Neural Image Caption Generation with Visual Attention[ICML 2015]
- DenseCap: Fully Convolutional Localization Networks for Dense Captioning[CVPR 2016 Oral]
- Image Captioning with Deep Bidirectional LSTMs[ACMMM 2016 Oral]
- Early Embedding and Late Reranking for Video Captioning[ACMMM 2016 Grand Challenge Award]
- Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks[CVPR 2016 Oral]
- Frame-and Segment-Level Features and Candidate Pool Evaluation for Video Caption Generation[best in MSR Video to Language Challenge]
- VQA: Visual Question Answering[ICCV 2015]