Neural Caption Generator

make_flickr_dataset.py : Extracts features from the Flickr30k images and saves them to './data/feats.npy'
model.py : TensorFlow Version
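
Before training, it can help to sanity-check the feature file that make_flickr_dataset.py writes. The following is a minimal sketch, not code from this repository: load_feats is a hypothetical helper, and it assumes the saved array is 2-D with one row per image and 4096 columns (the width of the VGG FC7 layer). The demo writes a small dummy file so the snippet runs on its own; point it at './data/feats.npy' to check real features.

```python
import os
import tempfile

import numpy as np

FC7_DIM = 4096  # width of the VGG FC7 layer


def load_feats(path):
    """Hypothetical helper: load a feature matrix and check its shape.

    Assumes the layout (num_images, 4096); this is an assumption about
    the file, not something the repository documents.
    """
    feats = np.load(path)
    if feats.ndim != 2 or feats.shape[1] != FC7_DIM:
        raise ValueError("unexpected feature shape: %r" % (feats.shape,))
    return feats


# Demo on a dummy file with three all-zero feature rows.
# Replace demo_path with './data/feats.npy' to check the real file.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "feats.npy")
    np.save(demo_path, np.zeros((3, FC7_DIM), dtype=np.float32))
    demo_feats = load_feats(demo_path)
    print(demo_feats.shape)
```

If the check raises, the file was probably produced from a different layer or saved in a different layout than assumed here.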

Flickr30k Dataset Download
Extract VGG features of the Flickr30k images (make_flickr_dataset.py)
Train: run train() in model.py
Test: run test() or test_tf() in model.py
Parameters: the VGG FC7 feature of the test image and the path to the trained model
Once you download the TensorFlow VGG net (one of the links below), you don't need Caffe for testing.

Extracted FC7 data: download
This is used in the train() function in model.py. You can skip the feature extraction step by using it.
Pretrained model download
This is used in test() and test_tf() in model.py. If you do not have time for training, or if you just want to check out captioning, download and test the model.
TensorFlow VGG net download
This file is used in test_tf() in model.py.
Along with the files above, you might want to download the Flickr30k annotation data from link
