C3D-UCF11-Tensorflow

A very rudimentary TensorFlow implementation of C3D on the UCF11 video dataset.

Original paper: https://arxiv.org/abs/1412.0767 "Learning Spatiotemporal Features with 3D Convolutional Networks"

This is the final project for RPI's ECSE DL course. Hail Qiang Ji!!!!

The notes below should be enough to give you a taste of an oversimplified C3D implementation.

The code covers both training and testing/plotting. Feel free to modify it (e.g. with tf.train.Saver) for fine-tuning and testing.
Due to memory issues I used FP16 instead of FP32 for data loading. If you have more than 16 GB of memory, feel free to use FP32.
Convergence time for batch size = 5 and epochs = 10 on an RTX 2080: < 20 minutes.
Pickle file for data: https://drive.google.com/drive/u/1/folders/17Ul1bps7ONxQ3Ktt_QlJHnNWfl1tqHSq
Data shape: 10 (batch) x 30 (frames per video sequence) x 64 x 64 x 3 (image size)
The PDF (DL_FP.pdf) summarizes the design choices, dataset splitting, parameter choices, etc. I did not really fine-tune the hyperparameters.
Major changes compared to the model proposed by the original paper: see PDF file
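
To make the FP16 note and the data shape above more concrete, here is a minimal NumPy sketch, not the code from c3d_UCF11.py. It estimates the memory footprint of clips stored as FP16 versus FP32 and shows one way to keep the dataset in FP16 while upcasting each batch to FP32 just before it is fed to the network. The helper names `dataset_bytes` and `iter_batches`, the random stand-in data, and the choice to upcast per batch are all assumptions made for illustration; the layout of the actual pickle file is not described here, so it is not loaded.

```python
import numpy as np

# Per-video shape taken from the README: 30 frames of 64x64 RGB.
FRAMES, HEIGHT, WIDTH, CHANNELS = 30, 64, 64, 3


def dataset_bytes(num_videos, dtype):
    """Bytes needed to hold num_videos clips in memory at the given dtype."""
    per_video = FRAMES * HEIGHT * WIDTH * CHANNELS
    return num_videos * per_video * np.dtype(dtype).itemsize


def iter_batches(videos, labels, batch_size=10):
    """Yield (clips, labels) batches, upcasting the clips to float32 per batch."""
    for start in range(0, len(videos), batch_size):
        stop = start + batch_size
        yield videos[start:stop].astype(np.float32), labels[start:stop]


# Hypothetical stand-in for the pickled data: 25 random clips, 11 classes.
rng = np.random.default_rng(0)
videos = rng.random((25, FRAMES, HEIGHT, WIDTH, CHANNELS), dtype=np.float32)
videos = videos.astype(np.float16)  # stored in half precision to save memory
labels = rng.integers(0, 11, size=25)

# FP32 needs exactly twice the memory of FP16 for the same number of clips.
print("fp16 GiB per 1000 clips:", dataset_bytes(1000, np.float16) / 2**30)
print("fp32 GiB per 1000 clips:", dataset_bytes(1000, np.float32) / 2**30)

for clips, batch_labels in iter_batches(videos, labels):
    print(clips.shape, clips.dtype, batch_labels.shape)
```

The default `batch_size=10` follows the data shape listed above; pass `batch_size=5` to match the timing note. Only the current batch is ever held in FP32, so the peak memory stays close to the FP16 footprint of the full dataset.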
