Implementation of RetinaNet from Focal Loss for Dense Object Detection paper in TensorFlow

Differences from the original paper

For some reason this architecture is extremely hard to train, loss gets stuck at early stages of training, predicting everything as a background (probably to the fact that i am using small batch size). To overcome this problem I tried different initialization schemes, backbone architectures and losses.

You can choose densenet, resnext or mobilenet_v2 as a backbone architecture.

Observations

Training focal loss on a simple synthetic dataset does work but poorly
To overcome small batch size problem one might try training on multiple GPUs or using Group Normalization.
Probably the most important observation is that using Group Normalization instead of Batch Normalization gives significantly better results (when trained on small batches)

Current setup which gives at least some result:

Training on a single Titan X with 1 image per batch (can't fit into memory anything larger with 500 image scale)
MobileNetV2 as a backbone, with 500 image scale (can't fit into memory anything larger)
Not using Focal Loss, I am sure I will get back to it once I will find out why it is so hard to train it as it is described in a paper.
Using combination of balanced cross-entropy and dice loss.
Using Group Normalization.

Notes

Interestingly, other open source implementations I have found on github all using much lower learning rate (1e-4, 1e-5) and/or gradient clipping.

Name		Name	Last commit message	Last commit date
Latest commit History 604 Commits
data		data
data_loaders		data_loaders
.gitignore		.gitignore
README.md		README.md
augmentation.py		augmentation.py
augmentation_test.py		augmentation_test.py
dataset.py		dataset.py
dataset_test.py		dataset_test.py
debug_input.py		debug_input.py
densenet.py		densenet.py
densenet_test.py		densenet_test.py
download_weights.sh		download_weights.sh
levels.py		levels.py
levels_test.py		levels_test.py
losses.py		losses.py
losses_test.py		losses_test.py
mobilenet_v2.py		mobilenet_v2.py
model.py		model.py
normalization.py		normalization.py
resnet.py		resnet.py
resnet_test.py		resnet_test.py
retinanet.py		retinanet.py
retinanet_old_test.py		retinanet_old_test.py
retinanet_test.py		retinanet_test.py
train.py		train.py
utils.py		utils.py
utils_test.py		utils_test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Implementation of RetinaNet from Focal Loss for Dense Object Detection paper in TensorFlow

Differences from the original paper

Observations

Current setup which gives at least some result:

Notes

About

Releases

Packages

Languages

vshmyhlo/retinanet-tensorflow

Folders and files

Latest commit

History

Repository files navigation

Implementation of RetinaNet from Focal Loss for Dense Object Detection paper in TensorFlow

Differences from the original paper

Observations

Current setup which gives at least some result:

Notes

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages