Faster R-CNN for Signature and Annotation detection using Keras

The original code of Keras version of Faster R-CNN I used was written by yhenon (resource link: GitHub .) He used the PASCAL VOC 2007, 2012, and MS COCO datasets. I applied configs different from his work to fit my dataset and I removed unuseful code.

Project Structure

Use the sign_detection_train_vgg.ipynb file to train on any dataset of your choice. Define the annotaion and bounding box coordinates in the annotaion.txt file. It uses a VGG 16 model. For future scope You can add RESNET and other models. Use the sign_detection_test_vgg.ipynb file to test your images. During Training we keep updating and saving the weights, incase of any system failure or power cut, our trained data would still be saved to the nearest epoch.

Requirements

python 3.6+ Link to download and install (https://www.python.org/downloads/)
You will need jupyter notebook to open the .ipynb files. (On command line type pip install jupyter)
Tensorflow(if you have gpu the you could use a tensorflow-gpu version). (On command line type pip install tensorflow or tensorflow-gpu)
Keras (On command line type pip install keras)
Numpy (On command line type pip install numpy)
Pandas (On command line type pip install pandas)

Introduction to Faster-RCNN

Faster R-CNN has two networks: region proposal network (RPN) for generating region proposals and a network using these proposals to detect objects. The main difference here with Fast R-CNN is that the later uses selective search to generate region proposals. The time cost of generating region proposals is much smaller in RPN than selective search, when RPN shares the most computation with the object detection network. Briefly, RPN ranks region boxes (called anchors) and proposes the ones most likely containing objects.

Regional Purpose Network

The output of a region proposal network (RPN) is a bunch of boxes/proposals that will be examined by a classifier and regressor to eventually check the occurrence of objects. To be more precise, RPN predicts the possibility of an anchor being background or foreground, and refine the anchor.

Classifier

The first step of training a classifier is make a training dataset. The training data is the anchors we get from the above process and the ground-truth boxes. The problem we need to solve here is how we use the ground-truth boxes to label the anchors. The basic idea here is that we want to label the anchors having the higher overlaps with ground-truth boxes as foreground, the ones with lower overlaps as background. Apparently, it needs some tweaks and compromise to seperate foreground and background. You can check the details here in the implementation. Now we have labels for the anchors.

References

Fast R-CNN: https://arxiv.org/pdf/1504.08083.pdf
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks: https://arxiv.org/pdf/1506.01497.pdf
py-faster-rcnn: https://github.com/rbgirshick/py-faster-rcnn
A guide to receptive field arithmetic for Convolutional Neural Networks: https://medium.com/@nikasa1889/a-guide-to-receptive-field-arithmetic-for-convolutional-neural-networks-e0f514068807
Region of interest pooling explained: https://blog.deepsense.ai/region-of-interest-pooling-explained/

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
README.md		README.md
annotate.txt		annotate.txt
fasterrcnn.jpeg		fasterrcnn.jpeg
model_vgg_config.pickle		model_vgg_config.pickle
record.csv		record.csv
sign_detection_test_vgg.ipynb		sign_detection_test_vgg.ipynb
sign_detection_train_vgg.ipynb		sign_detection_train_vgg.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Faster R-CNN for Signature and Annotation detection using Keras

Project Structure

Requirements

Introduction to Faster-RCNN

Regional Purpose Network

Classifier

References

About

Releases

Packages

Languages

KushaalShroff/Signature-and-Annotation-detection

Folders and files

Latest commit

History

Repository files navigation

Faster R-CNN for Signature and Annotation detection using Keras

Project Structure

Requirements

Introduction to Faster-RCNN

Regional Purpose Network

Classifier

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages