This repository contains the implementation of the paper "Active Convolution: Learning the Shape of Convolution for Image Classification".
The code is based on Caffe and cuDNN (v5).
You can validate the backpropagation with the test code. Because the layer is not differentiable at lattice points, do not use integer positions when testing. To run the tests:
- Define the "TEST_ACONV_FAST_ENV" macro in aconv_fast_layer.hpp
- > make test
- > ./build/test/test_aconv_fast_layer.testbin
All tests should pass. Before training, don't forget to undefine the TEST_ACONV_FAST_ENV macro and run make again.
An ACU has 4 parameter blobs (weights, biases, x-positions, and y-positions of the synapses). Even if you do not use the bias term, the order of the blobs does not change.
Please refer to the deploy file in models/ACU.
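As a quick sanity check, the snippet below is a minimal pycaffe sketch that prints the shapes of the four parameter blobs in the order described above. The deploy path models/ACU/deploy.prototxt and the layer name conv2 are assumptions; substitute your own.

```python
import caffe

# Path and layer name below are assumptions; replace them with your own.
net = caffe.Net('models/ACU/deploy.prototxt', caffe.TEST)

blobs = net.params['conv2']
print('weights:', blobs[0].data.shape)  # 0: filter weights
print('bias:   ', blobs[1].data.shape)  # 1: bias term (present even if unused)
print('x_pos:  ', blobs[2].data.shape)  # 2: x-positions of the synapses
print('y_pos:  ', blobs[3].data.shape)  # 3: y-positions of the synapses
```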
If you want to define an arbitrary shape of convolution,
- use a non-SQUARE type in aconv_param
- define the number of synapses using the kernel_h and kernel_w parameters in convolution_param
For example, to define a cross-shaped convolution with 4 synapses, you can write the following:
```
...
aconv_param { type: CIRCLE }
convolution_param { num_output: 48 kernel_h: 1 kernel_w: 4 stride: 1 }
...
```
When you use a user-defined shape of convolution, it is better to edit aconv_fast_layer.cpp directly to define the initial positions of the synapses.
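If you prefer not to touch the C++ code, a net-surgery sketch like the one below could overwrite the initial positions through pycaffe instead, relying on the blob order listed above. The prototxt path, the layer name conv2, and the cross-shaped offsets are assumptions for illustration only.

```python
import numpy as np
import caffe

# Prototxt path, layer name, and initial offsets are assumptions for illustration.
net = caffe.Net('models/ACU/train_test.prototxt', caffe.TRAIN)

# Cross-shaped initial positions relative to the kernel center: up, down, left, right.
init_x = np.array([ 0.0, 0.0, -1.0, 1.0], dtype=np.float32)
init_y = np.array([-1.0, 1.0,  0.0, 0.0], dtype=np.float32)

x_blob = net.params['conv2'][2]   # blob index 2: x-positions
y_blob = net.params['conv2'][3]   # blob index 3: y-positions
x_blob.data[...] = init_x.reshape(x_blob.data.shape)
y_blob.data[...] = init_y.reshape(y_blob.data.shape)

# Save and pass this file as the initial weights when launching training.
net.save('acu_initial_positions.caffemodel')
```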
These are the results of a plain ACU network; an example for CIFAR-10 is available in models/ACU.
Network | CIFAR-10 error (%) | CIFAR-100 error (%)
---|---|---
baseline | 8.01 | 27.85
ACU | 7.33 | 27.11
Improvement | +0.68 | +0.74
This shows how the positions change over training iterations.
You can draw the learned positions using the provided IPython script.
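The bundled script is the reference; as a rough idea of what such a plot involves, the matplotlib sketch below reads the position blobs and scatters them. The deploy prototxt path, the caffemodel name, and the layer name conv2 are assumptions.

```python
import caffe
import matplotlib.pyplot as plt

# Deploy prototxt, caffemodel name, and layer name are assumptions; replace them.
net = caffe.Net('models/ACU/deploy.prototxt', 'acu_cifar10.caffemodel', caffe.TEST)

x_pos = net.params['conv2'][2].data.flatten()   # learned x-positions
y_pos = net.params['conv2'][3].data.flatten()   # learned y-positions

plt.scatter(x_pos, y_pos, marker='x')
plt.xlabel('x offset')
plt.ylabel('y offset')
plt.title('Learned synapse positions')
plt.gca().invert_yaxis()   # image coordinates: y increases downward
plt.savefig('learned_positions.png')
```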