Implementation of several MobileNet variants, based on https://github.com/marvis/pytorch-mobilenet. ImageNet data is processed as described here

Options Implemented:

Residual Connection: A residual connection around each depthwise separable convolution.
Squeeze-and-Excitation Channel Attention: Based on the Squeeze-and-Excitation paper, a squeeze-and-excite block is added after every depthwise separable convolution.
Group Convolutions: In the 3x3 convolutions of the depthwise separable structures, the group size is increased to 4.
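To make the cost of the group-convolution option concrete, here is a small arithmetic sketch. It assumes "group size" means the number of channels per group, so that an ordinary depthwise convolution has group size 1, and it counts weights only (no bias, no batch norm). The function names and the channel width of 256 are illustrative and not taken from the repository.

```python
def conv3x3_weight_count(channels: int, group_size: int) -> int:
    """Weights in a 3x3 convolution with `channels` inputs and outputs,
    split into groups of `group_size` channels each (bias omitted)."""
    if channels % group_size != 0:
        raise ValueError("channels must be divisible by group_size")
    # Each output channel only sees the `group_size` input channels of its group.
    weights_per_output_channel = group_size * 3 * 3
    return channels * weights_per_output_channel


def pointwise_weight_count(in_channels: int, out_channels: int) -> int:
    """Weights in the 1x1 convolution that follows the 3x3 stage."""
    return in_channels * out_channels


channels = 256  # illustrative width (assumption)
depthwise = conv3x3_weight_count(channels, group_size=1)  # plain depthwise 3x3
grouped = conv3x3_weight_count(channels, group_size=4)    # group size 4
pointwise = pointwise_weight_count(channels, channels)
print(depthwise, grouped, pointwise)  # prints: 2304 9216 65536
```

Under these assumptions the 3x3 stage becomes four times larger with group size 4, but it stays small next to the 1x1 pointwise convolution, which dominates the weight count of the block.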

Learning rate schedule: I use Nesterov momentum and a cosine learning rate schedule starting at LR = 0.05, and train for 90 epochs.
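The schedule can be sketched in a few lines of plain Python. This is a minimal sketch of the standard cosine annealing form, assuming the rate is updated once per epoch, decays to zero at the end, and has no warmup. None of those details are confirmed by this README, and `cosine_lr` is a hypothetical helper name.

```python
import math


def cosine_lr(epoch: int, base_lr: float = 0.05, total_epochs: int = 90) -> float:
    """Cosine-annealed learning rate for a given epoch.

    Assumes per-epoch updates, a final rate of zero, and no warmup.
    """
    return 0.5 * base_lr * (1.0 + math.cos(math.pi * epoch / total_epochs))


# Print the rate at a few points in the 90-epoch run.
for epoch in (0, 30, 45, 60, 90):
    print(f"epoch {epoch:2d}: lr = {cosine_lr(epoch):.4f}")
```

With these assumptions the rate starts at 0.05, reaches 0.025 at the midpoint (epoch 45), and falls to 0 at epoch 90. In PyTorch, `torch.optim.lr_scheduler.CosineAnnealingLR` provides the same functional form.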

Architecture	Explanation	Accuracy (%)
mobilenet	Baseline architecture	71.84
mobilenetg4	Group size of 4 in the 3x3s of the depthwise separable convs	73.184
mobilenetr	Residual connections	71.94
mobilenetra	Residual connections and squeeze-and-excite blocks	73.48
mobilenetrag4	Residual connections, squeeze-and-excite blocks, and group size of 4 in the 3x3s	74.13

Command to train: python main.py -a ARCH -b 256 --cosineLR --lr 0.05 --nesterov /imagenet/, where ARCH is one of the architecture names in the first column of the table above.
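To train every variant in turn, the command can be generated per architecture. The sketch below only builds and prints the command lines. `train_command` is a hypothetical helper, and the flags and the /imagenet/ path are copied from the command above rather than checked against main.py.

```python
import shlex

# Architecture names from the first column of the results table.
ARCHS = ["mobilenet", "mobilenetg4", "mobilenetr", "mobilenetra", "mobilenetrag4"]


def train_command(arch: str, data_dir: str = "/imagenet/") -> list[str]:
    """Build the training command for one architecture as an argument list."""
    if arch not in ARCHS:
        raise ValueError(f"unknown architecture: {arch}")
    return [
        "python", "main.py",
        "-a", arch,
        "-b", "256",
        "--cosineLR",
        "--lr", "0.05",
        "--nesterov",
        data_dir,
    ]


for arch in ARCHS:
    # shlex.join quotes arguments where needed, so the line can be pasted into a shell.
    print(shlex.join(train_command(arch)))
```

The argument list returned by `train_command` can also be passed directly to `subprocess.run` to launch the runs one after another.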
