This project is a clone of Facebook's ResNet implementation, which uses ReLU. It implements training of residual networks from "Parametric Exponential Linear Unit for Deep Convolutional Neural Networks" by Trottier et al. (2016).
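For reference, PELU computes f(x) = (a/b) * x for x >= 0 and a * (exp(x/b) - 1) for x < 0, with learnable parameters a and b kept positive. Below is a rough PyTorch-style sketch of that definition, not the Torch/Lua code used in this repository; the initialization and the clamping used to keep a and b positive are assumptions made for illustration.

```python
import torch
import torch.nn as nn

class PELU(nn.Module):
    """Parametric ELU: f(x) = (a/b)*x for x >= 0, a*(exp(x/b) - 1) for x < 0."""
    def __init__(self):
        super().__init__()
        # Learnable parameters; the paper constrains both to stay positive.
        self.a = nn.Parameter(torch.ones(1))
        self.b = nn.Parameter(torch.ones(1))

    def forward(self, x):
        a = self.a.clamp(min=1e-4)  # one simple way to enforce positivity (assumption)
        b = self.b.clamp(min=1e-4)
        neg = a * torch.expm1(torch.clamp(x, max=0.0) / b)  # negative branch
        pos = (a / b) * x                                    # positive branch
        return torch.where(x >= 0, pos, neg)

# Example: apply PELU to a random feature map.
y = PELU()(torch.randn(2, 16, 8, 8))
```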
See the installation instructions for a step-by-step guide.
- Install Torch on a machine with a CUDA GPU
- Install cuDNN v4 and the Torch cuDNN bindings
If you already have Torch installed, update `nn`, `cunn`, and `cudnn`.
The training commands are available in the following scripts:
- PELU: `experiment.pelu.sh`
- ELU: `experiment.elu.sh`
- ReLU: `experiment.bnrelu.sh`
- PReLU: `experiment.pelu.sh`
- BN+PELU: `experiment.bnpelu.sh`
- BN+ELU: `experiment.bnelu.sh`
Run `python show_results.py` to get the following results (test error, %):
| Results directory | CIFAR-10 mean | CIFAR-10 min | CIFAR-100 mean | CIFAR-100 min |
| --- | --- | --- | --- | --- |
| results/results-pelu-div-mul | 5.5076 | 5.361 | 25.0212 | 24.551 |
| results/results-bn-pelu-div-mul | 6.2442 | 5.85 | 26.0448 | 25.381 |
| results/results-elu | 6.5468 | 5.986 | 26.5896 | 25.078 |
| results/results-bn-elu | 11.1954 | 10.391 | 35.5176 | 34.746 |
| results/results-bn-relu | 5.6738 | 5.41 | 25.9198 | 24.99 |
| results/results-bn-prelu | 5.6054 | 5.361 | 25.8262 | 25.498 |
| results/results-pelu-div-div | 5.7306 | 5.596 | 25.6836 | 25.166 |
| results/results-pelu-mul-div | 6.5136 | 6.006 | 26.3322 | 25.479 |
| results/results-pelu-mul-mul | 6.7362 | 6.123 | 26.2012 | 25.244 |
There are three differences between Facebook's ResNet implementation using ReLU and this implementation using PELU (a rough sketch follows the list):
- We remove Batch Normalization (BN) before the activation function. This was pointed out by Shah et al. (2016) for ELU, where using BN degraded performance.
- We remove the activation function after the skip connection. This was also pointed out by Shah et al. (2016) for ELU.
- We add an activation function before the last average pooling.
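As a purely illustrative reading of changes 1 and 2 (the Torch code in this repository is authoritative), a basic 3x3/3x3 residual block would look roughly as follows in PyTorch-style pseudocode, reusing the PELU module sketched near the top of this README; change 3 is network-level, adding a PELU just before the final average pooling rather than inside each block.

```python
import torch.nn as nn

class PELUBasicBlock(nn.Module):
    """Illustrative only: no BN before the activation, and no activation
    after the skip connection (changes 1 and 2 above)."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.act = PELU()  # PELU module as sketched earlier in this README
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)

    def forward(self, x):
        out = self.conv2(self.act(self.conv1(x)))
        return out + x  # identity shortcut; no activation after the addition
```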
This implementation differs from the ResNet paper in a few ways:
- Scale augmentation: We use the scale and aspect ratio augmentation from Going Deeper with Convolutions, instead of the scale augmentation used in the ResNet paper. We find this gives a better validation error.
- Color augmentation: We use the photometric distortions from Andrew Howard in addition to the AlexNet-style color augmentation used in the ResNet paper.
- Weight decay: We apply weight decay to all weights and biases instead of just the weights of the convolution layers.
- Strided convolution: When using the bottleneck architecture, we use stride 2 in the 3x3 convolution instead of in the first 1x1 convolution (see the sketch below).
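For the last point, here is a PyTorch-style sketch of a bottleneck branch with the downsampling stride moved onto the 3x3 convolution; the conv-BN-ReLU layout shown is the standard one, and the names and channel arguments are illustrative assumptions rather than code from this repository.

```python
import torch.nn as nn

def strided_bottleneck(in_ch, mid_ch, out_ch, stride=2):
    """Bottleneck branch with the spatial stride on the 3x3 convolution
    instead of on the first 1x1 convolution (illustrative sketch)."""
    return nn.Sequential(
        nn.Conv2d(in_ch, mid_ch, kernel_size=1, bias=False),   # 1x1, stride 1
        nn.BatchNorm2d(mid_ch),
        nn.ReLU(inplace=True),
        nn.Conv2d(mid_ch, mid_ch, kernel_size=3, stride=stride,
                  padding=1, bias=False),                       # 3x3 carries the stride
        nn.BatchNorm2d(mid_ch),
        nn.ReLU(inplace=True),
        nn.Conv2d(mid_ch, out_ch, kernel_size=1, bias=False),   # 1x1 expansion
        nn.BatchNorm2d(out_ch),
    )
```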