On the Expressivity of Random Features in CNNs - TF 2.3 (Community) #9174
Only BatchNorm
This repository is an unofficial implementation of the following [Paper].
Description/Abstract
Batch normalization (BatchNorm) has become an indispensable tool for training
deep neural networks, yet it is still poorly understood. Although previous work
has typically focused on studying its normalization component, BatchNorm also
adds two per-feature trainable parameters—a coefficient and a bias—whose role
and expressive power remain unclear. To study this question, we investigate the
performance achieved when training only these parameters and freezing all others
at their random initializations. We find that doing so leads to surprisingly high
performance. For example, sufficiently deep ResNets reach 82% (CIFAR-10) and
32% (ImageNet, top-5) accuracy in this configuration, far higher than when training
an equivalent number of randomly chosen parameters elsewhere in the network.
BatchNorm achieves this performance in part by naturally learning to disable
around a third of the random features. Not only do these results highlight the
under-appreciated role of the affine parameters in BatchNorm, but—in a broader
sense—they characterize the expressive power of neural networks constructed
simply by shifting and rescaling random features.
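In Keras terms, the scheme above amounts to freezing every layer except BatchNormalization, whose trainable variables are exactly the per-feature affine parameters: BatchNorm computes y = gamma * x_hat + beta, and only gamma and beta are updated. A minimal sketch, assuming a flat (non-nested) tf.keras model; the helper name is ours, not the repository's:

```python
import tensorflow as tf

def freeze_all_but_batchnorm(model):
    """Train only the BatchNorm affine parameters (gamma and beta);
    every other weight stays at its random initialization."""
    for layer in model.layers:
        # BatchNorm's trainable variables are exactly gamma and beta;
        # the moving mean/variance are non-trainable statistics.
        layer.trainable = isinstance(layer, tf.keras.layers.BatchNormalization)

# Usage sketch: after freezing, model.trainable_variables holds only
# the gammas and betas. Compile (or re-compile) after changing the flags.
model = tf.keras.applications.ResNet50(
    weights=None, input_shape=(32, 32, 3), classes=10)
freeze_all_but_batchnorm(model)
model.compile(optimizer="sgd",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```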
Key Features
- Training driven by `model.fit`
- Models built from `tf.keras.layers`
- Input pipelines using `tf.data` and `tfds`
- Command-line flags and logging via `absl-py` from abseil.io

Requirements
To install requirements:
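Assuming the repository follows the common convention of shipping a requirements.txt (not confirmed here):

```sh
pip install -r requirements.txt
```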
Results
Image Classification (Only BatchNorm weights)
Dataset

CIFAR10 dataset - 10 classes with 50,000 images in the train set and 10,000 images in the test set.
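As a sketch, CIFAR-10 can be loaded with tfds and batched with tf.data; the preprocessing and batch size below are illustrative choices, not necessarily the repository's exact pipeline:

```python
import tensorflow as tf
import tensorflow_datasets as tfds

# CIFAR-10: 50,000 training and 10,000 test images across 10 classes.
(train_ds, test_ds), info = tfds.load(
    "cifar10", split=["train", "test"], as_supervised=True, with_info=True
)

def preprocess(image, label):
    # Scale pixel values to [0, 1].
    return tf.cast(image, tf.float32) / 255.0, label

AUTOTUNE = tf.data.experimental.AUTOTUNE  # tf.data.AUTOTUNE from TF 2.4 onward
train_ds = train_ds.map(preprocess).shuffle(10_000).batch(128).prefetch(AUTOTUNE)
test_ds = test_ds.map(preprocess).batch(128).prefetch(AUTOTUNE)
```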
Training

Please run the following command for training.
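A hypothetical invocation; the entry-point name is an assumption, and only the `num_blocks` flag is documented in this README:

```sh
# N = 2 gives ResNet-14, assuming the standard 6N+2 CIFAR ResNet family.
python train.py --num_blocks 2
```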
This trains the OnlyBN model for the ResNet-14 architecture. Replace `num_blocks` with the appropriate value of N from the results table above to train the corresponding ResNet architecture.

Evaluation
Please run the following command for evaluation.
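Again a hypothetical invocation; the script name and flags are assumptions:

```sh
python evaluate.py --num_blocks 2
```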
References

[Paper] Frankle, J., Schwab, D. J., & Morcos, A. S. (2020). Training BatchNorm and Only BatchNorm: On the Expressive Power of Random Features in CNNs. arXiv preprint arXiv:2003.00152.
Citation
If you want to cite this repository in your research paper, please use the following information.
Authors or Maintainers

License

This project is licensed under the terms of the Apache License 2.0.