Towards Learning Convolutions from Scratch

07/27/2020
by Behnam Neyshabur, et al.

Convolution is one of the most essential components of architectures used in computer vision. As machine learning moves towards reducing expert bias and learning it from data, a natural next step seems to be learning convolution-like structures from scratch. This, however, has proven elusive. For example, current state-of-the-art architecture search algorithms use convolution as one of the existing modules rather than learning it from data. In an attempt to understand the inductive bias that gives rise to convolutions, we investigate minimum description length as a guiding principle and show that, in some settings, it can indeed be indicative of the performance of architectures. To find architectures with small description length, we propose β-LASSO, a simple variant of the LASSO algorithm that, when applied to fully-connected networks for image classification tasks, learns architectures with local connections and achieves state-of-the-art accuracies for training fully-connected nets on CIFAR-10 (85.19%), bridging the gap between fully-connected and convolutional nets.
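The abstract does not spell out the β-LASSO update, so the following is only a minimal sketch under a stated assumption: that the variant takes an ordinary L1-regularized (LASSO) subgradient step and then zeroes every weight whose magnitude falls below β times the regularization strength. The function name `beta_lasso_step` and the parameters `lr`, `lam`, and `beta` are hypothetical, chosen for illustration only.

```python
def sign(x):
    """Return -1, 0, or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def beta_lasso_step(weights, grads, lr, lam, beta):
    """One assumed beta-LASSO-style update on a flat list of weights.

    Assumption (not stated in the abstract): an L1 subgradient step
    followed by hard-thresholding at beta * lam.
    """
    updated = []
    for w, g in zip(weights, grads):
        # Subgradient step on loss + lam * |w|.
        w = w - lr * (g + lam * sign(w))
        # Prune weights whose magnitude is below the threshold.
        updated.append(w if abs(w) >= beta * lam else 0.0)
    return updated


# Toy usage: with zero loss gradient, only the L1 shrinkage and the
# threshold (beta * lam = 0.1) act on the weights.
out = beta_lasso_step([0.8, -0.3, 0.05], [0.0, 0.0, 0.0],
                      lr=0.1, lam=0.01, beta=10.0)
```

In this toy call the smallest weight is pruned to zero while the larger two are only shrunk slightly. Under the stated assumption, a larger `beta` prunes more aggressively and so yields sparser connectivity.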


