Fast Training of Convolutional Networks through FFTs

12/20/2013
by Michael Mathieu, et al.

Convolutional networks are one of the most widely employed architectures in computer vision and machine learning. In order to leverage their ability to learn complex functions, large amounts of data are required for training. Training a large convolutional network to produce state-of-the-art results can take weeks, even when using modern GPUs. Producing labels using a trained network can also be costly when dealing with web-scale datasets. In this work, we present a simple algorithm which accelerates training and inference by a significant factor, and can yield improvements of over an order of magnitude compared to existing state-of-the-art implementations. This is done by computing convolutions as pointwise products in the Fourier domain while reusing the same transformed feature map many times. The algorithm is implemented on a GPU architecture and addresses a number of related challenges.
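
The core idea in the abstract can be sketched in NumPy on the CPU. The sketch below is illustrative only and is not the authors' GPU implementation. The function names `fft_conv_layer` and `direct_conv_layer`, the array layouts, and the choice of "valid" cross-correlation (the convention most convolutional network libraries use) are assumptions made for this example. Each input feature map is transformed once and reused for every output feature map, and the sum over input maps is taken in the Fourier domain, so only one inverse transform is needed per output map.

```python
import numpy as np

def fft_conv_layer(x, w):
    """Valid cross-correlation of a layer, computed with FFTs.

    x: (f_in, n, n) input feature maps (assumed square for simplicity).
    w: (f_out, f_in, k, k) kernels.
    Returns an array of shape (f_out, n - k + 1, n - k + 1).
    """
    f_in, n, _ = x.shape
    f_out, _, k, _ = w.shape
    # Transform every input map once; the result is reused for all output maps.
    X = np.fft.rfft2(x, s=(n, n))              # (f_in, n, n//2 + 1)
    # Zero-pad each kernel to n x n and transform it once.
    W = np.fft.rfft2(w, s=(n, n))              # (f_out, f_in, n, n//2 + 1)
    # Pointwise product per frequency, summed over input maps.
    # Conjugating W turns circular convolution into circular cross-correlation.
    Y = np.einsum("ihw,oihw->ohw", X, np.conj(W))
    y = np.fft.irfft2(Y, s=(n, n))             # (f_out, n, n)
    # The top-left (n - k + 1) square is free of wrap-around effects.
    return y[:, : n - k + 1, : n - k + 1]

def direct_conv_layer(x, w):
    """Reference implementation with explicit loops, for checking only."""
    f_in, n, _ = x.shape
    f_out, _, k, _ = w.shape
    m = n - k + 1
    y = np.zeros((f_out, m, m))
    for o in range(f_out):
        for i in range(f_in):
            for r in range(m):
                for c in range(m):
                    y[o, r, c] += np.sum(x[i, r:r + k, c:c + k] * w[o, i])
    return y

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal((3, 8, 8))
    w = rng.standard_normal((4, 3, 3, 3))
    print(np.allclose(fft_conv_layer(x, w), direct_conv_layer(x, w)))
```

With `f_in` input maps and `f_out` output maps, the direct method performs `f_in * f_out` spatial convolutions, while the FFT route needs only `f_in` input transforms, `f_in * f_out` kernel transforms, and `f_out` inverse transforms. The remaining pairwise work is a cheap pointwise multiply-accumulate, which is where the reuse described in the abstract pays off.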

