Anatomy Of High-Performance Deep Learning Convolutions On SIMD Architectures

08/16/2018
by   Evangelos Georganas, et al.
0

Convolution layers are prevalent in many classes of deep neural networks, including Convolutional Neural Networks (CNNs) which provide state-of-the-art results for tasks like image recognition, neural machine translation and speech recognition. The computationally expensive nature of a convolution operation has led to the proliferation of implementations including matrix-matrix multiplication formulation, and direct convolution primarily targeting GPUs. In this paper, we introduce direct convolution kernels for x86 architectures, in particular for Xeon and XeonPhi systems, which are implemented via a dynamic compilation approach. Our JIT-based implementation shows close to theoretical peak performance, depending on the setting and the CPU architecture at hand. We additionally demonstrate how these JIT-optimized kernels can be integrated into a lightweight multi-node graph execution model. This illustrates that single- and multi-node runs yield high efficiencies and high image-throughputs when executing state-of-the-art image recognition tasks on CPUs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/06/2017

Parallel Multi Channel Convolution using General Matrix Multiplication

Convolutional neural networks (CNNs) have emerged as one of the most suc...
research
04/16/2021

Efficient and Generic 1D Dilated Convolution Layer for Deep Learning

Convolutional neural networks (CNNs) have found many applications in tas...
research
05/28/2019

Deep Scale-spaces: Equivariance Over Scale

We introduce deep scale-spaces (DSS), a generalization of convolutional ...
research
09/20/2018

High Performance Zero-Memory Overhead Direct Convolutions

The computation of convolution layers in deep neural networks typically ...
research
10/23/2015

Confusing Deep Convolution Networks by Relabelling

Deep convolutional neural networks have become the gold standard for ima...
research
11/11/2022

Dual Complementary Dynamic Convolution for Image Recognition

As a powerful engine, vanilla convolution has promoted huge breakthrough...
research
03/12/2017

Detection of Human Rights Violations in Images: Can Convolutional Neural Networks help?

After setting the performance benchmarks for image, video, speech and au...

Please sign up or login with your details

Forgot password? Click here to reset