Harmonic Networks: Deep Translation and Rotation Equivariance

12/14/2016
by   Daniel E. Worrall, et al.
0

Translating or rotating an input image should not affect the results of many computer vision tasks. Convolutional neural networks (CNNs) are already translation equivariant: input image translations produce proportionate feature map translations. This is not the case for rotations. Global rotation equivariance is typically sought through data augmentation, but patch-wise equivariance is more difficult. We present Harmonic Networks or H-Nets, a CNN exhibiting equivariance to patch-wise translation and 360-rotation. We achieve this by replacing regular CNN filters with circular harmonics, returning a maximal response and orientation for every receptive field patch. H-Nets use a rich, parameter-efficient and low computational complexity representation, and we show that deep feature maps within the network encode complicated rotational invariants. We demonstrate that our layers are general enough to be used in conjunction with the latest architectures and techniques, such as deep supervision and batch normalization. We also achieve state-of-the-art classification on rotated-MNIST, and competitive results on other benchmark challenges.

READ FULL TEXT

page 3

page 5

page 8

research
11/28/2019

Patch Reordering: a Novel Way to Achieve Rotation and Translation Invariance in Convolutional Neural Networks

Convolutional Neural Networks (CNNs) have demonstrated state-of-the-art ...
research
07/29/2019

On the Realization and Analysis of Circular Harmonic Transforms for Feature Detection

Cartesian-separable realizations of circular-harmonic decompositions for...
research
11/21/2022

RIC-CNN: Rotation-Invariant Coordinate Convolutional Neural Network

In recent years, convolutional neural network has shown good performance...
research
12/29/2016

Rotation equivariant vector field networks

In many computer vision tasks, we expect a particular behavior of the ou...
research
04/21/2022

A case for using rotation invariant features in state of the art feature matchers

The aim of this paper is to demonstrate that a state of the art feature ...
research
11/23/2020

Learnable Gabor modulated complex-valued networks for orientation robustness

Robustness to transformation is desirable in many computer vision tasks,...
research
06/10/2021

Group Equivariant Subsampling

Subsampling is used in convolutional neural networks (CNNs) in the form ...

Please sign up or login with your details

Forgot password? Click here to reset