Equivariant Multi-View Networks

04/01/2019
by Carlos Esteves, et al.

Several approaches to 3D vision tasks process multiple views of the input independently with deep neural networks pre-trained on natural images, and achieve invariance to view permutation through a single round of pooling over all views. We argue that this operation discards important information and leads to subpar global descriptors. In this paper, we propose a group convolutional approach to multi-view aggregation in which convolutions are performed over a discrete subgroup of the rotation group, enabling joint reasoning over all views in an equivariant (rather than invariant) fashion, up to the very last layer. We further develop this idea to operate on smaller discrete homogeneous spaces of the rotation group, where a polar view representation maintains equivariance with only a fraction of the number of input views. We set a new state of the art in several large-scale 3D shape retrieval tasks and show additional applications to panoramic scene classification.
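
The contrast between pooling over views and convolving over a group of views can be illustrated with a small sketch. It is not the paper's implementation: it assumes the simplest possible discrete rotation group, the cyclic group C_n (n views on a ring around one axis), and uses random arrays in place of per-view CNN features. The names cyclic_group_conv and view_pool are hypothetical helpers introduced only for this illustration.

```python
import numpy as np

def cyclic_group_conv(f, h):
    """Group convolution over the cyclic group C_n (an assumed stand-in
    for a discrete subgroup of the rotation group).

    f: per-view features, shape (n, c_in)
    h: filter defined on the group, shape (n, c_in, c_out)
    Returns features on the group, shape (n, c_out), with
    out[g] = sum_k f[k] @ h[(g - k) mod n].
    """
    n = f.shape[0]
    out = np.zeros((n, h.shape[2]))
    for g in range(n):
        for k in range(n):
            out[g] += f[k] @ h[(g - k) % n]
    return out

def view_pool(f):
    """Single round of max pooling over all views (invariant baseline)."""
    return f.max(axis=0)

rng = np.random.default_rng(0)
n_views, c_in, c_out = 12, 8, 4
features = rng.standard_normal((n_views, c_in))        # placeholder view features
filters = rng.standard_normal((n_views, c_in, c_out))  # placeholder group filter

# Rotating the object by a group element cyclically shifts the view order.
shift = 3
rotated = np.roll(features, shift, axis=0)

out = cyclic_group_conv(features, filters)
out_rot = cyclic_group_conv(rotated, filters)

# Equivariance: the output shifts the same way the input did.
equivariant = np.allclose(out_rot, np.roll(out, shift, axis=0))
# Invariance: pooling gives the same descriptor, but discards view order.
invariant = np.allclose(view_pool(rotated), view_pool(features))
# Pooling only after the group convolution still yields an invariant descriptor.
final_invariant = np.allclose(view_pool(out_rot), view_pool(out))
```

In this toy setting the group convolution keeps the relative arrangement of views available to later layers, and invariance is only imposed by the final pooling step. A larger rotation subgroup would replace the modular index arithmetic with a lookup into the group's multiplication table.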

research
07/24/2020

Multi-view adaptive graph convolutions for graph classification

In this paper, a novel multi-view methodology for graph-based neural net...
research
05/05/2015

Multi-view Convolutional Neural Networks for 3D Shape Recognition

A longstanding question in computer vision concerns the representation o...
research
04/19/2017

End-to-End Multi-View Networks for Text Classification

We propose a multi-view network for text classification. Our method auto...
research
08/20/2018

VERAM: View-Enhanced Recurrent Attention Model for 3D Shape Classification

Multi-view deep neural network is perhaps the most successful approach i...
research
08/02/2018

Attentional Aggregation of Deep Feature Sets for Multi-view 3D Reconstruction

We study the problem of recovering an underlying 3D shape from a set of ...
research
11/29/2021

SPIN: Simplifying Polar Invariance for Neural networks Application to vision-based irradiance forecasting

Translational invariance induced by pooling operations is an inherent pr...
