Multi-view Convolutional Neural Networks for 3D Shape Recognition

05/05/2015
by   Hang Su, et al.
0

A longstanding question in computer vision concerns the representation of 3D shapes for recognition: should 3D shapes be represented with descriptors operating on their native 3D formats, such as voxel grid or polygon mesh, or can they be effectively represented with view-based descriptors? We address this question in the context of learning to recognize 3D shapes from a collection of their rendered views on 2D images. We first present a standard CNN architecture trained to recognize the shapes' rendered views independently of each other, and show that a 3D shape can be recognized even from a single view at an accuracy far higher than using state-of-the-art 3D shape descriptors. Recognition rates further increase when multiple views of the shapes are provided. In addition, we present a novel CNN architecture that combines information from multiple views of a 3D shape into a single and compact shape descriptor offering even better recognition performance. The same architecture can be applied to accurately recognize human hand-drawn sketches of shapes. We conclude that a collection of 2D views can be highly informative for 3D shape recognition and is amenable to emerging CNN architectures and their derivatives.

READ FULL TEXT
research
07/23/2018

Learning 3D Shapes as Multi-Layered Height-maps using 2D Convolutional Networks

We present a novel global representation of 3D shapes, suitable for the ...
research
08/16/2021

Learning Canonical View Representation for 3D Shape Recognition with Arbitrary Views

In this paper, we focus on recognizing 3D shapes from arbitrary views, i...
research
04/01/2019

Equivariant Multi-View Networks

Several approaches to 3D vision tasks process multiple views of the inpu...
research
06/14/2017

Learning Local Shape Descriptors from Part Correspondences With Multi-view Convolutional Networks

We present a new local descriptor for 3D shapes, directly applicable to ...
research
02/09/2018

Shapes Characterization on Address Event Representation Using Histograms of Oriented Events and an Extended LBP Approach

Address Event Representation is a thriving technology that could change ...
research
11/29/2019

SketchZooms: Deep multi-view descriptors for matching line drawings

Finding point-wise correspondences between images is a long-standing pro...
research
05/31/2018

Classification of volcanic ash particles using a convolutional neural network and probability

Analyses of volcanic ash are typically performed either by qualitatively...

Please sign up or login with your details

Forgot password? Click here to reset