Volumetric and Multi-View CNNs for Object Classification on 3D Data

04/12/2016
by   Charles R. Qi, et al.
0

3D shape models are becoming widely available and easier to capture, making available 3D information crucial for progress in object classification. Current state-of-the-art methods rely on CNNs to address this problem. Recently, we witness two types of CNNs being developed: CNNs based upon volumetric representations versus CNNs based upon multi-view representations. Empirical results from these two types of CNNs exhibit a large gap, indicating that existing volumetric CNN architectures and approaches are unable to fully exploit the power of 3D representations. In this paper, we aim to improve both volumetric CNNs and multi-view CNNs according to extensive analysis of existing approaches. To this end, we introduce two distinct network architectures of volumetric CNNs. In addition, we examine multi-view CNNs, where we introduce multi-resolution filtering in 3D. Overall, we are able to outperform current state-of-the-art methods for both volumetric CNNs and multi-view CNNs. We provide extensive experiments designed to evaluate underlying design choices, thus providing a better understanding of the space of methods available for object classification on 3D data.

READ FULL TEXT

page 6

page 12

page 13

page 14

research
05/30/2022

Neural Volumetric Object Selection

We introduce an approach for selecting objects in neural volumetric 3D r...
research
02/07/2018

A Spatial Mapping Algorithm with Applications in Deep Learning-Based Structure Classification

Convolutional Neural Network (CNN)-based machine learning systems have m...
research
09/18/2017

Wide and deep volumetric residual networks for volumetric image classification

3D shape models that directly classify objects from 3D information have ...
research
12/01/2021

3DVNet: Multi-View Depth Prediction and Volumetric Refinement

We present 3DVNet, a novel multi-view stereo (MVS) depth-prediction meth...
research
05/25/2022

VTP: Volumetric Transformer for Multi-view Multi-person 3D Pose Estimation

This paper presents Volumetric Transformer Pose estimator (VTP), the fir...
research
06/11/2019

iProStruct2D: Identifying protein structural classes by deep learning via 2D representations

In this paper we address the problem of protein classification starting ...
research
06/17/2022

TAVA: Template-free Animatable Volumetric Actors

Coordinate-based volumetric representations have the potential to genera...

Please sign up or login with your details

Forgot password? Click here to reset