Multi-Resolution 3D Convolutional Neural Networks for Object Recognition

05/30/2018
by   Sambit Ghadai, et al.
2

Learning from 3D Data is a fascinating idea which is well explored and studied in computer vision. This allows one to learn from very sparse LiDAR data, point cloud data as well as 3D objects in terms of CAD models and surfaces etc. Most of the approaches to learn from such data are limited to uniform 3D volume occupancy grids or octree representations. A major challenge in learning from 3D data is that one needs to define a proper resolution to represent it in a voxel grid and this becomes a bottleneck for the learning algorithms. Specifically, while we focus on learning from 3D data, a fine resolution is very important to capture key features in the object and at the same time the data becomes sparser as the resolution becomes finer. There are numerous applications in computer vision where a multi-resolution representation is used instead of a uniform grid representation in order to make the applications memory efficient. Though such methods are difficult to learn from, they are much more efficient in representing 3D data. In this paper, we explore the challenges in learning from such data representation. In particular, we use a multi-level voxel representation where we define a coarse voxel grid that contains information of important voxels(boundary voxels) and multiple fine voxel grids corresponding to each significant voxel of the coarse grid. A multi-level voxel representation can capture important features in the 3D data in a memory efficient way in comparison to an octree representation. Consequently, learning from a 3D object with high resolution, which is paramount in feature recognition, is made efficient.

READ FULL TEXT
research
04/03/2017

Hierarchical Surface Prediction for 3D Object Reconstruction

Recently, Convolutional Neural Networks have shown promising results for...
research
02/28/2018

Resolution Improvement of the Common Method for Presentating Arbitrary Space Curves Voxel

The task of voxel resolution for a space curve in video memory of 3D dis...
research
03/28/2017

Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs

We present a deep convolutional decoder architecture that can generate v...
research
11/15/2016

OctNet: Learning Deep 3D Representations at High Resolutions

We present OctNet, a representation for deep learning with sparse 3D dat...
research
10/25/2021

Accelerate 3D Object Processing via Spectral Layout

3D image processing is an important problem in computer vision and patte...
research
11/04/2021

Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image

Inferring 3D locations and shapes of multiple objects from a single 2D i...
research
12/07/2016

Learning Localized Geometric Features Using 3D-CNN: An Application to Manufacturability Analysis of Drilled Holes

3D convolutional neural networks (3D-CNN) have been used for object reco...

Please sign up or login with your details

Forgot password? Click here to reset