3D Semantic Segmentation with Submanifold Sparse Convolutional Networks

11/28/2017
by Benjamin Graham, et al.

Convolutional networks are the de-facto standard for analyzing spatio-temporal data such as images, videos, and 3D shapes. Whilst some of this data is naturally dense (e.g., photos), many other data sources are inherently sparse. Examples include 3D point clouds obtained using a LiDAR scanner or RGB-D camera. Standard "dense" implementations of convolutional networks are very inefficient when applied to such sparse data. We introduce new sparse convolutional operations that are designed to process spatially-sparse data more efficiently, and use them to develop spatially-sparse convolutional networks. We demonstrate the strong performance of the resulting models, called submanifold sparse convolutional networks (SSCNs), on two tasks involving semantic segmentation of 3D point clouds. In particular, our models outperform all prior state-of-the-art methods on the test set of a recent semantic segmentation competition.
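The abstract does not spell out the sparse operation itself, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a single scalar feature per voxel, a 3x3x3 filter, and the submanifold rule that an output site is active only where the corresponding input site is active. The function name `submanifold_conv3d` and the dictionary-based storage are hypothetical choices made for illustration.

```python
from itertools import product


def submanifold_conv3d(features, weights, bias=0.0):
    """Toy single-channel 3x3x3 convolution over active voxels only.

    features: dict mapping (x, y, z) -> float for the active voxels.
    weights:  dict mapping (dx, dy, dz) offsets in {-1, 0, 1}^3 -> float.
    Output is computed only at active input sites (assumed submanifold
    rule); inactive neighbours contribute nothing to the sum.
    """
    out = {}
    for (x, y, z) in features:
        acc = bias
        for (dx, dy, dz), w in weights.items():
            neighbour = (x + dx, y + dy, z + dz)
            if neighbour in features:
                acc += w * features[neighbour]
        out[(x, y, z)] = acc
    return out


# Three occupied voxels, e.g. from a voxelized point cloud (made-up values).
active = {(0, 0, 0): 1.0, (1, 0, 0): 2.0, (5, 5, 5): 3.0}

# An all-ones filter, chosen only so the sums are easy to check by hand.
weights = {offset: 1.0 for offset in product((-1, 0, 1), repeat=3)}

result = submanifold_conv3d(active, weights)

# A dense 32^3 grid would visit every cell; the sparse loop visits 3 sites.
dense_sites = 32 ** 3
sparse_sites = len(active)
```

In this toy setting the work scales with the number of occupied voxels rather than with the volume of the grid, which is the kind of saving the abstract attributes to sparse operations. Because the set of active sites is unchanged by the layer, stacking such layers does not make the data denser.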


research
06/05/2017

Submanifold Sparse Convolutional Networks

Convolutional networks are the de-facto standard for analysing spatio-tem...
research
08/02/2018

Sparse and Dense Data with CNNs: Depth Completion and Semantic Segmentation

Convolutional neural networks are designed for dense data, but vision da...
research
05/10/2021

PillarSegNet: Pillar-based Semantic Grid Map Estimation using Sparse LiDAR Data

Semantic understanding of the surrounding environment is essential for a...
research
12/02/2021

Putting 3D Spatially Sparse Networks on a Diet

3D neural networks have become prevalent for many 3D vision tasks includ...
research
07/06/2018

Tangent Convolutions for Dense Prediction in 3D

We present an approach to semantic scene analysis using deep convolution...
research
04/26/2022

Focal Sparse Convolutional Networks for 3D Object Detection

Non-uniformed 3D sparse data, e.g., point clouds or voxels in different ...
research
08/18/2023

Metadata Improves Segmentation Through Multitasking Elicitation

Metainformation is a common companion to biomedical images. However, thi...
