Point-Voxel CNN for Efficient 3D Deep Learning

07/08/2019
by   Zhijian Liu, et al.
0

We present Point-Voxel CNN (PVCNN) for efficient, fast 3D deep learning. Previous work processes 3D data using either voxel-based or point-based NN models. However, both approaches are computationally inefficient. The computation cost and memory footprints of the voxel-based models grow cubically with the input resolution, making it memory-prohibitive to scale up the resolution. As for point-based networks, up to 80 structuring the irregular data which have rather poor memory locality, not on the actual feature extraction. In this paper, we propose PVCNN that represents the 3D input data in points to reduce the memory consumption, while performing the convolutions in voxels to largely reduce the irregular data access and improve the locality. Our PVCNN model is both memory and computation efficient. Evaluated on semantic and part segmentation datasets, it achieves much higher accuracy than the voxel-based baseline with 10x GPU memory reduction; it also outperforms the state-of-the-art point-based models with 7x measured speedup on average. Remarkably, narrower version of PVCNN achieves 2x speedup over PointNet (an extremely efficient model) on part and scene segmentation benchmarks with much higher accuracy. We validate the general effectiveness of our PVCNN on 3D object detection: by replacing the primitives in Frustrum PointNet with PVConv, it outperforms Frustrum PointNet++ by 2.4 with 1.5x measured speedup and GPU memory reduction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/30/2021

Multi Voxel-Point Neurons Convolution (MVPConv) for Fast and Accurate 3D Deep Learning

We present a new convolutional neural network, called Multi Voxel-Point ...
research
07/28/2021

Multi Point-Voxel Convolution (MPVConv) for Deep Learning on Point Clouds

The existing 3D deep learning methods adopt either individual point-base...
research
04/25/2022

PVNAS: 3D Neural Architecture Search with Point-Voxel Convolution

3D neural networks are widely used in real-world applications (e.g., AR/...
research
07/31/2020

Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution

Self-driving cars need to understand 3D scenes efficiently and accuratel...
research
05/16/2022

PillarNet: Real-Time and High-Performance Pillar-based 3D Object Detection

Real-time and high-performance 3D object detection is of critical import...
research
07/17/2023

Ada3D : Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection

Voxel-based methods have achieved state-of-the-art performance for 3D ob...
research
03/17/2022

FUSED-PAGERANK: Loop-Fusion based Approximate PageRank

PageRank is a graph centrality metric that gives the importance of each ...

Please sign up or login with your details

Forgot password? Click here to reset