FPS-Net: A Convolutional Fusion Network for Large-Scale LiDAR Point Cloud Segmentation

by Aoran Xiao, et al.

Scene understanding from LiDAR point clouds is essential for safe autonomous driving. A common approach uses spherical projection to map the 3D point cloud into multi-channel 2D images for semantic segmentation. Most existing methods simply stack different point attributes/modalities (e.g., coordinates, intensity, depth) as image channels to increase information capacity, but ignore the distinct characteristics of the point attributes in different channels. We design FPS-Net, a convolutional fusion network that exploits the uniqueness and discrepancy among the projected image channels for optimal point cloud segmentation. FPS-Net adopts an encoder-decoder structure. Instead of stacking the channel images into a single input, we group them into modalities, first learn modality-specific features separately, and then map the learned features into a common high-dimensional feature space for pixel-level fusion and learning. Specifically, we design a residual dense block with multiple receptive fields as the encoder's building block; it preserves detailed information in each modality and learns hierarchical modality-specific and fused features effectively. In the decoder, a recurrent convolution block likewise hierarchically decodes the fused features into the output space for pixel-level classification. Extensive experiments on two widely adopted point cloud datasets show that FPS-Net achieves superior semantic segmentation compared with state-of-the-art projection-based methods. In addition, the proposed modality fusion idea is compatible with typical projection-based methods and can be incorporated into them with consistent performance improvements.
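
To make the projection step concrete, here is a minimal NumPy sketch of spherical projection followed by a split of the resulting channels into modality groups. It is an illustration only, not the authors' implementation. The function names, the 64 x 2048 image size, the +3/-25 degree vertical field of view, the nearest-point rule for pixel collisions, and the grouping into coordinates, depth, and intensity are all assumptions; the abstract does not specify them.

```python
import numpy as np


def spherical_projection(points, intensity, height=64, width=2048,
                         fov_up_deg=3.0, fov_down_deg=-25.0):
    """Project (N, 3) points into a (5, H, W) image: x, y, z, depth, intensity.

    Image size and field of view are assumed values, not taken from the paper.
    """
    fov_up = np.radians(fov_up_deg)
    fov_down = np.radians(fov_down_deg)
    fov = fov_up - fov_down

    depth = np.linalg.norm(points, axis=1)
    keep = depth > 0  # drop points at the sensor origin
    points, intensity, depth = points[keep], intensity[keep], depth[keep]

    yaw = np.arctan2(points[:, 1], points[:, 0])  # horizontal angle
    pitch = np.arcsin(points[:, 2] / depth)       # vertical angle

    # Map angles to pixel coordinates (column u, row v).
    u = 0.5 * (1.0 - yaw / np.pi) * width
    v = (1.0 - (pitch - fov_down) / fov) * height
    u = np.clip(np.floor(u), 0, width - 1).astype(np.int64)
    v = np.clip(np.floor(v), 0, height - 1).astype(np.int64)

    # When several points fall on one pixel, keep the nearest one:
    # sort by depth, then take the first occurrence of each pixel index.
    order = np.argsort(depth)
    linear = v[order] * width + u[order]
    _, first = np.unique(linear, return_index=True)
    sel = order[first]

    feats = np.column_stack([points, depth, intensity]).astype(np.float32)
    image = np.zeros((5, height, width), dtype=np.float32)
    image[:, v[sel], u[sel]] = feats[sel].T
    return image


def split_modalities(image):
    """Split the stacked channels into per-modality groups (assumed grouping)."""
    return {
        "coordinates": image[0:3],  # x, y, z
        "depth": image[3:4],
        "intensity": image[4:5],
    }
```

A stacking-based method would feed the full 5-channel `image` to one network. Under the fusion idea described above, each group returned by `split_modalities` would instead go through its own feature extractor before the learned features are fused.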



LU-Net: An Efficient Network for 3D LiDAR Point Cloud Semantic Segmentation Based on End-to-End-Learned 3D Features and U-Net

We propose LU-Net -- for LiDAR U-Net, a new method for the semantic segm...

Multi Projection Fusion for Real-time Semantic Segmentation of 3D LiDAR Point Clouds

Semantic segmentation of 3D point cloud data is essential for enhanced h...

S3Net: 3D LiDAR Sparse Semantic Segmentation Network

Semantic Segmentation is a crucial component in the perception systems o...

Pyramid Deep Fusion Network for Two-Hand Reconstruction from RGB-D Images

Accurately recovering the dense 3D mesh of both hands from monocular ima...

Adaptive Channel Encoding Transformer for Point Cloud Analysis

Transformer plays an increasingly important role in various computer vis...

Road Segmentation with Image-LiDAR Data Fusion

Robust road segmentation is a key challenge in self-driving research. Th...

A Unified Point-Based Framework for 3D Segmentation

3D point cloud segmentation remains challenging for structureless and te...
