Learning Spatial and Temporal Variations for 4D Point Cloud Segmentation

07/11/2022
by Shi Hanyu, et al.

LiDAR-based 3D scene perception is a fundamental task for autonomous driving. Most state-of-the-art methods for LiDAR-based 3D recognition operate on single-frame 3D point cloud data and ignore temporal information. We argue that temporal information across frames provides crucial knowledge for 3D scene perception, especially in driving scenarios. In this paper, we focus on spatial and temporal variations to better exploit the temporal information across 3D frames. We design a temporal variation-aware interpolation module and a temporal voxel-point refiner to capture the temporal variation in the 4D point cloud. The temporal variation-aware interpolation generates local features from the previous and current frames by capturing spatial coherence and temporal variation information. The temporal voxel-point refiner builds a temporal graph on the 3D point cloud sequences and captures the temporal variation with a graph convolution module. It also transforms the coarse voxel-level predictions into fine point-level predictions. With our proposed modules, the new network TVSN achieves state-of-the-art performance on SemanticKITTI and SemanticPOSS. Specifically, our method achieves 52.5% mIoU on SemanticKITTI (+5.5 over previous approaches) and 63.0% mIoU on SemanticPOSS.
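The results above are reported in mIoU (mean intersection-over-union). As a minimal sketch of how that metric is generally defined, the snippet below computes per-class IoU over point-level labels and averages it. The function name `mean_iou` and the toy labels are illustrative assumptions; this is not the official evaluation code of SemanticKITTI or SemanticPOSS, which typically accumulate counts over a whole dataset split.

```python
def mean_iou(pred, target, num_classes):
    """Average per-class IoU over classes present in pred or target.

    pred, target: equal-length sequences of integer class labels (one per point).
    Hypothetical helper for illustration only.
    """
    ious = []
    for c in range(num_classes):
        # Points where both prediction and ground truth are class c.
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Points where either prediction or ground truth is class c.
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:  # skip classes absent from both pred and target
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: six points, three classes.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
```

On this toy input the per-class IoUs are 1/3, 2/3, and 1/2, so `score` is 0.5.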


research
08/04/2022

TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object Detection

3D object detection using point clouds has attracted increasing attentio...
research
08/25/2023

SVQNet: Sparse Voxel-Adjacent Query Network for 4D Spatio-Temporal LiDAR Semantic Segmentation

LiDAR-based semantic perception tasks are critical yet challenging for a...
research
07/04/2023

SUIT: Learning Significance-guided Information for 3D Temporal Detection

3D object detection from LiDAR point cloud is of critical importance for...
research
12/20/2020

Anchor-Based Spatial-Temporal Attention Convolutional Networks for Dynamic 3D Point Cloud Sequences

Recently, learning based methods for the robot perception from the image...
research
02/28/2022

Spatiotemporal Transformer Attention Network for 3D Voxel Level Joint Segmentation and Motion Prediction in Point Cloud

Environment perception including detection, classification, tracking, an...
research
03/27/2023

NeuralPCI: Spatio-temporal Neural Field for 3D Point Cloud Multi-frame Non-linear Interpolation

In recent years, there has been a significant increase in focus on the i...
research
07/06/2022

Delving into Sequential Patches for Deepfake Detection

Recent advances in face forgery techniques produce nearly visually untra...
