LinK: Linear Kernel for LiDAR-based 3D Perception

03/28/2023
by   Tao Lu, et al.
0

Extending the success of 2D Large Kernel to 3D perception is challenging due to: 1. the cubically-increasing overhead in processing 3D data; 2. the optimization difficulties from data scarcity and sparsity. Previous work has taken the first step to scale up the kernel size from 3x3x3 to 7x7x7 by introducing block-shared weights. However, to reduce the feature variations within a block, it only employs modest block size and fails to achieve larger kernels like the 21x21x21. To address this issue, we propose a new method, called LinK, to achieve a wider-range perception receptive field in a convolution-like manner with two core designs. The first is to replace the static kernel matrix with a linear kernel generator, which adaptively provides weights only for non-empty voxels. The second is to reuse the pre-computed aggregation results in the overlapped blocks to reduce computation complexity. The proposed method successfully enables each voxel to perceive context within a range of 21x21x21. Extensive experiments on two basic perception tasks, 3D object detection and 3D semantic segmentation, demonstrate the effectiveness of our method. Notably, we rank 1st on the public leaderboard of the 3D detection benchmark of nuScenes (LiDAR track), by simply incorporating a LinK-based backbone into the basic detector, CenterPoint. We also boost the strong segmentation baseline's mIoU with 2.7 available at https://github.com/MCG-NJU/LinK.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/12/2022

ALSO: Automotive Lidar Self-supervision by Occupancy estimation

We propose a new self-supervised method for pre-training the backbone of...
research
09/19/2022

LidarMultiNet: Towards a Unified Multi-task Network for LiDAR Perception

LiDAR-based 3D object detection, semantic segmentation, and panoptic seg...
research
07/20/2022

Fully Sparse 3D Object Detection

As the perception range of LiDAR increases, LiDAR-based 3D object detect...
research
04/06/2023

RFAConv: Innovating Spatital Attention and Standard Convolutional Operation

Spatial attention has been demonstrated to enable convolutional neural n...
research
09/05/2022

LKD-Net: Large Kernel Convolution Network for Single Image Dehazing

The deep convolutional neural networks (CNNs)-based single image dehazin...
research
06/21/2022

Scaling up Kernels in 3D CNNs

Recent advances in 2D CNNs and vision transformers (ViTs) reveal that la...
research
03/24/2022

AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception

Studying the inherent symmetry of data is of great importance in machine...

Please sign up or login with your details

Forgot password? Click here to reset