Point2Seq: Detecting 3D Objects as Sequences

03/25/2022
by   Yujing Xue, et al.
0

We present a simple and effective framework, named Point2Seq, for 3D object detection from point clouds. In contrast to previous methods that normally predict attributes of 3D objects all at once, we expressively model the interdependencies between attributes of 3D objects, which in turn enables a better detection accuracy. Specifically, we view each 3D object as a sequence of words and reformulate the 3D object detection task as decoding words from 3D scenes in an auto-regressive manner. We further propose a lightweight scene-to-sequence decoder that can auto-regressively generate words conditioned on features from a 3D scene as well as cues from the preceding words. The predicted words eventually constitute a set of sequences that completely describe the 3D objects in the scene, and all the predicted sequences are then automatically assigned to the respective ground truths through similarity-based sequence matching. Our approach is conceptually intuitive and can be readily plugged upon most existing 3D-detection backbones without adding too much computational overhead; the sequential decoding paradigm we proposed, on the other hand, can better exploit information from complex 3D scenes with the aid of preceding predicted words. Without bells and whistles, our method significantly outperforms previous anchor- and center-based 3D object detection frameworks, yielding the new state of the art on the challenging ONCE dataset as well as the Waymo Open Dataset. Code is available at <https://github.com/ocNflag/point2seq>.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/13/2021

Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

3D object detection in point clouds is a challenging vision task that be...
research
01/10/2023

Rethinking Voxelization and Classification for 3D Object Detection

The main challenge in 3D object detection from LiDAR point clouds is ach...
research
07/21/2022

Boosting 3D Object Detection via Object-Focused Image Fusion

3D object detection has achieved remarkable progress by taking point clo...
research
12/01/2021

FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection

Recently, promising applications in robotics and augmented reality have ...
research
01/03/2023

Semi-Structured Object Sequence Encoders

In this paper we explore the task of modeling (semi) structured object s...
research
05/04/2023

Aligning Bird-Eye View Representation of Point Cloud Sequences using Scene Flow

Low-resolution point clouds are challenging for object detection methods...
research
11/25/2022

Language-Assisted 3D Feature Learning for Semantic Scene Understanding

Learning descriptive 3D features is crucial for understanding 3D scenes ...

Please sign up or login with your details

Forgot password? Click here to reset