Context-Aware Transformer for 3D Point Cloud Automatic Annotation

03/27/2023
by Xiaoyan Qian, et al.

3D automatic annotation has received increased attention because manually annotating 3D point clouds is laborious. However, existing methods are usually complicated, e.g., they rely on pipelined training for 3D foreground/background segmentation, cylindrical object proposals, and point completion. Furthermore, they often overlook the inter-object feature relations that are particularly informative for hard samples in 3D annotation. To this end, we propose a simple yet effective end-to-end Context-Aware Transformer (CAT) as an automated 3D-box labeler that generates precise 3D box annotations from 2D boxes and is trained with a small number of human annotations. We adopt the general encoder-decoder architecture, where the CAT encoder consists of an intra-object encoder (local) and an inter-object encoder (global), performing self-attention along the sequence and batch dimensions, respectively. The former models intra-object interactions among points, and the latter extracts feature relations among different objects, thus boosting scene-level understanding. Via the local and global encoders, CAT can generate high-quality 3D box annotations with a streamlined workflow, allowing it to outperform the existing state of the art by up to 1.79
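The idea of attending along the sequence dimension (points within one object) and then along the batch dimension (across objects) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes single-head attention with no learned projections (queries, keys, and values are all the input), and the names `self_attention` and `local_then_global` as well as the tensor sizes are hypothetical.

```python
import numpy as np

def self_attention(x):
    """Scaled dot-product self-attention over the second-to-last axis.

    x has shape (..., tokens, dim). Assumption: Q = K = V = x, i.e. no
    learned projection matrices, to keep the sketch short.
    """
    d = x.shape[-1]
    scores = x @ np.swapaxes(x, -1, -2) / np.sqrt(d)
    # Subtract the row-wise max for a numerically stable softmax.
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ x

def local_then_global(points):
    """points has shape (num_objects, num_points, dim)."""
    # Intra-object (local): each object's points attend to one another.
    local = self_attention(points)
    # Inter-object (global): swap axes so that objects become the token
    # axis, attend across objects, then restore the original layout.
    swapped = np.swapaxes(local, 0, 1)  # (num_points, num_objects, dim)
    mixed = self_attention(swapped)
    return np.swapaxes(mixed, 0, 1)

rng = np.random.default_rng(0)
feats = rng.normal(size=(4, 16, 8))  # 4 objects, 16 points each, 8 features
out = local_then_global(feats)
print(out.shape)
```

In this sketch the local step never mixes information between objects, while the global step lets every object's features depend on the other objects in the batch, which is the scene-level context the abstract describes.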

research
10/28/2021

3D Object Tracking with Transformer

Feature fusion and similarity computation are two core problems in 3D ob...
research
12/21/2020

3D Object Detection with Pointformer

Feature learning for 3D object detection from point clouds is very chall...
research
07/20/2022

Multimodal Transformer for Automatic 3D Annotation and Object Detection

Despite a growing number of datasets being collected for training 3D obj...
research
08/02/2019

L2G Auto-encoder: Understanding Point Clouds by Local-to-Global Reconstruction with Hierarchical Self-Attention

Auto-encoder is an important architecture to understand point clouds in ...
research
03/08/2023

Full Point Encoding for Local Feature Aggregation in 3D Point Clouds

Point cloud processing methods exploit local point features and global c...
research
04/22/2022

Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds

Dense captioning in 3D point clouds is an emerging vision-and-language t...
research
12/24/2022

MURPHY: Relations Matter in Surgical Workflow Analysis

Autonomous robotic surgery has advanced significantly based on analysis ...
