AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation

06/01/2023
by Yuanwen Yue, et al.

During interactive segmentation, a model and a user work together to delineate objects of interest in a 3D point cloud. In an iterative process, the model assigns each data point to an object (or the background), while the user corrects errors in the resulting segmentation and feeds them back into the model. From a machine learning perspective, the goal is to design the model and the feedback mechanism in a way that minimizes the required user input. The current best practice segments objects one at a time and asks the user to provide positive clicks to indicate regions wrongly assigned to the background and negative clicks to indicate regions wrongly assigned to the object (foreground). Sequentially visiting objects is wasteful, since it disregards synergies between objects: a positive click for a given object can, by definition, serve as a negative click for nearby objects. Moreover, direct competition between adjacent objects can speed up the identification of their common boundary. We introduce AGILE3D, an efficient, attention-based model that (1) supports simultaneous segmentation of multiple 3D objects, (2) yields more accurate segmentation masks with fewer user clicks, and (3) offers faster inference. We encode the point cloud into a latent feature representation, view user clicks as queries, and employ cross-attention to represent contextual relations between different click locations as well as between clicks and the 3D point cloud features. Every time new clicks are added, we only need to run a lightweight decoder that produces updated segmentation masks. In experiments with four different point cloud datasets, AGILE3D sets a new state of the art. We also verify its practicality in real-world setups with a real user study.
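
The click-as-query idea can be illustrated with a toy sketch. This is not the AGILE3D implementation: the function names (`refine_queries`, `segment`), the 2D features, the single attention step, and the dot-product mask scoring are all assumptions made for illustration. It only shows the general pattern described above: point features are computed once, each click becomes a query that attends to those features, and every point is assigned to the object (or background, id 0) whose click queries score highest, so a click for one object implicitly competes with the others.

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def refine_queries(queries, point_feats):
    """One cross-attention step: each click query attends to all point features."""
    scale = math.sqrt(len(point_feats[0]))
    refined = []
    for q in queries:
        weights = softmax([dot(q, f) / scale for f in point_feats])
        context = [sum(w * f[d] for w, f in zip(weights, point_feats))
                   for d in range(len(q))]
        # Residual connection: keep the click's own feature, add scene context.
        refined.append([qi + ci for qi, ci in zip(q, context)])
    return refined

def segment(point_feats, clicks):
    """Toy decoder. clicks is a list of (point_index, object_id); id 0 is background.

    point_feats stands in for the encoder output and is never recomputed here,
    so adding a click only reruns this cheap step.
    """
    queries = refine_queries([list(point_feats[i]) for i, _ in clicks], point_feats)
    labels = []
    for f in point_feats:
        best = {}  # object_id -> highest score among that object's clicks
        for q, (_, obj) in zip(queries, clicks):
            best[obj] = max(best.get(obj, -math.inf), dot(f, q))
        # All objects compete for the point at once.
        labels.append(max(best, key=best.get))
    return labels

# Hypothetical precomputed features for six points (two objects plus background).
feats = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9], [-1.0, -1.0], [-0.9, -0.8]]
before = segment(feats, [(0, 1), (4, 0)])          # one object click, one background click
after = segment(feats, [(0, 1), (4, 0), (2, 2)])   # add a click for a second object
print(before, after)
```

In this sketch the added click for object 2 also removes points from object 1, which mirrors the observation that a positive click for one object acts as a negative click for its neighbours.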


