Scan2Part: Fine-grained and Hierarchical Part-level Understanding of Real-World 3D Scans

06/06/2022
by   Alexandr Notchenko, et al.

We propose Scan2Part, a method to segment individual parts of objects in real-world, noisy indoor RGB-D scans. To this end, we vary the part hierarchies of objects in indoor scenes and explore their effect on scene understanding models. Specifically, we use a sparse U-Net-based architecture that captures the fine-scale detail of the underlying 3D scan geometry by leveraging a multi-scale feature hierarchy. To train our method, we introduce the Scan2Part dataset, the first large-scale collection providing detailed semantic labels at the part level in the real-world setting. In total, we provide 242,081 correspondences between 53,618 PartNet parts of 2,477 ShapeNet objects and 1,506 ScanNet scenes, at two spatial resolutions of 2 cm^3 and 5 cm^3. Our method predicts fine-grained per-object part labels, even when the geometry is coarse or partially missing.
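
To make the two ideas in the abstract concrete (part labels at two spatial resolutions, and part hierarchies of varying granularity), here is a minimal Python sketch. It is not the authors' pipeline: it assumes that the 2 cm and 5 cm resolutions refer to voxel edge lengths, that a voxel takes the majority label of the points falling inside it, and that a coarser hierarchy level is obtained by mapping each fine label to a parent. The helper names (`voxelize`, `coarsen`), the `PARENT` table, and the label strings are all hypothetical.

```python
from collections import Counter, defaultdict
from math import floor


def voxelize(points, labels, voxel_size):
    """Group labeled 3D points (in metres) into cubic voxels of edge voxel_size.

    Each occupied voxel receives the most common part label among its points
    (majority vote is an assumption made for this sketch).
    """
    buckets = defaultdict(Counter)
    for (x, y, z), label in zip(points, labels):
        key = (floor(x / voxel_size), floor(y / voxel_size), floor(z / voxel_size))
        buckets[key][label] += 1
    return {key: counts.most_common(1)[0][0] for key, counts in buckets.items()}


# Hypothetical two-level part hierarchy: fine label -> coarser parent label.
PARENT = {
    "chair/leg": "chair/base",
    "chair/wheel": "chair/base",
    "chair/seat": "chair/seat",
}


def coarsen(voxel_labels, parent):
    """Relabel every voxel with the parent of its fine-grained part label."""
    return {key: parent[label] for key, label in voxel_labels.items()}


# Toy scan: four labeled points, coordinates in metres.
points = [(0.01, 0.01, 0.01), (0.03, 0.01, 0.01), (0.031, 0.012, 0.01), (0.21, 0.21, 0.41)]
labels = ["chair/leg", "chair/wheel", "chair/wheel", "chair/seat"]

grid_2cm = voxelize(points, labels, 0.02)  # finer grid keeps leg and wheel apart
grid_5cm = voxelize(points, labels, 0.05)  # coarser grid merges them into one voxel
grid_2cm_coarse_labels = coarsen(grid_2cm, PARENT)
```

In this toy input the 2 cm grid keeps the leg and wheel points in separate voxels, while the 5 cm grid merges them into a single voxel labeled by majority vote, which illustrates why fine-grained part labels become harder to recover as the geometry gets coarser.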
