Matterport3D: Learning from RGB-D Data in Indoor Environments

09/18/2017
by   Angel Chang, et al.
0

Access to large, diverse RGB-D datasets is critical for training RGB-D scene understanding algorithms. However, existing datasets still cover only a limited number of views or a restricted scale of spaces. In this paper, we introduce Matterport3D, a large-scale RGB-D dataset containing 10,800 panoramic views from 194,400 RGB-D images of 90 building-scale scenes. Annotations are provided with surface reconstructions, camera poses, and 2D and 3D semantic segmentations. The precise global alignment and comprehensive, diverse panoramic set of views over entire buildings enable a variety of supervised and self-supervised computer vision tasks, including keypoint matching, view overlap prediction, normal prediction from color, semantic segmentation, and region classification.

READ FULL TEXT

page 13

page 14

page 15

page 17

page 19

page 20

page 21

page 22

research
02/28/2023

Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors

Current popular backbones in computer vision, such as Vision Transformer...
research
04/07/2017

Learning Where to Look: Data-Driven Viewpoint Set Selection for 3D Scenes

The use of rendered images, whether from completely synthetic datasets o...
research
08/31/2018

Semantic Mapping for Orchard Environments by Merging Two-Sides Reconstructions of Tree Rows

Measuring semantic traits for phenotyping is an essential but labor-inte...
research
02/03/2017

Joint 2D-3D-Semantic Data for Indoor Scene Understanding

We present a dataset of large-scale indoor spaces that provides a variet...
research
06/12/2021

Reverse-engineer the Distributional Structure of Infant Egocentric Views for Training Generalizable Image Classifiers

We analyze egocentric views of attended objects from infants. This paper...
research
01/19/2023

Multiview Compressive Coding for 3D Reconstruction

A central goal of visual recognition is to understand objects and scenes...
research
01/19/2019

The RobotriX: An eXtremely Photorealistic and Very-Large-Scale Indoor Dataset of Sequences with Robot Trajectories and Interactions

Enter the RobotriX, an extremely photorealistic indoor dataset designed ...

Please sign up or login with your details

Forgot password? Click here to reset