MVMO: A Multi-Object Dataset for Wide Baseline Multi-View Semantic Segmentation

05/30/2022
by Aitor Alvarez-Gila, et al.

We present MVMO (Multi-View, Multi-Object dataset): a synthetic dataset of 116,000 scenes containing randomly placed objects of 10 distinct classes, captured from 25 camera locations in the upper hemisphere. MVMO comprises photorealistic, path-traced image renders, together with semantic segmentation ground truth for every view. Unlike existing multi-view datasets, MVMO features wide baselines between cameras and a high density of objects, which lead to large disparities, heavy occlusions and view-dependent object appearance. Single-view semantic segmentation is hindered by self- and inter-object occlusions, which additional viewpoints could help resolve. We therefore expect that MVMO will propel research in multi-view semantic segmentation and cross-view semantic transfer. We also provide baselines showing that new research is needed in these fields to exploit the complementary information of multi-view setups.
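To make the capture geometry concrete, the following is a minimal sketch of a 25-camera rig on an upper hemisphere. The Fibonacci-lattice placement, the unit radius, and the function names are illustrative assumptions; the abstract does not specify how MVMO positions its cameras, and the view count assumes every scene is rendered from all 25 locations.

```python
import math

NUM_SCENES = 116_000  # from the abstract
NUM_CAMERAS = 25      # from the abstract


def upper_hemisphere_cameras(n=NUM_CAMERAS, radius=1.0):
    """Return n (x, y, z) positions with z > 0 on a sphere of the given radius.

    Assumption: a Fibonacci lattice is used purely for illustration;
    it is not the camera layout of the MVMO dataset.
    """
    golden_angle = math.pi * (3.0 - math.sqrt(5.0))
    positions = []
    for i in range(n):
        z = (i + 0.5) / n              # normalized height in (0, 1)
        r = math.sqrt(1.0 - z * z)     # radius of the horizontal circle at height z
        theta = golden_angle * i       # azimuth advances by the golden angle
        positions.append((radius * r * math.cos(theta),
                          radius * r * math.sin(theta),
                          radius * z))
    return positions


def max_pairwise_baseline(positions):
    """Largest Euclidean distance between any two camera positions."""
    return max(math.dist(a, b)
               for i, a in enumerate(positions)
               for b in positions[i + 1:])


cameras = upper_hemisphere_cameras()
# Assumes every scene is rendered from all 25 camera locations.
total_views = NUM_SCENES * NUM_CAMERAS
widest_baseline = max_pairwise_baseline(cameras)
```

Under these assumptions, cameras near the horizon on opposite sides of the scene are separated by almost the full sphere diameter, which is the kind of wide baseline that produces the large disparities and view-dependent appearance described above.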


