Syn-Mediverse: A Multimodal Synthetic Dataset for Intelligent Scene Understanding of Healthcare Facilities

08/06/2023
by   Rohit Mohan, et al.
0

Safety and efficiency are paramount in healthcare facilities where the lives of patients are at stake. Despite the adoption of robots to assist medical staff in challenging tasks such as complex surgeries, human expertise is still indispensable. The next generation of autonomous healthcare robots hinges on their capacity to perceive and understand their complex and frenetic environments. While deep learning models are increasingly used for this purpose, they require extensive annotated training data which is impractical to obtain in real-world healthcare settings. To bridge this gap, we present Syn-Mediverse, the first hyper-realistic multimodal synthetic dataset of diverse healthcare facilities. Syn-Mediverse contains over 48000 images from a simulated industry-standard optical tracking camera and provides more than 1.5M annotations spanning five different scene understanding tasks including depth estimation, object detection, semantic segmentation, instance segmentation, and panoptic segmentation. We demonstrate the complexity of our dataset by evaluating the performance on a broad range of state-of-the-art baselines for each task. To further advance research on scene understanding of healthcare facilities, along with the public dataset we provide an online evaluation benchmark available at <http://syn-mediverse.cs.uni-freiburg.de>

READ FULL TEXT

page 1

page 3

page 4

page 7

research
02/04/2022

StandardSim: A Synthetic Dataset For Retail Environments

Autonomous checkout systems rely on visual and sensory inputs to carry o...
research
09/22/2020

PennSyn2Real: Training Object Recognition Models without Human Labeling

Scalability is a critical problem in generating training images for deep...
research
12/15/2016

SceneNet RGB-D: 5M Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth

We introduce SceneNet RGB-D, expanding the previous work of SceneNet to ...
research
03/31/2021

Evaluation of Multimodal Semantic Segmentation using RGB-D Data

Our goal is to develop stable, accurate, and robust semantic scene under...
research
08/10/2021

UniNet: A Unified Scene Understanding Network and Exploring Multi-Task Relationships through the Lens of Adversarial Attacks

Scene understanding is crucial for autonomous systems which intend to op...
research
09/13/2019

MinneApple: A Benchmark Dataset for Apple Detection and Segmentation

In this work, we present a new dataset to advance the state-of-the-art i...
research
09/09/2022

MassMIND: Massachusetts Maritime INfrared Dataset

Recent advances in deep learning technology have triggered radical progr...

Please sign up or login with your details

Forgot password? Click here to reset