A Dataset for Developing and Benchmarking Active Vision

02/27/2017
by   Phil Ammirato, et al.
0

We present a new public dataset with a focus on simulating robotic vision tasks in everyday indoor environments using real imagery. The dataset includes 20,000+ RGB-D images and 50,000+ 2D bounding boxes of object instances densely captured in 9 unique scenes. We train a fast object category detector for instance detection on our data. Using the dataset we show that, although increasingly accurate and fast, the state of the art for object detection is still severely impacted by object scale, occlusion, and viewing direction all of which matter for robotics applications. We next validate the dataset for simulating active vision, and use the dataset to develop and evaluate a deep-network-based system for next best move prediction for object classification using reinforcement learning. Our dataset is available for download at cs.unc.edu/ ammirato/active_vision_dataset_website/.

READ FULL TEXT

page 3

page 6

research
09/26/2016

Multiview RGB-D Dataset for Object Instance Detection

This paper presents a new multi-view RGB-D dataset of nine kitchen scene...
research
10/03/2019

360-Indoor: Towards Learning Real-World Objects in 360° Indoor Equirectangular Images

While there are several widely used object detection datasets, current c...
research
07/12/2020

EAGLE: Large-scale Dataset for Vehicle Detection in Aerial Imagery

Multi-class vehicle detection from airborne imagery with orientation est...
research
03/10/2023

DAVIS-Ag: A Synthetic Plant Dataset for Developing Domain-Inspired Active Vision in Agricultural Robots

In agricultural environments, viewpoint planning can be a critical funct...
research
11/11/2020

Learning from THEODORE: A Synthetic Omnidirectional Top-View Indoor Dataset for Deep Transfer Learning

Recent work about synthetic indoor datasets from perspective views has s...
research
06/29/2023

Evaluation of Environmental Conditions on Object Detection using Oriented Bounding Boxes for AR Applications

The objective of augmented reality (AR) is to add digital content to nat...
research
12/09/2018

A Comparison of Embedded Deep Learning Methods for Person Detection

Recent advancements in parallel computing, GPU technology and deep learn...

Please sign up or login with your details

Forgot password? Click here to reset