A Self-supervised Learning System for Object Detection using Physics Simulation and Multi-view Pose Estimation

03/09/2017
by Chaitanya Mitash, et al.

Object detection has progressed recently thanks to advances in deep learning. Nevertheless, such tools typically require a large amount of training data and significant manual effort to label objects. This limits their applicability in robotics, where solutions must scale to a large number of objects and a variety of conditions. This work proposes an autonomous process for training a Convolutional Neural Network (CNN) for object detection and pose estimation in robotic setups. The focus is on detecting objects placed in cluttered, tight environments, such as a shelf with multiple objects. In particular, given access to 3D object models, several aspects of the environment are physically simulated. The models are placed in physically realistic poses with respect to their environment to generate a labeled synthetic dataset. To further improve object detection, the network self-trains over real images that are labeled using a robust multi-view pose estimation process. The proposed training process is evaluated on several existing datasets and on a dataset collected for this paper with a Motoman robotic arm. Results show that the proposed approach outperforms popular training processes that rely on manual annotation and on synthetic data that is not physically realistic. The key contributions are the incorporation of physical reasoning in the synthetic data generation process and the automation of the annotation process over real images.
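
The abstract does not spell out how a multi-view pose estimate becomes labels for real images. One plausible reading is that, once an object's pose is known in a common world frame, its 3D model can be projected into each camera view to obtain a bounding-box label. The following is a minimal sketch of that projection step only, assuming a pinhole camera with known intrinsics and known camera poses. The function names (`transform`, `project_bbox`) and all parameters are hypothetical and are not taken from the paper.

```python
from typing import Optional, Sequence, Tuple

Vec3 = Tuple[float, float, float]
Mat3 = Sequence[Sequence[float]]  # 3x3 rotation matrix, row-major
Box = Tuple[float, float, float, float]  # (u_min, v_min, u_max, v_max)


def transform(R: Mat3, t: Vec3, p: Vec3) -> Vec3:
    """Apply the rigid transform p -> R @ p + t."""
    x, y, z = (sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3))
    return (x, y, z)


def project_bbox(
    model_points: Sequence[Vec3],
    R_obj: Mat3, t_obj: Vec3,   # assumed object-to-world pose (the pose estimate)
    R_cam: Mat3, t_cam: Vec3,   # assumed world-to-camera pose for this view
    fx: float, fy: float, cx: float, cy: float,  # assumed pinhole intrinsics
) -> Optional[Box]:
    """Project model points into one view and return their 2D bounding box.

    Returns None if no model point lies in front of the camera.
    """
    us, vs = [], []
    for p in model_points:
        x, y, z = transform(R_cam, t_cam, transform(R_obj, t_obj, p))
        if z <= 0.0:
            continue  # point is behind the camera; skip it
        us.append(fx * x / z + cx)
        vs.append(fy * y / z + cy)
    if not us:
        return None
    return (min(us), min(vs), max(us), max(vs))
```

Calling `project_bbox` once per camera view with the same object pose would yield one candidate label per image. Occlusion handling, pose confidence filtering, and the pose estimation itself are outside this sketch.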

research
06/25/2018

Physics-based Scene-level Reasoning for Object Pose Estimation in Clutter

This paper focuses on vision-based pose estimation for multiple rigid ob...
research
06/18/2018

Learning Object Localization and 6D Pose Estimation from Simulation and Weakly Labeled Real Images

This work proposes a process for efficiently training a point-wise objec...
research
09/29/2016

Multi-view Self-supervised Deep Learning for 6D Pose Estimation in the Amazon Picking Challenge

Robot warehouse automation has attracted significant interest in recent ...
research
02/26/2019

An Annotation Saved is an Annotation Earned: Using Fully Synthetic Training for Object Instance Detection

Deep learning methods typically require vast amounts of training data to...
research
04/28/2019

Synthetic Data Generation and Adaption for Object Detection in Smart Vending Machines

This paper presents an improved scheme for the generation and adaption o...
research
09/23/2019

Pose Estimation for Texture-less Shiny Objects in a Single RGB Image Using Synthetic Training Data

In the industrial domain, the pose estimation of multiple texture-less s...
research
05/12/2020

Stillleben: Realistic Scene Synthesis for Deep Learning in Robotics

Training data is the key ingredient for deep learning approaches, but di...
