Multi-view Self-supervised Deep Learning for 6D Pose Estimation in the Amazon Picking Challenge

09/29/2016
by Andy Zeng, et al.

Robot warehouse automation has attracted significant interest in recent years, perhaps most visibly in the Amazon Picking Challenge (APC). A fully autonomous warehouse pick-and-place system requires robust vision that reliably recognizes and locates objects despite cluttered environments, self-occlusions, sensor noise, and a large variety of objects. In this paper we present an approach that leverages multi-view RGB-D data and self-supervised, data-driven learning to overcome those difficulties. The approach was part of the MIT-Princeton Team system that took 3rd and 4th place in the stowing and picking tasks, respectively, at APC 2016. In the proposed approach, we segment and label multiple views of a scene with a fully convolutional neural network, and then fit pre-scanned 3D object models to the resulting segmentation to obtain the 6D object pose. Training a deep neural network for segmentation typically requires a large amount of training data. We propose a self-supervised method to generate a large labeled dataset without tedious manual segmentation. We demonstrate that our system can reliably estimate the 6D pose of objects under a variety of scenarios. All code, data, and benchmarks are available at http://apc.cs.princeton.edu/
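
As a rough illustration of how per-view segmentations could feed the model-fitting step, the sketch below back-projects pixels labeled as a target object into 3D and merges them across views in a shared frame. It is a minimal sketch, not the authors' released code: it assumes a pinhole camera model and known camera-to-world poses, and every name in it (`backproject`, `transform`, `fuse_object_points`, the `views` dictionary layout) is hypothetical.

```python
def backproject(u, v, depth, fx, fy, cx, cy):
    """Pinhole back-projection of pixel (u, v) with metric depth to camera-frame XYZ."""
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return (x, y, depth)


def transform(point, rotation, translation):
    """Apply a rigid transform; rotation is a 3x3 nested list, translation a 3-vector."""
    return tuple(
        sum(rotation[i][j] * point[j] for j in range(3)) + translation[i]
        for i in range(3)
    )


def fuse_object_points(views, target_label):
    """Collect world-frame 3D points for pixels labeled target_label in any view.

    Each view is a dict (hypothetical layout) with:
      "labels":      2D list of per-pixel class labels (the segmentation output)
      "depth":       2D list of depths, 0 meaning "no reading"
      "intrinsics":  (fx, fy, cx, cy)
      "rotation", "translation": assumed camera-to-world pose
    """
    points = []
    for view in views:
        fx, fy, cx, cy = view["intrinsics"]
        for v, row in enumerate(view["labels"]):
            for u, label in enumerate(row):
                d = view["depth"][v][u]
                if label != target_label or d <= 0:
                    continue  # skip other objects and missing depth
                p_cam = backproject(u, v, d, fx, fy, cx, cy)
                points.append(transform(p_cam, view["rotation"], view["translation"]))
    return points
```

The fused point set is what a pre-scanned object model would then be aligned against to recover rotation and translation, the six degrees of freedom of the pose. The alignment itself is not shown here.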


