ProgressLabeller: Visual Data Stream Annotation for Training Object-Centric 3D Perception

03/01/2022
by   Xiaotong Chen, et al.

Visual perception tasks often require vast amounts of labelled data, including 3D poses and image-space segmentation masks. Creating such training datasets can be difficult and time-intensive to scale to the point where they are effective for general use. Consider the task of pose estimation for rigid objects. Deep neural network based approaches have shown good performance when trained on large, public datasets. However, adapting these networks to novel objects, or fine-tuning existing models for different environments, requires a significant time investment to generate newly labelled instances. To this end, we propose ProgressLabeller, a method for more efficiently and scalably generating large amounts of 6D pose training data from color image sequences of custom scenes. ProgressLabeller is also intended to support transparent or translucent objects, for which previous methods based on dense depth reconstruction fail. We demonstrate the effectiveness of ProgressLabeller by rapidly creating a dataset of over 1M samples, with which we fine-tune a state-of-the-art pose estimation network to markedly improve downstream robotic grasp success rates. ProgressLabeller will be made publicly available soon.

research
08/01/2020

PERCH 2.0 : Fast and Accurate GPU-based Perception via Search for Object Pose Estimation

Pose estimation of known objects is fundamental to tasks such as robotic...
research
03/02/2022

Object Pose Estimation using Mid-level Visual Representations

This work proposes a novel pose estimation model for object categories t...
research
08/22/2022

TransNet: Category-Level Transparent Object Pose Estimation

Transparent objects present multiple distinct challenges to visual perce...
research
03/02/2022

OVE6D: Object Viewpoint Encoding for Depth-based 6D Object Pose Estimation

This paper proposes a universal framework, called OVE6D, for model-based...
research
11/07/2020

Rapid Pose Label Generation through Sparse Representation of Unknown Objects

Deep Convolutional Neural Networks (CNNs) have been successfully deploye...
research
10/19/2022

MC-hands-1M: A glove-wearing hand dataset for pose estimation

Nowadays, the need for large amounts of carefully and complexly annotate...
research
11/16/2022

RF-Annotate: Automatic RF-Supervised Image Annotation of Common Objects in Context

Wireless tags are increasingly used to track and identify common items o...
