Learning Dense Visual Descriptors using Image Augmentations for Robot Manipulation Tasks

09/12/2022
by Christian Graf, et al.
We propose a self-supervised training approach for learning view-invariant dense visual descriptors using image augmentations. Unlike existing works, which often require complex datasets such as registered RGBD sequences, we train on an unordered set of RGB images. This allows for learning from a single camera view, e.g., in an existing robotic cell with a fixed-mount camera. We create synthetic views and dense pixel correspondences using data augmentations. We find that our descriptors are competitive with existing methods, despite the simpler data recording and setup requirements. We show that training on synthetic correspondences provides descriptor consistency across a broad range of camera views. We compare against training with geometric correspondences from multiple views and provide ablation studies. We also show a robotic bin-picking experiment in which descriptors learned from a fixed-mount camera are used to define grasp preferences.
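The idea of creating a synthetic view together with dense pixel correspondences can be illustrated with a minimal NumPy sketch. The abstract does not specify which augmentations the authors use, so everything here is an assumption made for illustration: a random affine warp (rotation, scale, shift) stands in for the augmentation, and the function names `random_affine`, `warp_nearest`, and `sample_correspondences` and all parameter defaults are hypothetical. Because the warp matrix is known, the matching pixel in the augmented view follows directly from the source pixel, with no depth data or camera registration involved.

```python
import numpy as np


def random_affine(rng, height, width, max_rot_deg=30.0,
                  scale_range=(0.8, 1.2), max_shift=0.1):
    """Sample a 3x3 matrix mapping source pixels (x, y, 1) to the synthetic view.

    All ranges are illustrative defaults, not values from the paper.
    """
    angle = np.deg2rad(rng.uniform(-max_rot_deg, max_rot_deg))
    scale = rng.uniform(*scale_range)
    tx = rng.uniform(-max_shift, max_shift) * width
    ty = rng.uniform(-max_shift, max_shift) * height
    cx, cy = (width - 1) / 2.0, (height - 1) / 2.0
    cos_a, sin_a = scale * np.cos(angle), scale * np.sin(angle)
    # Rotate and scale about the image centre, then translate.
    return np.array([
        [cos_a, -sin_a, cx - cos_a * cx + sin_a * cy + tx],
        [sin_a, cos_a, cy - sin_a * cx - cos_a * cy + ty],
        [0.0, 0.0, 1.0],
    ])


def warp_nearest(image, matrix):
    """Warp an image by inverse mapping with nearest-neighbour sampling."""
    h, w = image.shape[:2]
    ys, xs = np.mgrid[0:h, 0:w]
    target = np.stack([xs.ravel(), ys.ravel(), np.ones(h * w)])
    # For every output pixel, look up where it came from in the source.
    source = np.linalg.inv(matrix) @ target
    sx = np.rint(source[0]).astype(int)
    sy = np.rint(source[1]).astype(int)
    valid = (sx >= 0) & (sx < w) & (sy >= 0) & (sy < h)
    flat_in = image.reshape(h * w, -1)
    flat_out = np.zeros_like(flat_in)  # pixels with no source stay black
    flat_out[valid] = flat_in[sy[valid] * w + sx[valid]]
    return flat_out.reshape(image.shape)


def sample_correspondences(matrix, height, width, num_points, rng):
    """Return matching (x, y) pixel pairs: source image vs. synthetic view."""
    xs = rng.integers(0, width, size=num_points)
    ys = rng.integers(0, height, size=num_points)
    src = np.stack([xs, ys, np.ones(num_points)])
    dst = np.rint(matrix @ src)[:2].astype(int)
    # Keep only pairs whose target pixel lands inside the image.
    inside = ((dst[0] >= 0) & (dst[0] < width) &
              (dst[1] >= 0) & (dst[1] < height))
    return np.stack([xs, ys], axis=1)[inside], dst.T[inside]


rng = np.random.default_rng(0)
image = rng.integers(0, 255, size=(32, 48, 3), dtype=np.uint8)
matrix = random_affine(rng, 32, 48)
view = warp_nearest(image, matrix)
src_px, dst_px = sample_correspondences(matrix, 32, 48, 200, rng)
```

In a training loop, pairs such as `src_px` and `dst_px` would act as positive matches for a pixelwise descriptor loss, and photometric changes (for example colour jitter) could be applied to either view without affecting the correspondences. Both points are general practice for this kind of setup rather than details confirmed by the abstract.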

research
03/03/2022

NeRF-Supervision: Learning Dense Object Descriptors from Neural Radiance Fields

Thin, reflective objects such as forks and whisks are common in our dail...
research
11/23/2013

On the Design and Analysis of Multiple View Descriptors

We propose an extension of popular descriptors based on gradient orienta...
research
10/10/2021

Digging Into Self-Supervised Learning of Feature Descriptors

Fully-supervised CNN-based approaches for learning local image descripto...
research
02/16/2021

Supervised Training of Dense Object Nets using Optimal Descriptors for Industrial Robotic Applications

Dense Object Nets (DONs) by Florence, Manuelli and Tedrake (2018) introd...
research
08/24/2022

Cross-Camera View-Overlap Recognition

We propose a decentralised view-overlap recognition framework that opera...
research
08/04/2018

Learning to Align Images using Weak Geometric Supervision

Image alignment tasks require accurate pixel correspondences, which are ...
research
03/15/2022

SISL: Self-Supervised Image Signature Learning for Splicing Detection and Localization

Recent algorithms for image manipulation detection almost exclusively us...
