End-to-End Learning of Keypoint Representations for Continuous Control from Images

06/15/2021
by Rinu Boney, et al.

In many control problems that involve vision, optimal controls can be inferred from the locations of the objects in the scene. This information can be represented using keypoints, a list of spatial locations in the input image. Previous work shows that keypoint representations learned during unsupervised pre-training with encoder-decoder architectures can provide good features for control tasks. In this paper, we show that it is possible to learn efficient keypoint representations end-to-end, without the need for unsupervised pre-training, decoders, or additional losses. Our proposed architecture consists of a differentiable keypoint extractor that feeds the coordinates of the estimated keypoints directly to a soft actor-critic agent. The proposed algorithm yields performance competitive with the state of the art on DeepMind Control Suite tasks.
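
To illustrate what a differentiable keypoint extractor can look like, here is a minimal sketch using a spatial soft-argmax, a common way to turn feature maps into coordinates while keeping gradients flowing. The abstract does not specify the extractor, so the choice of soft-argmax, the function name `spatial_soft_argmax`, the `temperature` parameter, and the [-1, 1] coordinate convention are all assumptions made for illustration. In a full system, the feature maps would come from a convolutional encoder and the returned coordinates would be flattened and passed to the actor and critic networks.

```python
import numpy as np


def spatial_soft_argmax(feature_maps, temperature=1.0):
    """Map K feature maps of shape (K, H, W) to K keypoints of shape (K, 2).

    Hypothetical helper: each keypoint is the expected (x, y) position under
    a softmax over the spatial locations of one feature map, with coordinates
    normalised to [-1, 1]. Every step is smooth, so in an autodiff framework
    gradients from the control loss could reach the encoder.
    """
    k, h, w = feature_maps.shape

    # Softmax over all H*W locations of each map (numerically stabilised).
    flat = feature_maps.reshape(k, h * w) / temperature
    flat = flat - flat.max(axis=1, keepdims=True)
    probs = np.exp(flat)
    probs /= probs.sum(axis=1, keepdims=True)
    probs = probs.reshape(k, h, w)

    # Normalised coordinate grids (assumed convention: -1 is left/top).
    xs = np.linspace(-1.0, 1.0, w)
    ys = np.linspace(-1.0, 1.0, h)

    # Expected coordinates from the marginal distributions over columns/rows.
    x = (probs.sum(axis=1) * xs).sum(axis=1)  # marginal over rows: (K, W)
    y = (probs.sum(axis=2) * ys).sum(axis=1)  # marginal over columns: (K, H)
    return np.stack([x, y], axis=1)


# Usage: two 5x5 maps, one with a sharp peak in the top-right corner.
maps = np.zeros((2, 5, 5))
maps[0, 0, 4] = 50.0
keypoints = spatial_soft_argmax(maps)  # shape (2, 2), rows are (x, y)
agent_input = keypoints.reshape(-1)    # flat vector for the actor-critic
```

A sharply peaked map yields a keypoint at the peak, while a flat map yields the image centre; lowering `temperature` makes the estimate behave more like a hard argmax.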


research
02/18/2022

KINet: Keypoint Interaction Networks for Unsupervised Forward Modeling

Object-centric representation is an essential abstraction for physical r...
research
06/19/2019

Unsupervised Learning of Object Keypoints for Perception and Control

The study of object representations in computer vision has primarily foc...
research
06/16/2021

Towards Automatic Actor-Critic Solutions to Continuous Control

Model-free off-policy actor-critic methods are an efficient solution to ...
research
06/02/2023

DocFormerv2: Local Features for Document Understanding

We propose DocFormerv2, a multi-modal transformer for Visual Document Un...
research
04/03/2018

3D Interpreter Networks for Viewer-Centered Wireframe Modeling

Understanding 3D object structure from a single image is an important bu...
research
09/30/2022

An information-theoretic approach to unsupervised keypoint representation learning

Extracting informative representations from videos is fundamental for th...
research
09/21/2022

Long-Lived Accurate Keypoints in Event Streams

We present a novel end-to-end approach to keypoint detection and trackin...
