Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models

12/02/2021
by Nitish Srivastava, et al.

Modeling the world can benefit robot learning by providing a rich training signal for shaping an agent's latent state space. However, learning world models in unconstrained environments over high-dimensional observation spaces such as images is challenging. One source of difficulty is the presence of irrelevant but hard-to-model background distractions, together with unimportant visual details of task-relevant entities. We address this issue by learning a recurrent latent dynamics model that contrastively predicts the next observation. This simple model leads to surprisingly robust robotic control even with simultaneous camera, background, and color distractions. We outperform alternatives such as bisimulation methods, which impose state-similarity measures derived from divergence in future reward or future optimal actions. We obtain state-of-the-art results on the Distracting Control Suite, a challenging benchmark for pixel-based robotic control.
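
To make "contrastively predicts the next observation" concrete, here is a minimal sketch of a generic InfoNCE-style loss in NumPy. It is an illustration only, not the paper's implementation: the abstract does not specify the exact objective, encoder, or recurrent model. The function name `info_nce_loss`, the `temperature` value, and the use of cosine similarity are all assumptions. The sketch assumes the dynamics model has already produced one predicted embedding per batch item, and that the other items in the batch serve as negatives.

```python
import numpy as np

def info_nce_loss(predicted, targets, temperature=0.1):
    """Generic InfoNCE loss over a batch (hypothetical helper, not from the paper).

    predicted: (B, D) embeddings predicted from the latent state.
    targets:   (B, D) embeddings of the observed next frames.
    Row i of `targets` is the positive for row i of `predicted`;
    every other row in the batch acts as a negative.
    """
    # L2-normalize so the dot product is a cosine similarity (an assumption).
    p = predicted / np.linalg.norm(predicted, axis=1, keepdims=True)
    t = targets / np.linalg.norm(targets, axis=1, keepdims=True)
    logits = p @ t.T / temperature  # shape (B, B)
    # Numerically stable log-softmax over each row.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # The positive pairs sit on the diagonal.
    return -np.mean(np.diag(log_probs))

# Usage: correctly paired embeddings should score a lower loss than mismatched ones.
rng = np.random.default_rng(0)
x = rng.normal(size=(8, 16))
loss_matched = info_nce_loss(x, x)
loss_shuffled = info_nce_loss(x, np.roll(x, 1, axis=0))
```

Because the loss only asks the model to pick out the correct next observation among alternatives, it does not need to reconstruct every pixel, which is one plausible reading of why such an objective could be less sensitive to background distractions.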

research
06/29/2022

Hidden Parameter Recurrent State Space Models For Changing Dynamics Scenarios

Recurrent State-space models (RSSMs) are highly expressive models for le...
research
06/14/2021

Temporal Predictive Coding For Model-Based Planning In Latent Space

High-dimensional observations are a major challenge in the application o...
research
12/04/2020

Planning from Pixels using Inverse Dynamics Models

Learning task-agnostic dynamics models in high-dimensional observation s...
research
10/27/2021

DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations

Top-performing Model-Based Reinforcement Learning (MBRL) agents, such as...
research
08/18/2020

Heteroscedastic Uncertainty for Robust Generative Latent Dynamics

Learning or identifying dynamics from a sequence of high-dimensional obs...
research
03/10/2020

Active Reward Learning for Co-Robotic Vision Based Exploration in Bandwidth Limited Environments

We present a novel POMDP problem formulation for a robot that must auton...
research
12/04/2020

A data-set of piercing needle through deformable objects for Deep Learning from Demonstrations

Many robotic tasks are still teleoperated since automating them is very ...
