Goal-conditioned Batch Reinforcement Learning for Rotation Invariant Locomotion

04/17/2020
by   Aditi Mavalankar, et al.
0

We propose a novel approach to learn goal-conditioned policies for locomotion in a batch RL setting. The batch data is collected by a policy that is not goal-conditioned. For the locomotion task, this translates to data collection using a policy learnt by the agent for walking straight in one direction, and using that data to learn a goal-conditioned policy that enables the agent to walk in any direction. The data collection policy used should be invariant to the direction the agent is facing i.e. regardless of its initial orientation, the agent should take the same actions to walk forward. We exploit this property to learn a goal-conditioned policy using two key ideas: (1) augmenting data by generating trajectories with the same actions in different directions, and (2) learning an encoder that enforces invariance between these rotated trajectories with a Siamese framework. We show that our approach outperforms existing RL algorithms on 3-D locomotion agents like Ant, Humanoid and Minitaur.

READ FULL TEXT

page 2

page 9

research
03/04/2019

Sim-to-Real Transfer for Biped Locomotion

We present a new approach for transfer of dynamic robot control policies...
research
03/20/2023

Imitating Graph-Based Planning with Goal-Conditioned Policies

Recently, graph-based planning algorithms have gained much attention to ...
research
07/15/2019

Towards Robust Direction Invariance in Character Animation

In character animation, direction invariance is a desirable property. Th...
research
02/20/2020

Learning to Walk in the Real World with Minimal Human Effort

Reliable and stable locomotion has been one of the most fundamental chal...
research
07/07/2023

Goal-Conditioned Predictive Coding as an Implicit Planner for Offline Reinforcement Learning

Recent work has demonstrated the effectiveness of formulating decision m...
research
05/31/2022

Human-AI Shared Control via Frequency-based Policy Dissection

Human-AI shared control allows human to interact and collaborate with AI...
research
05/18/2023

From Data-Fitting to Discovery: Interpreting the Neural Dynamics of Motor Control through Reinforcement Learning

In motor neuroscience, artificial recurrent neural networks models often...

Please sign up or login with your details

Forgot password? Click here to reset