Model-free Reinforcement Learning for Robust Locomotion Using Trajectory Optimization for Exploration

07/14/2021
by Miroslav Bogdanovic, et al.

In this work, we present a general two-stage reinforcement learning approach for going from a single demonstration trajectory to a robust policy that can be deployed on hardware without any additional training. In the first stage, the demonstration serves as a starting point to facilitate initial exploration. In the second stage, the relevant task reward is optimized directly and a policy robust to environment uncertainties is computed. We demonstrate and examine in detail the performance and robustness of our approach on highly dynamic hopping and bounding tasks on a real quadruped robot.
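To make the two-stage structure concrete, the following is a minimal sketch of how the reward could switch between stages. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the reward terms, so the function names, the exponential tracking form, the target height, and the randomized parameters (friction, ground height) are all hypothetical.

```python
import math
import random


def stage1_reward(state, demo_state, scale=5.0):
    """Hypothetical stage-1 reward: track the demonstration.

    Returns 1.0 when the state matches the demonstration sample and
    decays with the squared tracking error. The exponential form and
    the `scale` value are assumptions, not taken from the paper.
    """
    err = sum((s - d) ** 2 for s, d in zip(state, demo_state))
    return math.exp(-scale * err)


def stage2_reward(base_height, target_height=0.4):
    """Hypothetical stage-2 reward: score the task outcome directly.

    Here the task is reduced to reaching a target base height; the
    demonstration no longer appears in the reward.
    """
    return -abs(base_height - target_height)


def sample_environment(rng):
    """Hypothetical environment randomization for the second stage.

    The parameter names and ranges are placeholders for whatever
    uncertainties the policy should be robust to.
    """
    return {
        "friction": rng.uniform(0.5, 1.0),
        "ground_height": rng.uniform(-0.02, 0.02),
    }


def reward_for_stage(stage, state, demo_state, base_height):
    """Select the reward according to the current training stage."""
    if stage == 1:
        return stage1_reward(state, demo_state)
    return stage2_reward(base_height)
```

In this sketch, a training loop would call `reward_for_stage(1, ...)` until the policy can reproduce the demonstrated motion, then continue training the same policy with `reward_for_stage(2, ...)` while drawing a fresh `sample_environment(rng)` for each episode.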


