Learning Setup Policies: Reliable Transition Between Locomotion Behaviours

01/23/2021
by   Brendan Tidd, et al.
0

Dynamic platforms that operate over manyunique terrain conditions typically require multiple controllers.To transition safely between controllers, there must be anoverlap of states between adjacent controllers. We developa novel method for training Setup Policies that bridge thetrajectories between pre-trained Deep Reinforcement Learning(DRL) policies. We demonstrate our method with a simulatedbiped traversing a difficult jump terrain, where a single policyfails to learn the task, and switching between pre-trainedpolicies without Setup Policies also fails. We perform anablation of key components of our system, and show thatour method outperforms others that learn transition policies.We demonstrate our method with several difficult and diverseterrain types, and show that we can use Setup Policies as partof a modular control suite to successfully traverse a sequence ofcomplex terrains. We show that using Setup Policies improvesthe success rate for traversing a single difficult jump terrain(from 1.5 asequence of various terrains (from 6.5

READ FULL TEXT

page 1

page 2

page 6

research
11/01/2020

Learning When to Switch: Composing Controllers to Traverse a Sequence of Terrain Artifacts

Legged robots often use separate control policies that are highly engine...
research
10/08/2021

Training Transition Policies via Distribution Matching for Complex Tasks

Humans decompose novel complex tasks into simpler ones to exploit previo...
research
09/12/2021

Direct Random Search for Fine Tuning of Deep Reinforcement Learning Policies

Researchers have demonstrated that Deep Reinforcement Learning (DRL) is ...
research
03/10/2022

Near-optimal Deep Reinforcement Learning Policies from Data for Zone Temperature Control

Replacing poorly performing existing controllers with smarter solutions ...
research
02/25/2023

DeepCPG Policies for Robot Locomotion

Central Pattern Generators (CPGs) form the neural basis of the observed ...
research
09/18/2019

DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning

This paper addresses the problem of legged locomotion in non-flat terrai...

Please sign up or login with your details

Forgot password? Click here to reset