Finite State Machine Policies Modulating Trajectory Generator

09/26/2021
by   Ren Liu, et al.
0

Deep reinforcement learning (deep RL) has emerged as an effective tool for developing controllers for legged robots. However, a simple neural network representation is known for its poor extrapolation ability, making the learned behavior vulnerable to unseen perturbations or challenging terrains. Therefore, researchers have investigated a novel architecture, Policies Modulating Trajectory Generators (PMTG), which combines trajectory generators (TG) and feedback control signals to achieve more robust behaviors. In this work, we propose to extend the PMTG framework with a finite state machine PMTG by replacing simple TGs with asynchronous finite state machines (Async FSMs). This invention offers an explicit notion of contact events to the policy to negotiate unexpected perturbations. We demonstrated that the proposed architecture could achieve more robust behaviors in various scenarios, such as challenging terrains or external perturbations, on both simulated and real robots. The supplemental video can be found at: http://youtu.be/XUiTSZaM8f0.

READ FULL TEXT

page 1

page 6

research
10/07/2019

Policies Modulating Trajectory Generators

We propose an architecture for learning complex controllable behaviors b...
research
03/11/2021

Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning

Deep reinforcement learning has emerged as a popular and powerful way to...
research
05/25/2019

Adversarial Policies: Attacking Deep Reinforcement Learning

Deep reinforcement learning (RL) policies are known to be vulnerable to ...
research
10/24/2018

Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks

Contact-rich manipulation tasks in unstructured environments often requi...
research
03/22/2018

Neuronal Circuit Policies

We propose an effective way to create interpretable control agents, by r...
research
04/01/2023

Convergent iLQR for Safe Trajectory Planning and Control of Legged Robots

In order to perform highly dynamic and agile maneuvers, legged robots ty...
research
11/01/2022

CPG-RL: Learning Central Pattern Generators for Quadruped Locomotion

In this letter, we present a method for integrating central pattern gene...

Please sign up or login with your details

Forgot password? Click here to reset