SafeSteps: Learning Safer Footstep Planning Policies for Legged Robots via Model-Based Priors

07/24/2023
by   Shafeef Omar, et al.
0

We present a footstep planning policy for quadrupedal locomotion that is able to directly take into consideration a-priori safety information in its decisions. At its core, a learning process analyzes terrain patches, classifying each landing location by its kinematic feasibility, shin collision, and terrain roughness. This information is then encoded into a small vector representation and passed as an additional state to the footstep planning policy, which furthermore proposes only safe footstep location by applying a masked variant of the Proximal Policy Optimization (PPO) algorithm. The performance of the proposed approach is shown by comparative simulations on an electric quadruped robot walking in different rough terrain scenarios. We show that violations of the above safety conditions are greatly reduced both during training and the successive deployment of the policy, resulting in an inherently safer footstep planner. Furthermore, we show how, as a byproduct, fewer reward terms are needed to shape the behavior of the policy, which in return is able to achieve both better final performances and sample efficiency

READ FULL TEXT
research
03/05/2022

Safe Reinforcement Learning for Legged Locomotion

Designing control policies for legged locomotion is complex due to the u...
research
12/03/2021

Coupling Vision and Proprioception for Navigation of Legged Robots

We exploit the complementary strengths of vision and proprioception to a...
research
06/10/2019

Data Efficient and Safe Learning for Locomotion via Simplified Model

In this letter, we formulate a novel Markov Decision Process (MDP) for d...
research
12/11/2020

Protective Policy Transfer

Being able to transfer existing skills to new situations is a key capabi...
research
07/19/2019

Footstep Planning for Autonomous Walking Over Rough Terrain

To increase the speed of operation and reduce operator burden, humanoid ...
research
02/26/2020

GLAS: Global-to-Local Safe Autonomy Synthesis for Multi-Robot Motion Planning with End-to-End Learning

We present GLAS: Global-to-Local Autonomy Synthesis, a provably-safe, au...
research
10/08/2018

Safe-To-Explore State Spaces: Ensuring Safe Exploration in Policy Search with Hierarchical Task Optimization

Policy search reinforcement learning allows robots to acquire skills by ...

Please sign up or login with your details

Forgot password? Click here to reset