Learning Stable Manoeuvres in Quadruped Robots from Expert Demonstrations

07/28/2020
by   Sashank Tirumala, et al.
2

With the research into development of quadruped robots picking up pace, learning based techniques are being explored for developing locomotion controllers for such robots. A key problem is to generate leg trajectories for continuously varying target linear and angular velocities, in a stable manner. In this paper, we propose a two pronged approach to address this problem. First, multiple simpler policies are trained to generate trajectories for a discrete set of target velocities and turning radius. These policies are then augmented using a higher level neural network for handling the transition between the learned trajectories. Specifically, we develop a neural network-based filter that takes in target velocity, radius and transforms them into new commands that enable smooth transitions to the new trajectory. This transformation is achieved by learning from expert demonstrations. An application of this is the transformation of a novice user's input into an expert user's input, thereby ensuring stable manoeuvres regardless of the user's experience. Training our proposed architecture requires much less expert demonstrations compared to standard neural network architectures. Finally, we demonstrate experimentally these results in the in-house quadruped Stoch 2.

READ FULL TEXT

page 1

page 6

research
05/30/2022

TaSIL: Taylor Series Imitation Learning

We propose Taylor Series Imitation Learning (TaSIL), a simple augmentati...
research
09/07/2023

Learning from Demonstration via Probabilistic Diagrammatic Teaching

Learning for Demonstration (LfD) enables robots to acquire new skills by...
research
02/13/2023

Imitation from Observation With Bootstrapped Contrastive Learning

Imitation from observation (IfO) is a learning paradigm that consists of...
research
03/09/2022

Learning to control from expert demonstrations

In this paper, we revisit the problem of learning a stabilizing controll...
research
06/10/2020

Bayesian Experience Reuse for Learning from Multiple Demonstrators

Learning from demonstrations (LfD) improves the exploration efficiency o...
research
07/27/2023

Imitating Complex Trajectories: Bridging Low-Level Stability and High-Level Behavior

We propose a theoretical framework for studying the imitation of stochas...
research
06/07/2023

Divide and Repair: Using Options to Improve Performance of Imitation Learning Against Adversarial Demonstrations

We consider the problem of learning to perform a task from demonstration...

Please sign up or login with your details

Forgot password? Click here to reset