Learning to Brachiate via Simplified Model Imitation

05/08/2022
by   Daniele Reda, et al.
0

Brachiation is the primary form of locomotion for gibbons and siamangs, in which these primates swing from tree limb to tree limb using only their arms. It is challenging to control because of the limited control authority, the required advance planning, and the precision of the required grasps. We present a novel approach to this problem using reinforcement learning, and as demonstrated on a finger-less 14-link planar model that learns to brachiate across challenging handhold sequences. Key to our method is the use of a simplified model, a point mass with a virtual arm, for which we first learn a policy that can brachiate across handhold sequences with a prescribed order. This facilitates the learning of the policy for the full model, for which it provides guidance by providing an overall center-of-mass trajectory to imitate, as well as for the timing of the holds. Lastly, the simplified model can also readily be used for planning suitable sequences of handholds in a given environment. Our results demonstrate brachiation motions with a variety of durations for the flight and hold phases, as well as emergent extra back-and-forth swings when this proves useful. The system is evaluated with a variety of ablations. The method enables future work towards more general 3D brachiation, as well as using simplified model imitation in other settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/11/2022

Delayed Reinforcement Learning by Imitation

When the agent's observations or interactions are delayed, classic reinf...
research
02/19/2019

Analytic Model for Quadruped Locomotion Task-Space Planning

Despite the extensive presence of the legged locomotion in animals, it i...
research
09/27/2021

Solving Challenging Control Problems Using Two-Staged Deep Reinforcement Learning

We present a two-staged deep reinforcement learning algorithm for solvin...
research
03/21/2019

Flying through a narrow gap using neural network: an end-to-end planning and control approach

In this paper, we investigate the problem of enabling a drone to fly thr...
research
04/07/2019

Planning and Execution of Dynamic Whole-Body Locomotion for a Hydraulic Quadruped on Challenging Terrain

We present a framework for dynamic quadrupedal locomotion over challengi...
research
07/10/2019

Robust Humanoid Locomotion Using Trajectory Optimization and Sample-Efficient Learning

Trajectory optimization (TO) is one of the most powerful tools for gener...
research
06/10/2019

Data Efficient and Safe Learning for Locomotion via Simplified Model

In this letter, we formulate a novel Markov Decision Process (MDP) for d...

Please sign up or login with your details

Forgot password? Click here to reset