Accessibility-Based Clustering for Efficient Learning of Robot Fall Recovery

09/23/2021
by   Chong Zhang, et al.
0

For the model-free deep reinforcement learning of quadruped fall recovery, the initialization of robot configurations is crucial to the data efficiency and robustness. This work focuses on algorithmic improvements of data efficiency and robustness simultaneously through automatic discovery of initial states, which is achieved by our proposed K-Access algorithm based on accessibility metrics. Specifically, we formulated accessibility metrics to measure the difficulty of transitions between two arbitrary states, and proposed a novel K-Access algorithm for state-space clustering that automatically discovers the centroids of the static-pose clusters based on the accessibility metrics. By using the discovered centroidal static poses as initial states, we improve the data efficiency by reducing the redundant exploration, and enhance the robustness by easy explorations from the centroids to sampled static poses. We studied extensive validation using an 8-DOF quadrupedal robot Bittle. Compared to random initialization, the learning curve of our proposed method converges much faster, requiring only around 60 training episodes. With our method, the robot can successfully recover standing poses in 99.4

READ FULL TEXT

page 1

page 3

page 6

page 7

research
01/22/2019

Robust Recovery Controller for a Quadrupedal Robot using Deep Reinforcement Learning

The ability to recover from a fall is an essential feature for a legged ...
research
03/09/2023

Learning Arm-Assisted Fall Damage Reduction and Recovery for Legged Mobile Manipulators

Adaptive falling and recovery skills greatly extend the applicability of...
research
02/01/2019

Thermal Recovery of Multi-Limbed Robots with Electric Actuators

The problem of finding thermally minimizing configurations of a humanoid...
research
04/28/2021

A Deep Learning Object Detection Method for an Efficient Clusters Initialization

Clustering is an unsupervised machine learning method grouping data samp...
research
07/04/2019

Procedural Generation of Initial States of Sokoban

Procedural generation of initial states of state-space search problems h...

Please sign up or login with your details

Forgot password? Click here to reset