Reinforcement Learning Control of Robotic Knee with Human in the Loop by Flexible Policy Iteration

06/16/2020
by Xiang Gao, et al.

This study is motivated by a new class of challenging control problems: automatic tuning of robotic knee control parameters with a human in the loop. In addition to the inter-person and intra-person variances inherent in such human-robot systems, the design must account for human user safety and stability, as well as data and time efficiency. By data and time efficiency we mean that learning and adaptation of device configurations take place within a countable number of gait cycles, or within minutes. As solutions to this problem are not readily available, we propose a new policy-iteration-based adaptive dynamic programming algorithm, namely flexible policy iteration (FPI). We show that FPI solves for the control parameters via (weighted) least squares while incorporating data flexibly and utilizing prior knowledge. We provide analyses of stable control policies, of non-increasing value functions that converge to Bellman optimality, and of error bounds on the iterative value functions subject to approximation errors. We extensively evaluated the performance of FPI in OpenSim, a well-established locomotion simulator, under realistic conditions. By comparing FPI with three other comparable algorithms, we demonstrate that FPI is a feasible, data- and time-efficient design approach for adapting the control parameters of the prosthetic knee to co-adapt with the human user, who also exerts control on the prosthesis. As the proposed FPI algorithm does not require stringent constraints or peculiar assumptions, we expect that this reinforcement learning controller can potentially be applied to other challenging adaptive optimal control problems.

research
01/22/2021

Robotic Knee Tracking Control to Mimic the Intact Human Knee Profile Based on Actor-critic Reinforcement Learning

We address a state-of-the-art reinforcement learning (RL) control approa...
research
11/11/2020

A Data-Driven Reinforcement Learning Solution Framework for Optimal and Adaptive Personalization of a Hip Exoskeleton

Robotic exoskeletons are exciting technologies for augmenting human mobi...
research
07/16/2021

Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems

This paper studies the adaptive optimal stationary control of continuous...
research
10/27/2021

A Subgame Perfect Equilibrium Reinforcement Learning Approach to Time-inconsistent Problems

In this paper, we establish a subgame perfect equilibrium reinforcement ...
research
10/23/2017

Stability Analysis of Optimal Adaptive Control using Value Iteration with Approximation Errors

Adaptive optimal control using value iteration initiated from a stabiliz...
research
03/22/2019

Symbolic Regression Methods for Reinforcement Learning

Reinforcement learning algorithms can be used to optimally solve dynamic...
