Learning Policies for Continuous Control via Transition Models

09/16/2022
by   Justus Huebotter, et al.
5

It is doubtful that animals have perfect inverse models of their limbs (e.g., what muscle contraction must be applied to every joint to reach a particular location in space). However, in robot control, moving an arm's end-effector to a target position or along a target trajectory requires accurate forward and inverse models. Here we show that by learning the transition (forward) model from interaction, we can use it to drive the learning of an amortized policy. Hence, we revisit policy optimization in relation to the deep active inference framework and describe a modular neural network architecture that simultaneously learns the system dynamics from prediction errors and the stochastic policy that generates suitable continuous control commands to reach a desired reference position. We evaluated the model by comparing it against the baseline of a linear quadratic regulator, and conclude with additional steps to take toward human-like motor control.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/29/2017

Non-linear motor control by local learning in spiking neural networks

Learning weights in a spiking neural network with hidden neurons, using ...
research
09/23/2017

Multi-task Learning with Gradient Guided Policy Specialization

We present a method for efficient learning of control policies for multi...
research
02/21/2023

A comparative study of human inverse kinematics techniques for lower limbs

Inverse Kinematics (IK) has been an active research topic and many metho...
research
07/29/2020

Modular Transfer Learning with Transition Mismatch Compensation for Excessive Disturbance Rejection

Underwater robots in shallow waters usually suffer from strong wave forc...
research
08/27/2021

Active Inference for Stochastic Control

Active inference has emerged as an alternative approach to control probl...
research
03/18/2021

Robot Manipulator Control with Inverse Kinematics PD-Pseudoinverse Jacobian and Forward Kinematics Denavit Hartenberg

This paper presents the development of vision-based robotic arm manipula...
research
06/02/2022

Uniqueness and Complexity of Inverse MDP Models

What is the action sequence aa'a" that was likely responsible for reachi...

Please sign up or login with your details

Forgot password? Click here to reset