Goal-Directed Planning by Reinforcement Learning and Active Inference

06/18/2021
by   Dongqi Han, et al.
13

What is the difference between goal-directed and habitual behavior? We propose a novel computational framework of decision making with Bayesian inference, in which everything is integrated as an entire neural network model. The model learns to predict environmental state transitions by self-exploration and generating motor actions by sampling stochastic internal states z. Habitual behavior, which is obtained from the prior distribution of z, is acquired by reinforcement learning. Goal-directed behavior is determined from the posterior distribution of z by planning, using active inference which optimizes the past, current and future z by minimizing the variational free energy for the desired future observation constrained by the observed sensory sequence. We demonstrate the effectiveness of the proposed framework by experiments in a sensorimotor navigation task with camera observations and continuous motor actions.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 7

page 8

research
02/21/2022

Goal-directed Planning and Goal Understanding by Active Inference: Evaluation Through Simulated and Physical Robot Experiments

We show that goal-directed action planning and generation in a teleologi...
research
04/11/2023

Habits and goals in synergy: a variational Bayesian framework for behavior

How to behave efficiently and flexibly is a central problem for understa...
research
11/16/2022

A Neural Active Inference Model of Perceptual-Motor Learning

The active inference framework (AIF) is a promising new computational fr...
research
03/12/2019

Goal-Directed Behavior under Variational Predictive Coding: Dynamic Organization of Visual Attention and Working Memory

Mental simulation is a critical cognitive function for goal-directed beh...
research
02/23/2022

Inference of Affordances and Active Motor Control in Simulated Agents

Flexible, goal-directed behavior is a fundamental aspect of human life. ...
research
05/29/2019

Learning Navigation Subroutines by Watching Videos

Hierarchies are an effective way to boost sample efficiency in reinforce...
research
12/10/2021

Encoding priors in the brain: a reinforcement learning model for mouse decision making

In two-alternative forced choice tasks, prior knowledge can improve perf...

Please sign up or login with your details

Forgot password? Click here to reset