A Bayesian Rule for Adaptive Control based on Causal Interventions

11/26/2009
by Pedro A. Ortega, et al.

Explaining adaptive behavior is a central problem in artificial intelligence research. Here we formalize adaptive agents as mixture distributions over sequences of inputs and outputs (I/O). Each distribution in the mixture constitutes a "possible world", but the agent does not know which of the possible worlds it is actually facing. The problem is to adapt the I/O stream in a way that is compatible with the true world. A natural measure of adaptation is the Kullback-Leibler (KL) divergence between the I/O distribution of the true world and the I/O distribution expected by an agent that is uncertain about which world it faces. In the case of pure input streams, the Bayesian mixture provides a well-known solution to this problem. We show, however, that in the case of I/O streams this solution breaks down, because outputs are issued by the agent itself and require a different probabilistic syntax, namely the one provided by intervention calculus. Based on this calculus, we obtain a Bayesian control rule that allows modeling adaptive behavior with mixture distributions over I/O streams. This rule might allow for a novel approach to adaptive control based on a minimum KL-principle.
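The distinction between conditioning on an output and intervening on it can be illustrated with a small sketch. Everything below is an assumption made for illustration and is not taken from the paper: the two worlds `w0` and `w1`, the action names, the probability tables, and the helper function names are all hypothetical. The sketch assumes that an intervened action leaves the belief over worlds unchanged, that only inputs (observations) update it, and that the agent's output distribution is the belief-weighted mixture of the per-world policies.

```python
import math

# Hypothetical two-world setup; all numbers are illustrative assumptions.
PRIOR = {"w0": 0.5, "w1": 0.5}
# P(action | world): the policy the agent would follow if it knew the world.
POLICY = {"w0": {"left": 0.9, "right": 0.1},
          "w1": {"left": 0.1, "right": 0.9}}
# P(obs = 1 | world, action): how each world responds to an action.
OBS = {"w0": {"left": 0.8, "right": 0.2},
       "w1": {"left": 0.2, "right": 0.8}}


def normalize(weights):
    """Rescale non-negative weights so they sum to one."""
    total = sum(weights.values())
    return {k: v / total for k, v in weights.items()}


def update_on_observation(belief, action, obs):
    """Bayes update on an input: weight each world by P(obs | world, action)."""
    def likelihood(world):
        p_one = OBS[world][action]
        return p_one if obs == 1 else 1.0 - p_one
    return normalize({w: p * likelihood(w) for w, p in belief.items()})


def update_on_conditioned_action(belief, action):
    """Naive mixture: treat the agent's own output as evidence about the world."""
    return normalize({w: p * POLICY[w][action] for w, p in belief.items()})


def update_on_intervened_action(belief, action):
    """Intervention: the agent set the output itself, so the belief is unchanged."""
    return dict(belief)


def action_distribution(belief):
    """Output distribution as the belief-weighted mixture of per-world policies."""
    actions = POLICY["w0"].keys()
    return {a: sum(belief[w] * POLICY[w][a] for w in belief) for a in actions}


def kl(p, q):
    """KL divergence D(p || q) for distributions over the same finite set."""
    return sum(p[k] * math.log(p[k] / q[k]) for k in p if p[k] > 0)


# The agent emits "left", then the world returns obs = 1.
naive = update_on_observation(
    update_on_conditioned_action(PRIOR, "left"), "left", 1)
causal = update_on_observation(
    update_on_intervened_action(PRIOR, "left"), "left", 1)

# Divergence from the policy of a (hypothetical) true world w0, before and after.
kl_before = kl(POLICY["w0"], action_distribution(PRIOR))
kl_after = kl(POLICY["w0"], action_distribution(causal))

print("naive belief :", naive)
print("causal belief:", causal)
print("KL before/after:", kl_before, kl_after)
```

In this toy setup the naive update counts the agent's own choice as evidence for `w0`, so its belief moves further than the observation alone warrants. The intervened update lets only the observation move the belief.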


