Generalized Thompson Sampling for Sequential Decision-Making and Causal Inference

03/18/2013
by   Pedro A. Ortega, et al.
0

Recently, it has been shown how sampling actions from the predictive distribution over the optimal action-sometimes called Thompson sampling-can be applied to solve sequential adaptive control problems, when the optimal policy is known for each possible environment. The predictive distribution can then be constructed by a Bayesian superposition of the optimal policies weighted by their posterior probability that is updated by Bayesian inference and causal calculus. Here we discuss three important features of this approach. First, we discuss in how far such Thompson sampling can be regarded as a natural consequence of the Bayesian modeling of policy uncertainty. Second, we show how Thompson sampling can be used to study interactions between multiple adaptive agents, thus, opening up an avenue of game-theoretic analysis. Third, we show how Thompson sampling can be applied to infer causal relationships when interacting with an environment in a sequential fashion. In summary, our results suggest that Thompson sampling might not merely be a useful heuristic, but a principled method to address problems of adaptive sequential decision-making and causal inference.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/11/2019

Correcting Predictions for Approximate Bayesian Inference

Bayesian models quantify uncertainty and facilitate optimal decision-mak...
research
06/27/2023

Causal Inference via Predictive Coding

Bayesian and causal inference are fundamental processes for intelligence...
research
03/20/2022

Geometric Methods for Sampling, Optimisation, Inference and Adaptive Agents

In this chapter, we identify fundamental geometric structures that under...
research
11/12/2017

Quickest Detection of Markov Networks

Detecting correlation structures in large networks arises in many domain...
research
02/10/2021

Patterns, predictions, and actions: A story about machine learning

This graduate textbook on machine learning tells a story of how patterns...
research
05/24/2020

Causal Bayesian Optimization

This paper studies the problem of globally optimizing a variable of inte...
research
03/13/2013

Some Problems for Convex Bayesians

We discuss problems for convex Bayesian decision making and uncertainty ...

Please sign up or login with your details

Forgot password? Click here to reset