A Monte Carlo Algorithm for Universally Optimal Bayesian Sequence Prediction and Planning

01/18/2010
by   Anthony Di Franco, et al.
0

The aim of this work is to address the question of whether we can in principle design rational decision-making agents or artificial intelligences embedded in computable physics such that their decisions are optimal in reasonable mathematical senses. Recent developments in rare event probability estimation, recursive bayesian inference, neural networks, and probabilistic planning are sufficient to explicitly approximate reinforcement learners of the AIXI style with non-trivial model classes (here, the class of resource-bounded Turing machines). Consideration of the effects of resource limitations in a concrete implementation leads to insights about possible architectures for learning systems using optimal decision makers as components.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/21/2015

Information-Theoretic Bounded Rationality

Bounded rationality, that is, decision-making and planning under resourc...
research
09/11/2019

Correcting Predictions for Approximate Bayesian Inference

Bayesian models quantify uncertainty and facilitate optimal decision-mak...
research
11/09/2019

Markov-chain Monte-Carlo Sampling for Optimal Fidelity Determination in Dynamic Decision-Making

Decision making for dynamic systems is challenging due to the scale and ...
research
02/14/2022

Learning Reward Models for Cooperative Trajectory Planning with Inverse Reinforcement Learning and Monte Carlo Tree Search

Cooperative trajectory planning methods for automated vehicles, are capa...
research
06/01/2021

A Bayesian-network-based cybersecurity adversarial risk analysis framework with numerical examples

Cybersecurity risk analysis plays an essential role in supporting organi...
research
09/26/2013

Monte-Carlo Planning: Theoretically Fast Convergence Meets Practical Efficiency

Popular Monte-Carlo tree search (MCTS) algorithms for online planning, s...
research
02/20/2019

Where Do Human Heuristics Come From?

Human decision-making deviates from the optimal solution, that maximizes...

Please sign up or login with your details

Forgot password? Click here to reset