Near-Optimal BRL using Optimistic Local Transitions

06/18/2012
by   Mauricio Araya, et al.
0

Model-based Bayesian Reinforcement Learning (BRL) allows a found formalization of the problem of acting optimally while facing an unknown environment, i.e., avoiding the exploration-exploitation dilemma. However, algorithms explicitly addressing BRL suffer from such a combinatorial explosion that a large body of work relies on heuristic algorithms. This paper introduces BOLT, a simple and (almost) deterministic heuristic algorithm for BRL which is optimistic about the transition function. We analyze BOLT's sample complexity, and show that under certain parameters, the algorithm is near-optimal in the Bayesian sense with high probability. Then, experimental results highlight the key differences of this method compared to previous work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/13/2012

Model-Based Bayesian Reinforcement Learning in Large Structured Domains

Model-based Bayesian reinforcement learning has generated significant in...
research
04/13/2023

Near-Optimal Degree Testing for Bayes Nets

This paper considers the problem of testing the maximum in-degree of the...
research
07/22/2021

Learning Sparse Fixed-Structure Gaussian Bayesian Networks

Gaussian Bayesian networks (a.k.a. linear Gaussian structural equation m...
research
10/07/2021

Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver

Although model-based reinforcement learning (RL) approaches are consider...
research
09/05/2023

Distributionally Robust Model-based Reinforcement Learning with Large State Spaces

Three major challenges in reinforcement learning are the complex dynamic...
research
02/14/2019

Procrastinating with Confidence: Near-Optimal, Anytime, Adaptive Algorithm Configuration

Algorithm configuration methods optimize the performance of a parameteri...
research
01/29/2023

Combinatorial Pen Testing (or Consumer Surplus of Deferred-Acceptance Auctions)

Pen testing is the problem of selecting high capacity resources when the...

Please sign up or login with your details

Forgot password? Click here to reset