Feature Reinforcement Learning In Practice

08/18/2011
by   Phuong Nguyen, et al.
0

Following a recent surge in using history-based methods for resolving perceptual aliasing in reinforcement learning, we introduce an algorithm based on the feature reinforcement learning framework called PhiMDP. To create a practical algorithm we devise a stochastic search procedure for a class of context trees based on parallel tempering and a specialized proposal distribution. We provide the first empirical evaluation for PhiMDP. Our proposed algorithm achieves superior performance to the classical U-tree algorithm and the recent active-LZ algorithm, and is competitive with MC-AIXI-CTW that maintains a bayesian mixture over all context trees up to a chosen depth.We are encouraged by our ability to compete with this sophisticated method using an algorithm that simply picks one single model, and uses Q-learning on the corresponding MDP. Our PhiMDP algorithm is much simpler, yet consumes less time and memory. These results show promise for our future work on attacking more complex and larger problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/04/2018

Exploration by Distributional Reinforcement Learning

We propose a framework based on distributional reinforcement learning an...
research
03/31/2023

Online Reinforcement Learning in Markov Decision Process Using Linear Programming

We consider online reinforcement learning in episodic Markov decision pr...
research
09/08/2022

An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning

We study a posterior sampling approach to efficient exploration in const...
research
04/06/2022

Standardized feature extraction from pairwise conflicts applied to the train rescheduling problem

We propose a train rescheduling algorithm which applies a standardized f...
research
07/16/2018

Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees

Deep Reinforcement Learning (DRL) has achieved impressive success in man...
research
10/07/2019

Self-Paced Contextual Reinforcement Learning

Generalization and adaptation of learned skills to novel situations is a...
research
06/17/2023

Vanishing Bias Heuristic-guided Reinforcement Learning Algorithm

Reinforcement Learning has achieved tremendous success in the many Atari...

Please sign up or login with your details

Forgot password? Click here to reset