Learning from Scarce Experience

04/20/2002
by Leonid Peshkin, et al.

Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each change of the target policy, its value is estimated from the results of following that very policy. This requires a large number of interactions with the environment as different policies are considered. We present a family of algorithms based on likelihood ratio estimation that use data gathered when executing one policy (or collection of policies) to estimate the value of a different policy. The algorithms combine estimation and optimization stages. The former utilizes experience to build a non-parametric representation of the function being optimized. The latter performs optimization on this estimate. We show positive empirical results and provide a sample complexity bound.
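To make the idea of likelihood ratio estimation concrete, the following is a minimal sketch, not the authors' algorithm. It assumes memoryless (reactive) policies that map an observation to action probabilities, so the unknown environment dynamics cancel in the ratio of trajectory probabilities. The names `likelihood_ratio`, `estimate_value`, and `make_policy`, the undiscounted return, and the toy data are all hypothetical choices made for illustration.

```python
from typing import Callable, List, Tuple

# One step of experience: (observation, action, reward).
Step = Tuple[int, int, float]
Trajectory = List[Step]
# A policy returns the probability of taking `action` given `obs`.
Policy = Callable[[int, int], float]


def likelihood_ratio(traj: Trajectory, target: Policy, behavior: Policy) -> float:
    """Ratio of the trajectory's probability under target vs. behavior policy.

    Only the action probabilities appear: for reactive policies the
    environment's transition and observation terms cancel out.
    """
    ratio = 1.0
    for obs, action, _ in traj:
        ratio *= target(obs, action) / behavior(obs, action)
    return ratio


def estimate_value(trajs: List[Trajectory], target: Policy, behavior: Policy,
                   normalize: bool = False) -> float:
    """Estimate the target policy's expected return from behavior-policy data.

    Assumption: the return is the undiscounted sum of rewards.
    With normalize=True the weights are divided by their sum
    (weighted importance sampling) instead of the sample count.
    """
    weights = [likelihood_ratio(t, target, behavior) for t in trajs]
    returns = [sum(r for _, _, r in t) for t in trajs]
    weighted = sum(w * g for w, g in zip(weights, returns))
    denom = sum(weights) if normalize else float(len(trajs))
    return weighted / denom if denom else 0.0


def make_policy(p_action_1: float) -> Policy:
    """Hypothetical two-action policy that ignores the observation."""
    return lambda obs, action: p_action_1 if action == 1 else 1.0 - p_action_1


# Toy data gathered under a uniform behavior policy: action 1 pays 1, action 0 pays 0.
behavior = make_policy(0.5)
data: List[Trajectory] = [[(0, 1, 1.0)], [(0, 0, 0.0)], [(0, 1, 1.0)], [(0, 0, 0.0)]]

# Estimation stage: score several candidate policies from the same data.
candidates = [0.1, 0.5, 0.9]
scores = {p: estimate_value(data, make_policy(p), behavior) for p in candidates}

# Optimization stage (here a plain search over the candidates).
best = max(scores, key=scores.get)
```

In this sketch the same four trajectories score every candidate policy, so no further interaction with the environment is needed while searching. The exhaustive search over three candidates merely stands in for the optimization stage; the abstract does not specify which optimizer the paper uses.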


