Statistical Linear Estimation with Penalized Estimators: an Application to Reinforcement Learning

06/27/2012
by   Bernardo Ávila Pires, et al.
0

Motivated by value function estimation in reinforcement learning, we study statistical linear inverse problems, i.e., problems where the coefficients of a linear system to be solved are observed in noise. We consider penalized estimators, where performance is evaluated using a matrix-weighted two-norm of the defect of the estimator measured with respect to the true, unknown coefficients. Two objective functions are considered depending whether the error of the defect measured with respect to the noisy coefficients is squared or unsquared. We propose simple, yet novel and theoretically well-founded data-dependent choices for the regularization parameters for both cases that avoid data-splitting. A distinguishing feature of our analysis is that we derive deterministic error bounds in terms of the error of the coefficients, thus allowing the complete separation of the analysis of the stochastic properties of these errors. We show that our results lead to new insights and bounds for linear value function estimation in reinforcement learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/27/2022

Variance Reduction for Score Functions Using Optimal Baselines

Many problems involve the use of models which learn probability distribu...
research
07/25/2023

The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation

Theoretical guarantees in reinforcement learning (RL) are known to suffe...
research
09/19/2019

Value function estimation in Markov reward processes: Instance-dependent ℓ_∞-bounds for policy evaluation

Markov reward processes (MRPs) are used to model stochastic phenomena ar...
research
07/18/2013

Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization

We consider the problem of reinforcement learning over episodes of a fin...
research
10/16/2018

Clustering in statistical ill-posed linear inverse problems

In many statistical linear inverse problems, one needs to recover classe...
research
05/16/2019

Adaptive estimation in the linear random coefficients model when regressors have limited variation

We consider a linear model where the coefficients-intercept and slopes-a...
research
07/05/2021

Optimal Estimation of Brownian Penalized Regression Coefficients

In this paper we introduce a new methodology to determine an optimal coe...

Please sign up or login with your details

Forgot password? Click here to reset