Greedy Algorithms for Sparse Reinforcement Learning

Feature selection and regularization are becoming increasingly prominent tools in the efforts of the reinforcement learning (RL) community to expand the reach and applicability of RL. One approach to the problem of feature selection is to impose a sparsity-inducing form of regularization on the learning method. Recent work on L_1 regularization has adapted techniques from the supervised learning literature for use with RL. Another approach that has received renewed attention in the supervised learning community is to use a simple algorithm that greedily adds new features. Such algorithms have many of the good properties of the L_1 regularization methods, while also being extremely efficient and, in some cases, allowing theoretical guarantees on recovery of the true form of a sparse target function from sampled data. This paper considers variants of orthogonal matching pursuit (OMP) applied to reinforcement learning. The resulting algorithms are analyzed and compared experimentally with existing L_1-regularized approaches. We demonstrate that perhaps the most natural scenario in which one might hope to achieve sparse recovery fails; however, one variant, OMP-BRM, provides promising theoretical guarantees under certain assumptions on the feature dictionary. Another variant, OMP-TD, empirically outperforms prior methods in both approximation accuracy and efficiency on several benchmark problems.
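For readers unfamiliar with the greedy scheme the abstract refers to, the following is a minimal sketch of plain orthogonal matching pursuit in its supervised regression form, written in dependency-free Python. It is not the paper's OMP-BRM or OMP-TD; the function names (`omp`, `solve_linear`), the stopping tolerance `tol`, and the toy data are assumptions made for illustration. Each round adds the unused feature most correlated with the current residual, then refits all selected weights by least squares.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting.

    Assumes `a` is square and non-singular (illustrative helper).
    """
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def omp(phi, y, max_features, tol=1e-10):
    """Greedy sparse regression: returns {feature index: weight}.

    phi is a list of sample rows (n samples, d features); y has n targets.
    Assumes the selected columns stay linearly independent.
    """
    n, d = len(phi), len(phi[0])
    cols = [[phi[i][j] for i in range(n)] for j in range(d)]
    norms = [sum(v * v for v in col) ** 0.5 for col in cols]
    active, weights = [], []
    residual = list(y)
    for _ in range(max_features):
        # Pick the unused feature with the largest normalized correlation.
        best, best_score = None, tol
        for j in range(d):
            if j in active or norms[j] == 0.0:
                continue
            corr = sum(c * r for c, r in zip(cols[j], residual))
            score = abs(corr) / norms[j]
            if score > best_score:
                best, best_score = j, score
        if best is None:  # nothing correlates with the residual any more
            break
        active.append(best)
        # Refit every active weight via the normal equations.
        gram = [[sum(a * b for a, b in zip(cols[p], cols[q])) for q in active]
                for p in active]
        rhs = [sum(a * b for a, b in zip(cols[p], y)) for p in active]
        weights = solve_linear(gram, rhs)
        residual = [y[i] - sum(w * cols[j][i] for w, j in zip(weights, active))
                    for i in range(n)]
    return dict(zip(active, weights))


# Toy data: four mutually orthogonal features; the target uses only 0 and 3.
phi = [
    [1.0, 0.0, 1.0, 0.0],
    [1.0, 0.0, -1.0, 0.0],
    [0.0, 1.0, 0.0, 1.0],
    [0.0, 1.0, 0.0, -1.0],
]
y = [2.0, 2.0, -3.0, 3.0]  # equals 2 * feature 0 - 3 * feature 3
selected = omp(phi, y, max_features=3)
print(selected)
```

On this toy dictionary the loop stops after two features because the residual vanishes, even though up to three were allowed. The refit step is what distinguishes orthogonal matching pursuit from plain matching pursuit, which would update only the newly added weight.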


