Computably Continuous Reinforcement-Learning Objectives are PAC-learnable

03/09/2023
by   Cambridge Yang, et al.
0

In reinforcement learning, the classic objectives of maximizing discounted and finite-horizon cumulative rewards are PAC-learnable: There are algorithms that learn a near-optimal policy with high probability using a finite amount of samples and computation. In recent years, researchers have introduced objectives and corresponding reinforcement-learning algorithms beyond the classic cumulative rewards, such as objectives specified as linear temporal logic formulas. However, questions about the PAC-learnability of these new objectives have remained open. This work demonstrates the PAC-learnability of general reinforcement-learning objectives through sufficient conditions for PAC-learnability in two analysis settings. In particular, for the analysis that considers only sample complexity, we prove that if an objective given as an oracle is uniformly continuous, then it is PAC-learnable. Further, for the analysis that considers computational complexity, we prove that if an objective is computable, then it is PAC-learnable. In other words, if a procedure computes successive approximations of the objective's value, then the objective is PAC-learnable. We give three applications of our condition on objectives from the literature with previously unknown PAC-learnability and prove that these objectives are PAC-learnable. Overall, our result helps verify existing objectives' PAC-learnability. Also, as some studied objectives that are not uniformly continuous have been shown to be not PAC-learnable, our results could guide the design of new PAC-learnable objectives.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2021

Reinforcement Learning for General LTL Objectives Is Intractable

In recent years, researchers have made significant progress in devising ...
research
02/06/2023

Find a witness or shatter: the landscape of computable PAC learning

This paper contributes to the study of CPAC learnability – a computable ...
research
11/24/2021

On computable learning of continuous features

We introduce definitions of computable PAC learning for binary classific...
research
07/11/2023

Reinforcement Learning with Non-Cumulative Objective

In reinforcement learning, the objective is almost always defined as a c...
research
09/19/2013

Predictive PAC Learning and Process Decompositions

We informally call a stochastic process learnable if it admits a general...
research
07/27/2018

Learnable: Theory vs Applications

Two different views on machine learning problem: Applied learning (machi...
research
10/18/2018

On Statistical Learning of Simplices: Unmixing Problem Revisited

Learning of high-dimensional simplices from uniformly-sampled observatio...

Please sign up or login with your details

Forgot password? Click here to reset