The complexity of non-stationary reinforcement learning

07/13/2023
by   Christos Papadimitriou, et al.
0

The problem of continual learning in the domain of reinforcement learning, often called non-stationary reinforcement learning, has been identified as an important challenge to the application of reinforcement learning. We prove a worst-case complexity result, which we believe captures this challenge: Modifying the probabilities or the reward of a single state-action pair in a reinforcement learning problem requires an amount of time almost as large as the number of states in order to keep the value function up to date, unless the strong exponential time hypothesis (SETH) is false; SETH is a widely accepted strengthening of the P ≠ NP conjecture. Recall that the number of states in current applications of reinforcement learning is typically astronomical. In contrast, we show that just adding a new state-action pair is considerably easier to implement.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2019

Continual Reinforcement Learning in 3D Non-stationary Environments

High-dimensional always-changing environments constitute a hard challeng...
research
10/29/2018

Learning to Learn without Forgetting By Maximizing Transfer and Minimizing Interference

Lack of performance when it comes to continual learning over non-station...
research
12/25/2020

Towards Continual Reinforcement Learning: A Review and Perspectives

In this article, we aim to provide a literature review of different form...
research
12/14/2022

Reinforcement Learning in System Identification

System identification, also known as learning forward models, transfer f...
research
08/19/2022

Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games

This paper addresses policy learning in non-stationary environments and ...
research
09/18/2023

Self-Sustaining Multiple Access with Continual Deep Reinforcement Learning for Dynamic Metaverse Applications

The Metaverse is a new paradigm that aims to create a virtual environmen...
research
02/22/2022

Continual Auxiliary Task Learning

Learning auxiliary tasks, such as multiple predictions about the world, ...

Please sign up or login with your details

Forgot password? Click here to reset