Reinforcement Learning with an Abrupt Model Change

04/22/2023
by   Wuxia Chen, et al.
0

The problem of reinforcement learning is considered where the environment or the model undergoes a change. An algorithm is proposed that an agent can apply in such a problem to achieve the optimal long-time discounted reward. The algorithm is model-free and learns the optimal policy by interacting with the environment. It is shown that the proposed algorithm has strong optimality properties. The effectiveness of the algorithm is also demonstrated using simulation results. The proposed algorithm exploits a fundamental reward-detection trade-off present in these problems and uses a quickest change detection algorithm to detect the model change. Recommendations are provided for faster detection of model changes and for smart initialization strategies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2023

Regret-Optimal Model-Free Reinforcement Learning for Discounted MDPs with Short Burn-In Time

A crucial problem in reinforcement learning is learning the optimal poli...
research
02/13/2022

Reinforcement Learning Based Power Control for Reliable Wireless Transmission

In this paper, we investigate a sequential power allocation problem over...
research
02/27/2023

Reinforcement Learning with Depreciating Assets

A basic assumption of traditional reinforcement learning is that the val...
research
05/18/2023

Bayesian Risk-Averse Q-Learning with Streaming Observations

We consider a robust reinforcement learning problem, where a learning ag...
research
07/28/2023

Curiosity-Driven Reinforcement Learning based Low-Level Flight Control

Curiosity is one of the main motives in many of the natural creatures wi...
research
04/26/2021

Performance Testing Using a Smart Reinforcement Learning-Driven Test Agent

Performance testing with the aim of generating an efficient and effectiv...
research
05/23/2021

An Efficient Application of Neuroevolution for Competitive Multiagent Learning

Multiagent systems provide an ideal environment for the evaluation and a...

Please sign up or login with your details

Forgot password? Click here to reset