Achieving Stable Training of Reinforcement Learning Agents in Bimodal Environments through Batch Learning

07/03/2023
by E. Hurwitz, et al.

Bimodal, stochastic environments present a challenge to typical Reinforcement Learning approaches. Such environments are surprisingly common in real-world applications, particularly in pricing problems. In this paper we present a novel learning approach for the tabular Q-learning algorithm, tailored to these specific challenges by using batch updates. A simulation of a pricing problem is used as a testbed to compare a typically updated agent with a batch learning agent. The batch learning agents are shown to be both more effective than the typically trained agents and more resilient to the fluctuations in a large stochastic environment. This work has significant potential to enable practical, industrial deployment of Reinforcement Learning in pricing and other contexts.
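
The abstract does not spell out the update rule, so the following is only a minimal sketch of one plausible reading of "batch updates" for tabular Q-learning, not the authors' implementation. It assumes a single-state pricing task (so there is no bootstrapped next-state term), a hypothetical linear demand curve, and a bimodal reward that is either the full price (sale) or zero (no sale). The names `bimodal_reward`, `train_online`, and `train_batch`, and all parameter values, are illustrative assumptions.

```python
import random
from collections import defaultdict


def bimodal_reward(price, rng):
    """Hypothetical bimodal reward: the price if a sale occurs, else 0."""
    p_sale = max(0.0, 1.0 - price / 10.0)  # assumed linear demand curve
    return float(price) if rng.random() < p_sale else 0.0


def choose_price(q, prices, epsilon, rng):
    """Epsilon-greedy selection over the Q-table."""
    if rng.random() < epsilon:
        return rng.choice(prices)
    return max(q, key=q.get)


def train_online(prices, steps, alpha, epsilon, rng):
    """Typical update: move Q toward every single noisy reward."""
    q = {p: 0.0 for p in prices}
    for _ in range(steps):
        p = choose_price(q, prices, epsilon, rng)
        q[p] += alpha * (bimodal_reward(p, rng) - q[p])
    return q


def train_batch(prices, steps, batch_size, alpha, epsilon, rng):
    """Batch update (assumed form): collect rewards per action, then
    move Q toward the batch mean once every `batch_size` steps."""
    q = {p: 0.0 for p in prices}
    buffer = defaultdict(list)
    for t in range(1, steps + 1):
        p = choose_price(q, prices, epsilon, rng)
        buffer[p].append(bimodal_reward(p, rng))
        if t % batch_size == 0:
            for action, rewards in buffer.items():
                target = sum(rewards) / len(rewards)
                q[action] += alpha * (target - q[action])
            buffer.clear()
    return q


if __name__ == "__main__":
    prices = list(range(1, 10))
    print(train_online(prices, 5000, 0.1, 0.1, random.Random(0)))
    print(train_batch(prices, 5000, 50, 0.1, 0.1, random.Random(0)))
```

In this sketch the per-sample agent chases each individual sale/no-sale outcome, while the batch agent only sees the averaged outcome of each price over a batch, which is one way the two modes of the reward could be smoothed before they reach the Q-table. Whether this matches the paper's procedure would need to be checked against the full text.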

research
05/03/2021

Reinforcement Learning for Ridesharing: A Survey

In this paper, we present a comprehensive, in-depth survey of the litera...
research
12/06/2021

Hierarchical Reinforcement Learning with Timed Subgoals

Hierarchical reinforcement learning (HRL) holds great potential for samp...
research
08/31/2018

APES: a Python toolbox for simulating reinforcement learning environments

Assisted by neural networks, reinforcement learning agents have been abl...
research
09/07/2023

Navigation Through Endoluminal Channels Using Q-Learning

In this paper, we present a novel approach to navigating endoluminal cha...
research
08/07/2023

Deep Q-Network for Stochastic Process Environments

Reinforcement learning is a powerful approach for training an optimal po...
research
11/13/2022

Goal-Conditioned Reinforcement Learning in the Presence of an Adversary

Reinforcement learning has seen increasing applications in real-world co...
research
04/27/2023

Batch Quantum Reinforcement Learning

Training DRL agents is often a time-consuming process as a large number ...
