A sojourn-based approach to semi-Markov Reinforcement Learning

01/18/2022
by Giacomo Ascione, et al.

In this paper we introduce a new approach to discrete-time semi-Markov decision processes based on the sojourn time process. We exploit different characterizations of discrete-time semi-Markov processes and use them to construct the decision processes. With this approach, the agent can choose different actions depending on how long the process has been in its current state. We investigate numerical methods based on Q-learning algorithms for finite-horizon reinforcement learning and on stochastic recursive relations. We consider a toy example in which the reward depends on the sojourn time, in the spirit of the gambler's fallacy, and we prove that the underlying process does not generally exhibit the Markov property. Finally, we use this example to carry out numerical evaluations of the Q-learning algorithms presented earlier and of a different method based on deep reinforcement learning.
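
To make the idea concrete, here is a minimal sketch of finite-horizon tabular Q-learning on a state augmented with the sojourn time. It is not the authors' algorithm or their toy example: the two-state process, the `hazard` function, the 0/1 betting reward, the sojourn cap `MAX_SOJOURN`, and all hyperparameters are assumptions chosen only for illustration. The point it shows is that indexing the Q-table by (time step, state, sojourn time) lets the learned policy depend on how long the process has been in its current state.

```python
import random

HORIZON = 10       # finite horizon (assumed)
N_STATES = 2       # hypothetical two-state process
MAX_SOJOURN = 5    # sojourn times are capped so the table stays finite (assumed)
ACTIONS = (0, 1)   # hypothetical bets: 0 = "process stays", 1 = "process jumps"


def hazard(g):
    """Hypothetical probability of leaving a state after g steps spent in it."""
    return min(0.1 + 0.2 * g, 0.9)


def step(x, g, a, rng):
    """Advance the process one step; the reward is 1 if the bet was right."""
    jumped = rng.random() < hazard(g)
    reward = 1.0 if a == int(jumped) else 0.0
    if jumped:
        return 1 - x, 0, reward                    # new state, sojourn resets
    return x, min(g + 1, MAX_SOJOURN), reward      # same state, sojourn grows


def greedy_action(q, t, x, g):
    """Best action at time t in state x with sojourn time g."""
    return max(ACTIONS, key=lambda b: q[t][x][g][b])


def train(episodes=20000, alpha=0.1, eps=0.2, seed=0):
    """Epsilon-greedy Q-learning with a table q[t][x][g][a]."""
    rng = random.Random(seed)
    # Row t = HORIZON stays at zero and acts as the terminal value.
    q = [[[[0.0 for _ in ACTIONS]
           for _ in range(MAX_SOJOURN + 1)]
          for _ in range(N_STATES)]
         for _ in range(HORIZON + 1)]
    for _ in range(episodes):
        x, g = 0, 0
        for t in range(HORIZON):
            if rng.random() < eps:
                a = rng.choice(ACTIONS)
            else:
                a = greedy_action(q, t, x, g)
            nx, ng, r = step(x, g, a, rng)
            # Undiscounted finite-horizon target: reward plus best value at t + 1.
            target = r + max(q[t + 1][nx][ng])
            q[t][x][g][a] += alpha * (target - q[t][x][g][a])
            x, g = nx, ng
    return q


if __name__ == "__main__":
    q = train()
    for g in range(MAX_SOJOURN + 1):
        print("sojourn", g, "-> greedy action at t=0:", greedy_action(q, 0, 0, g))
```

Under these assumptions the bet does not influence the transitions, so one would expect the learned policy to bet on staying when the sojourn time is short and on jumping when it is long. Entries for (time, sojourn) pairs that can never occur, such as a large sojourn time at `t = 0`, are never updated and remain at their initial value.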


