Markov Rewards Processes with Impulse Rewards and Absorbing States

05/01/2021
by   Louis Tan, et al.
0

We study the expected accumulated reward for a discrete-time Markov reward model with absorbing states. The rewards are impulse rewards, where a reward ρ_ij is accumulated when transitioning from state i to state j. We derive an explicit, single-letter expression for the expected accumulated reward as a function of the number of time steps n and include in our analysis the limit in which n →∞.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/22/2023

On the Expressivity of Multidimensional Markov Reward

We consider the expressivity of Markov rewards in sequential decision ma...
research
01/29/2018

Learning the Reward Function for a Misspecified Model

In model-based reinforcement learning it is typical to treat the problem...
research
02/21/2021

Delayed Rewards Calibration via Reward Empirical Sufficiency

Appropriate credit assignment for delay rewards is a fundamental challen...
research
12/21/2022

Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening Search

This paper introduces a novel method of adding intrinsic bonuses to task...
research
10/12/2017

Identifying On-time Reward Delivery Projects with Estimating Delivery Duration on Kickstarter

In Crowdfunding platforms, people turn their prototype ideas into real p...
research
03/23/2021

Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification

In the standard Markov decision process formalism, users specify tasks b...
research
05/29/2018

Maximizing Service Reward for Queues with Deadlines

In this paper we consider a real time queuing system with rewards and de...

Please sign up or login with your details

Forgot password? Click here to reset