Reinforcement Learning with Depreciating Assets

02/27/2023
by   Taylor Dohmen, et al.
0

A basic assumption of traditional reinforcement learning is that the value of a reward does not change once it is received by an agent. The present work forgoes this assumption and considers the situation where the value of a reward decays proportionally to the time elapsed since it was obtained. Emphasizing the inflection point occurring at the time of payment, we use the term asset to refer to a reward that is currently in the possession of an agent. Adopting this language, we initiate the study of depreciating assets within the framework of infinite-horizon quantitative optimization. In particular, we propose a notion of asset depreciation, inspired by classical exponential discounting, where the value of an asset is scaled by a fixed discount factor at each time step after it is obtained by the agent. We formulate a Bellman-style equational characterization of optimality in this context and develop a model-free reinforcement learning approach to obtain optimal policies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/26/2016

Tournament selection in zeroth-level classifier systems based on average reward reinforcement learning

As a genetics-based machine learning technique, zeroth-level classifier ...
research
02/09/2020

Maximizing the Total Reward via Reward Tweaking

In reinforcement learning, the discount factor γ controls the agent's ef...
research
04/22/2023

Reinforcement Learning with an Abrupt Model Change

The problem of reinforcement learning is considered where the environmen...
research
10/18/2020

Average-reward model-free reinforcement learning: a systematic review and literature mapping

Model-free reinforcement learning (RL) has been an active area of resear...
research
11/24/2020

Learning Principle of Least Action with Reinforcement Learning

Nature provides a way to understand physics with reinforcement learning ...
research
08/05/2020

Optimizing AD Pruning of Sponsored Search with Reinforcement Learning

Industrial sponsored search system (SSS) can be logically divided into t...
research
08/13/2019

Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective

Can an arbitrarily intelligent reinforcement learning agent be kept unde...

Please sign up or login with your details

Forgot password? Click here to reset