A conversion between utility and information

11/26/2009
by Pedro A. Ortega, et al.

Rewards typically express desirabilities or preferences over a set of alternatives. Here we propose that rewards can be defined for any probability distribution based on three desiderata, namely that rewards should be real-valued, additive and order-preserving, where the latter implies that more probable events should also be more desirable. Our main result states that rewards are then uniquely determined by the negative information content. To analyze stochastic processes, we define the utility of a realization as its reward rate. Under this interpretation, we show that the expected utility of a stochastic process is its negative entropy rate. Furthermore, we apply our results to analyze agent-environment interactions. We show that the expected utility that will actually be achieved by the agent is given by the negative cross-entropy from the input-output (I/O) distribution of the coupled interaction system and the agent's I/O distribution. Thus, our results allow for an information-theoretic interpretation of the notion of utility and the characterization of agent-environment interactions in terms of entropy dynamics.
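As a rough numerical illustration of the claims above, the following sketch treats the reward of an event as its log-probability (the negative information content) and checks the stated properties on a toy distribution. The natural logarithm, the unit scale factor, the function names, and the distributions `p` and `q` are all assumptions made for this example and are not taken from the paper. Here `p` stands in for the distribution that actually generates outcomes and `q` for the distribution used to assign rewards.

```python
import math

def reward(prob):
    """Reward of an event with probability `prob`: log p, in nats (assumed scale 1)."""
    return math.log(prob)

def entropy(p):
    """Shannon entropy of a discrete distribution, in nats."""
    return -sum(px * math.log(px) for px in p if px > 0)

def cross_entropy(p, q):
    """Cross-entropy of q relative to p, in nats."""
    return -sum(px * math.log(qx) for px, qx in zip(p, q) if px > 0)

def expected_reward(p, q=None):
    """Expected reward sum_x p(x) * log q(x); with q omitted, rewards come from p itself."""
    q = p if q is None else q
    return sum(px * reward(qx) for px, qx in zip(p, q) if px > 0)

# Hypothetical toy distributions over three outcomes.
p = [0.5, 0.25, 0.25]
q = [0.25, 0.25, 0.5]

# Additive: the reward of two independent events is the sum of their rewards.
additive_ok = math.isclose(reward(0.5 * 0.25), reward(0.5) + reward(0.25))

# Order-preserving: the more probable event receives the higher reward.
order_ok = reward(0.5) > reward(0.25)

# Expected reward under p equals the negative entropy of p.
neg_entropy_ok = math.isclose(expected_reward(p), -entropy(p))

# Expected reward when outcomes follow p but rewards come from q
# equals the negative cross-entropy.
neg_cross_entropy_ok = math.isclose(expected_reward(p, q), -cross_entropy(p, q))
```

In this toy setting the expected reward is never higher with a mismatched `q` than with `q = p`, which follows from Gibbs' inequality. The sketch covers single draws only and does not model the reward rate of a stochastic process.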

research
07/31/2009

Convergence of Expected Utility for Universal AI

We consider a sequence of repeated interactions between an agent and an ...
research
02/21/2022

Inferring Lexicographically-Ordered Rewards from Preferences

Modeling the preferences of agents over a set of alternatives is a princ...
research
05/11/2020

Maximizing Information Gain in Partially Observable Environments via Prediction Reward

Information gathering in a partially observable environment can be formu...
research
07/10/2021

Dirichlet polynomials and entropy

A Dirichlet polynomial d in one variable 𝓎 is a function of the form d(𝓎...
research
02/09/2023

An Information-Theoretic Analysis of Nonstationary Bandit Learning

In nonstationary bandit learning problems, the decision-maker must conti...
research
07/29/2019

Reinforcement with Fading Memories

We study the effect of imperfect memory on decision making in the contex...
research
11/11/2012

Random Utility Theory for Social Choice

Random utility theory models an agent's preferences on alternatives by d...
