Servant of Many Masters: Shifting priorities in Pareto-optimal sequential decision-making

10/31/2017
by   Andrew Critch, et al.

It is often argued that an agent making decisions on behalf of two or more principals who have different utility functions should adopt a Pareto-optimal policy, i.e., a policy that cannot be improved upon for one principal without making sacrifices for another. A famous theorem of Harsanyi shows that, when the principals have a common prior on the outcome distributions of all policies, a Pareto-optimal policy for the agent is one that maximizes a fixed, weighted linear combination of the principals' utilities. In this paper, we show that Harsanyi's theorem does not hold for principals with different priors, and we derive a more precise generalization that does hold; this generalization is our main result. In this more general case, the relative weight given to each principal's utility should evolve over time according to how well the agent's observations conform with that principal's prior. The result has implications for the design of contracts, treaties, joint ventures, and robots.
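As a rough illustration of the shifting weights described in the abstract, the sketch below rescales each principal's weight by the probability that principal's prior assigned to the latest observation, then scores candidate actions by the resulting weighted sum of expected utilities. This is a minimal sketch under stated assumptions, not the paper's construction: the function names `update_weights` and `weighted_value`, the renormalization step, and all numbers are hypothetical and chosen only for illustration.

```python
from typing import List


def update_weights(weights: List[float], likelihoods: List[float]) -> List[float]:
    """Rescale each principal's weight by how well that principal predicted the observation.

    weights[i]     : current relative weight on principal i's utility (assumed non-negative)
    likelihoods[i] : probability principal i's beliefs assigned to what was just observed
    Renormalizing to sum to 1 is a convenience assumed here; only the ratios matter.
    """
    if len(weights) != len(likelihoods):
        raise ValueError("one likelihood is required per principal")
    unnormalized = [w * p for w, p in zip(weights, likelihoods)]
    total = sum(unnormalized)
    if total == 0:
        raise ValueError("no principal assigned positive probability to the observation")
    return [u / total for u in unnormalized]


def weighted_value(weights: List[float], expected_utilities: List[float]) -> float:
    """Score an action as the weighted sum of each principal's own expected utility for it."""
    return sum(w * u for w, u in zip(weights, expected_utilities))


# Hypothetical example: two principals start with equal weight.
weights = [0.5, 0.5]

# Principal 0 gave the observed outcome probability 0.8; principal 1 gave it 0.2.
weights = update_weights(weights, [0.8, 0.2])

# Each action maps to (expected utility under principal 0, under principal 1).
actions = {"a": [1.0, 0.0], "b": [0.0, 1.0]}
best_action = max(actions, key=lambda name: weighted_value(weights, actions[name]))
```

In this toy run, the principal whose beliefs better matched the observation gains relative weight, so the action that principal prefers is scored higher afterward. With a common prior the likelihoods are identical across principals, the ratios never change, and the weights stay fixed, which matches the setting of Harsanyi's theorem described above.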


research
01/05/2017

Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making

Existing multi-objective reinforcement learning (MORL) algorithms do not...
research
06/09/2021

Bayesian Persuasion in Sequential Decision-Making

We study a dynamic model of Bayesian persuasion in sequential decision-m...
research
12/08/2021

Aggregation of Pareto optimal models

In statistical decision theory, a model is said to be Pareto optimal (or...
research
06/01/2021

Bayesian Agency: Linear versus Tractable Contracts

We study principal-agent problems in which a principal commits to an out...
research
06/07/2021

Stateful Strategic Regression

Automated decision-making tools increasingly assess individuals to deter...
research
01/15/2021

Deciding What to Learn: A Rate-Distortion Approach

Agents that learn to select optimal actions represent a prominent focus ...
research
12/03/2022

Pandora's Problem with Nonobligatory Inspection: Optimal Structure and a PTAS

Weitzman introduced Pandora's box problem as a mathematical model of seq...
