A Survey of Multi-Objective Sequential Decision-Making

02/04/2014
by   Diederik Marijn Roijers, et al.
0

Sequential decision-making problems with multiple objectives arise naturally in practice and pose unique challenges for research in decision-theoretic planning and learning, which has largely focused on single-objective settings. This article surveys algorithms designed for sequential decision-making problems with multiple objectives. Though there is a growing body of literature on this subject, little of it makes explicit under what circumstances special methods are needed to solve multi-objective problems. Therefore, we identify three distinct scenarios in which converting such a problem to a single-objective one is impossible, infeasible, or undesirable. Furthermore, we propose a taxonomy that classifies multi-objective methods according to the applicable scenario, the nature of the scalarization function (which projects multi-objective values to scalar ones), and the type of policies considered. We show how these factors determine the nature of an optimal solution, which can be a single policy, a convex hull, or a Pareto front. Using this taxonomy, we survey the literature on multi-objective methods for planning and learning. Finally, we discuss key applications of such methods and outline opportunities for future work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/17/2021

A Practical Guide to Multi-Objective Reinforcement Learning and Planning

Real-world decision-making tasks are generally complex, requiring trade-...
research
09/06/2019

Multi-Objective Multi-Agent Decision Making: A Utility-based Analysis and Survey

The majority of multi-agent system (MAS) implementations aim to optimise...
research
08/08/2022

Improving performance in multi-objective decision-making in Bottles environments with soft maximin approaches

Balancing multiple competing and conflicting objectives is an essential ...
research
07/02/2009

Strategic Positioning in Tactical Scenario Planning

Capability planning problems are pervasive throughout many areas of huma...
research
05/31/2020

Global Convergence of MAML for LQR

The paper studies the performance of the Model-Agnostic Meta-Learning (M...
research
03/13/2023

Revealed Multi-Objective Utility Aggregation in Human Driving

A central design problem in game theoretic analysis is the estimation of...
research
05/15/2014

Multi-Criteria Optimal Planning for Energy Policies in CLP

In the policy making process a number of disparate and diverse issues su...

Please sign up or login with your details

Forgot password? Click here to reset