Preference Inference from Demonstration in Multi-objective Multi-agent Decision Making

04/27/2023
by   Junlin Lu, et al.
0

It is challenging to quantify numerical preferences for different objectives in a multi-objective decision-making problem. However, the demonstrations of a user are often accessible. We propose an algorithm to infer linear preference weights from either optimal or near-optimal demonstrations. The algorithm is evaluated in three environments with two baseline methods. Empirical results demonstrate significant improvements compared to the baseline algorithms, in terms of both time requirements and accuracy of the inferred preferences. In future work, we plan to evaluate the algorithm's effectiveness in a multi-agent system, where one of the agents is enabled to infer the preferences of an opponent using our preference inference algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/27/2023

Inferring Preferences from Demonstrations in Multi-objective Reinforcement Learning: A Dynamic Weight-based Approach

Many decision-making problems feature multiple objectives. In such probl...
research
02/21/2023

Inferring Implicit Trait Preferences for Task Allocation in Heterogeneous Teams

Task allocation in heterogeneous multi-agent teams often requires reason...
research
08/21/2019

A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation

We introduce a new algorithm for multi-objective reinforcement learning ...
research
04/30/2023

Scaling Pareto-Efficient Decision Making Via Offline Multi-Objective RL

The goal of multi-objective reinforcement learning (MORL) is to learn po...
research
09/17/2020

Learnable Strategies for Bilateral Agent Negotiation over Multiple Issues

We present a novel bilateral negotiation model that allows a self-intere...
research
12/18/2015

Learning the Preferences of Ignorant, Inconsistent Agents

An important use of machine learning is to learn what people value. What...
research
09/21/2022

LMI-based Variable Impedance Controller design from User Demonstrations and Preferences

In this paper, we introduce a new off-line method to find suitable param...

Please sign up or login with your details

Forgot password? Click here to reset