PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm

08/16/2022
by   Toygun Basaklar, et al.
0

Many real-world problems involve multiple, possibly conflicting, objectives. Multi-objective reinforcement learning (MORL) approaches have emerged to tackle these problems by maximizing a joint objective function weighted by a preference vector. These approaches find fixed customized policies corresponding to preference vectors specified during training. However, the design constraints and objectives typically change dynamically in real-life scenarios. Furthermore, storing a policy for each potential preference is not scalable. Hence, obtaining a set of Pareto front solutions for the entire preference space in a given domain with a single training is critical. To this end, we propose a novel MORL algorithm that trains a single universal network to cover the entire preference space. The proposed approach, Preference-Driven MORL (PD-MORL), utilizes the preferences as guidance to update the network parameters. After demonstrating PD-MORL using classical Deep Sea Treasure and Fruit Tree Navigation benchmarks, we evaluate its performance on challenging multi-objective continuous control tasks.

READ FULL TEXT
research
03/15/2023

Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning

Sequential decision making in the real world often requires finding a go...
research
01/04/2017

Estimating Quality in Multi-Objective Bandits Optimization

Many real-world applications are characterized by a number of conflictin...
research
10/27/2011

User preference extraction using dynamic query sliders in conjunction with UPS-EMO algorithm

One drawback of evolutionary multiobjective optimization algorithms (EMO...
research
02/12/2019

Multi-objective Bayesian optimisation with preferences over objectives

We present a Bayesian multi-objective optimisation algorithm that allows...
research
11/17/2017

Addressing Expensive Multi-objective Games with Postponed Preference Articulation via Memetic Co-evolution

This paper presents algorithmic and empirical contributions demonstratin...
research
03/02/2023

Reinforcement Learning Guided Multi-Objective Exam Paper Generation

To reduce the repetitive and complex work of instructors, exam paper gen...
research
06/16/2023

Fairness in Preference-based Reinforcement Learning

In this paper, we address the issue of fairness in preference-based rein...

Please sign up or login with your details

Forgot password? Click here to reset