Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning

03/15/2023
by   Takuya Kanazawa, et al.
0

Sequential decision making in the real world often requires finding a good balance of conflicting objectives. In general, there exist a plethora of Pareto-optimal policies that embody different patterns of compromises between objectives, and it is technically challenging to obtain them exhaustively using deep neural networks. In this work, we propose a novel multi-objective reinforcement learning (MORL) algorithm that trains a single neural network via policy gradient to approximately obtain the entire Pareto set in a single run of training, without relying on linear scalarization of objectives. The proposed method works in both continuous and discrete action spaces with no design change of the policy network. Numerical experiments in benchmark environments demonstrate the practicality and efficacy of our approach in comparison to standard MORL baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/11/2022

gTLO: A Generalized and Non-linear Multi-Objective Deep Reinforcement Learning Approach

In real-world decision optimization, often multiple competing objectives...
research
08/16/2022

PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm

Many real-world problems involve multiple, possibly conflicting, objecti...
research
06/13/2014

Multi-objective Reinforcement Learning with Continuous Pareto Frontier Approximation Supplementary Material

This document contains supplementary material for the paper "Multi-objec...
research
04/11/2022

Pareto Conditioned Networks

In multi-objective optimization, learning all the policies that reach Pa...
research
11/08/2018

Meta-Learning for Multi-objective Reinforcement Learning

Multi-objective reinforcement learning (MORL) is the generalization of s...
research
09/15/2022

Multi-Objective Policy Gradients with Topological Constraints

Multi-objective optimization models that encode ordered sequential const...
research
10/10/2021

Multi-condition multi-objective optimization using deep reinforcement learning

A multi-condition multi-objective optimization method that can find Pare...

Please sign up or login with your details

Forgot password? Click here to reset