Relationship Explainable Multi-objective Optimization Via Vector Value Function Based Reinforcement Learning

10/02/2019
by   Huixin Zhan, et al.

Solving multi-objective optimization problems is important in various applications where users are interested in obtaining optimal policies subject to multiple, often conflicting, objectives. A typical approach is to first construct a loss function based on a scalarization of the individual objectives, and then find the policy that minimizes that loss. However, optimizing the scalarized (and weighted) loss does not guarantee high performance on each of the possibly conflicting objectives. In this paper, we propose a vector value based reinforcement learning approach that seeks to explicitly learn the inter-objective relationship and optimize multiple objectives based on the learned relationship. In particular, the proposed method first defines a relationship matrix, a mathematical representation of the inter-objective relationship, and then creates one actor and multiple critics that co-learn the relationship matrix and action selection. The proposed approach can quantify the inter-objective relationship via reinforcement learning when the impact of one objective on another is unknown a priori. We also provide a rigorous convergence analysis of the proposed approach and present a quantitative evaluation based on two testing scenarios.
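The limitation of scalarization noted in the abstract can be illustrated with a toy example. The sketch below is not the paper's method: the two candidate policies, their per-objective losses, and the equal weights are all hypothetical values chosen only to show that the minimizer of a weighted-sum loss can still perform poorly on one objective.

```python
# Toy illustration (hypothetical numbers, not from the paper):
# minimizing a weighted-sum loss can select a policy that is
# poor on one of the individual objectives.

def scalarized_loss(losses, weights):
    """Weighted sum of per-objective losses."""
    return sum(w * l for w, l in zip(weights, losses))

# Hypothetical candidate policies mapped to (loss on objective 1, loss on objective 2).
candidates = {
    "balanced": (0.5, 0.5),
    "skewed": (0.0, 0.9),
}
weights = (0.5, 0.5)  # assumed equal weighting

# Policy with the smallest scalarized loss.
best = min(candidates, key=lambda name: scalarized_loss(candidates[name], weights))

# Worst per-objective loss of the selected policy versus the alternative.
worst_of_best = max(candidates[best])
worst_of_balanced = max(candidates["balanced"])

print(best, worst_of_best, worst_of_balanced)
```

With these assumed values the weighted sum prefers the skewed policy, even though its loss on the second objective is much higher than that of the balanced one. This is the kind of outcome that motivates modelling the inter-objective relationship explicitly.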


research
09/26/2019

Relationship Explainable Multi-objective Reinforcement Learning with Semantic Explainability Generation

Solving multi-objective optimization problems is important in various ap...
research
11/19/2020

Provable Multi-Objective Reinforcement Learning with Generative Models

Multi-objective reinforcement learning (MORL) is an extension of ordinar...
research
02/08/2023

A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis

Many sequential decision-making problems need optimization of different ...
research
06/02/2021

Expected Scalarised Returns Dominance: A New Solution Concept for Multi-Objective Decision Making

In many real-world scenarios, the utility of a user is derived from the ...
research
09/11/2019

Predicting optimal value functions by interpolating reward functions in scalarized multi-objective reinforcement learning

A common approach for defining a reward function for Multi-objective Rei...
research
05/18/2021

PoBRL: Optimizing Multi-Document Summarization by Blending Reinforcement Learning Policies

We propose a novel reinforcement learning based framework PoBRL for solv...
research
04/22/2019

Measuring and Assessing Latent Variation in Alliance Design and Objectives

The alliance literature is bifurcated between an empirically-driven appr...
