Reinforcement Learning Based Dynamic Selection of Auxiliary Objectives with Preserving of the Best Found Solution

04/24/2017
by Irina Petrova, et al.

The efficiency of single-objective optimization can be improved by introducing auxiliary objectives. Ideally, auxiliary objectives should be helpful. In practice, however, an objective may be efficient at some stages of optimization and obstructive at others. In this paper we propose a modification of the EA+RL method, which uses reinforcement learning to dynamically select the objective to be optimized. The proposed modification prevents the loss of the best solution found. We analysed the proposed modification and compared it with the EA+RL method and Random Local Search on the XdivK, Generalized OneMax and LeadingOnes problems. The proposed modification outperforms the EA+RL method on all problem instances. It also outperforms the single-objective approach on most problem instances. We also provide a detailed analysis of how the different components of the considered algorithms influence the efficiency of optimization. In addition, we present a theoretical analysis of the proposed modification on the XdivK problem.
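The idea summarized above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the choice of OneMax and ZeroMax as auxiliary objectives, the use of the target fitness as the agent's state, the reward defined as the change in target fitness of the mutated child, and all parameter values (`alpha`, `gamma`, `epsilon`) are assumptions made for illustration. The function name `rls_with_objective_selection` is hypothetical. The only property taken directly from the abstract is that the best found solution is never lost: a child is accepted only if it is not worse on the target objective.

```python
import random


def one_max(x):
    """Number of one bits."""
    return sum(x)


def zero_max(x):
    """Number of zero bits."""
    return len(x) - sum(x)


def leading_ones(x):
    """Length of the prefix consisting only of ones."""
    count = 0
    for bit in x:
        if bit != 1:
            break
        count += 1
    return count


def x_div_k(x, k):
    """XdivK: number of one bits, integer-divided by k."""
    return sum(x) // k


def rls_with_objective_selection(n, k, max_iters=200_000,
                                 alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Random Local Search on XdivK with Q-learning objective selection.

    Assumed setup (not taken from the paper): actions are the objectives
    [target, OneMax, ZeroMax]; the state is the current target fitness.
    Returns (solution, history of target fitness values).
    """
    rng = random.Random(seed)

    def target(x):
        return x_div_k(x, k)

    objectives = [target, one_max, zero_max]
    q = {}  # maps (state, action index) to an estimated value
    x = [rng.randint(0, 1) for _ in range(n)]
    history = [target(x)]

    for _ in range(max_iters):
        state = target(x)
        if state == n // k:  # optimum reached
            break
        # Epsilon-greedy choice of the objective, ties broken at random.
        if rng.random() < epsilon:
            action = rng.randrange(len(objectives))
        else:
            values = [q.get((state, a), 0.0) for a in range(len(objectives))]
            top = max(values)
            action = rng.choice([a for a, v in enumerate(values) if v == top])
        # RLS mutation: flip exactly one randomly chosen bit.
        child = x[:]
        child[rng.randrange(n)] ^= 1
        # Assumption: the reward is the child's change in target fitness,
        # so the agent also learns from children that are rejected.
        reward = target(child) - target(x)
        # Accept only if the child is not worse on the selected objective
        # AND not worse on the target, so the best solution is preserved.
        selected = objectives[action]
        if selected(child) >= selected(x) and target(child) >= target(x):
            x = child
        new_state = target(x)
        best_next = max(q.get((new_state, a), 0.0)
                        for a in range(len(objectives)))
        old = q.get((state, action), 0.0)
        q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
        history.append(new_state)

    return x, history
```

Calling `rls_with_objective_selection(8, 2)` returns the final bit string together with the sequence of target fitness values. Because of the acceptance rule, that sequence never decreases, which is the preservation property the abstract refers to.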

research
06/15/2021

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning

Many advances that have improved the robustness and efficiency of deep r...
research
06/05/2023

A General Perspective on Objectives of Reinforcement Learning

In this lecture, we present a general perspective on reinforcement learn...
research
10/12/2022

Reinforcement Learning with Automated Auxiliary Loss Search

A good state representation is crucial to solving complicated reinforcem...
research
05/24/2019

Rethinking Expected Cumulative Reward Formalism of Reinforcement Learning: A Micro-Objective Perspective

The standard reinforcement learning (RL) formulation considers the expec...
research
07/17/2017

Auxiliary Objectives for Neural Error Detection Models

We investigate the utility of different auxiliary objectives and trainin...
research
10/07/2021

Generalization in Deep RL for TSP Problems via Equivariance and Local Search

Deep reinforcement learning (RL) has proved to be a competitive heuristi...
research
06/20/2022

EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL

Reinforcement learning (RL) in long horizon and sparse reward tasks is n...
