Multi-agent Dynamic Algorithm Configuration

10/13/2022
by   Ke Xue, et al.
0

Automated algorithm configuration relieves users from tedious, trial-and-error tuning tasks. A popular algorithm configuration tuning paradigm is dynamic algorithm configuration (DAC), in which an agent learns dynamic configuration policies across instances by reinforcement learning (RL). However, in many complex algorithms, there may exist different types of configuration hyperparameters, and such heterogeneity may bring difficulties for classic DAC which uses a single-agent RL policy. In this paper, we aim to address this issue and propose multi-agent DAC (MA-DAC), with one agent working for one type of configuration hyperparameter. MA-DAC formulates the dynamic configuration of a complex algorithm with multiple types of hyperparameters as a contextual multi-agent Markov decision process and solves it by a cooperative multi-agent RL (MARL) algorithm. To instantiate, we apply MA-DAC to a well-known optimization algorithm for multi-objective optimization problems. Experimental results show the effectiveness of MA-DAC in not only achieving superior performance compared with other configuration tuning approaches based on heuristic rules, multi-armed bandits, and single-agent RL, but also being capable of generalizing to different problem classes. Furthermore, we release the environments in this paper as a benchmark for testing MARL algorithms, with the hope of facilitating the application of MARL.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/28/2019

Multi-Agent Deep Reinforcement Learning with Adaptive Policies

We propose a novel approach to address one aspect of the non-stationarit...
research
03/13/2023

Discovering Multiple Algorithm Configurations

Many practitioners in robotics regularly depend on classic, hand-designe...
research
03/02/2021

The Surprising Effectiveness of MAPPO in Cooperative, Multi-Agent Games

Proximal Policy Optimization (PPO) is a popular on-policy reinforcement ...
research
04/12/2021

A coevolutionary approach to deep multi-agent reinforcement learning

Traditionally, Deep Artificial Neural Networks (DNN's) are trained throu...
research
06/15/2023

Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization

Offline reinforcement learning (RL) that learns policies from offline da...
research
06/05/2021

Dynamic Resource Configuration for Low-Power IoT Networks: A Multi-Objective Reinforcement Learning Method

Considering grant-free transmissions in low-power IoT networks with unkn...
research
06/18/2019

Towards White-box Benchmarks for Algorithm Control

The performance of many algorithms in the fields of hard combinatorial p...

Please sign up or login with your details

Forgot password? Click here to reset