Agent Modeling as Auxiliary Task for Deep Reinforcement Learning

07/22/2019
by   Pablo Hernandez-Leal, et al.
0

In this paper we explore how actor-critic methods in deep reinforcement learning, in particular Asynchronous Advantage Actor-Critic (A3C), can be extended with agent modeling. Inspired by recent works on representation learning and multiagent deep reinforcement learning, we propose two architectures to perform agent modeling: the first one based on parameter sharing, and the second one based on agent policy features. Both architectures aim to learn other agents' policies as auxiliary tasks, besides the standard actor (policy) and critic (values). We performed experiments in both cooperative and competitive domains. The former is a problem of coordinated multiagent object transportation and the latter is a two-player mini version of the Pommerman game. Our results show that the proposed architectures stabilize learning and outperform the standard A3C architecture when learning a best response in terms of expected rewards.

READ FULL TEXT
research
07/24/2019

Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning

Deep reinforcement learning has achieved great successes in recent years...
research
07/22/2018

Asynchronous Advantage Actor-Critic Agent for Starcraft II

Deep reinforcement learning, and especially the Asynchronous Advantage A...
research
02/10/2022

DDA3C: Cooperative Distributed Deep Reinforcement Learning in A Group-Agent System

It can largely benefit the reinforcement learning process of each agent ...
research
09/28/2021

Exploring More When It Needs in Deep Reinforcement Learning

We propose a exploration mechanism of policy in Deep Reinforcement Learn...
research
12/22/2021

Alpha-Mini: Minichess Agent with Deep Reinforcement Learning

We train an agent to compete in the game of Gardner minichess, a downsiz...
research
10/13/2021

Next-Best-View Estimation based on Deep Reinforcement Learning for Active Object Classification

The presentation and analysis of image data from a single viewpoint are ...
research
10/23/2019

Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning

Reinforcement Learning enables to train an agent via interaction with th...

Please sign up or login with your details

Forgot password? Click here to reset