Competitive Multi-Agent Deep Reinforcement Learning with Counterfactual Thinking

08/13/2019
by Yue Wang, et al.

Counterfactual thinking describes a psychological phenomenon in which people re-infer the possible outcomes of alternative solutions to events that have already happened. It helps people learn more from their mistakes and thus perform better in similar future tasks. This paper investigates counterfactual thinking for agents seeking optimal decision-making strategies in multi-agent reinforcement learning environments. In particular, we propose a multi-agent deep reinforcement learning model whose structure mimics the human psychological process of counterfactual thinking in order to improve the competitive abilities of agents. To this end, our model generates several possible actions (intent actions) with a parallel policy structure and estimates the rewards and regrets of these intent actions based on its current understanding of the environment. Our model incorporates a scenario-based framework to link the estimated regrets with its inner policies. During the iterations, our model simultaneously updates the parallel policies and the corresponding scenario-based regrets for agents. To verify the effectiveness of our proposed model, we conduct extensive experiments on two different environments with real-world applications. Experimental results show that counterfactual thinking helps agents obtain more cumulative reward than their opponents in environments with fair information, while maintaining high efficiency.
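
The idea of scoring several intent actions by estimated reward and regret can be illustrated with a toy sketch. This is not the model proposed in the paper: it assumes, purely for illustration, that the regret of an intent action is the gap between the best estimated reward and its own estimated reward, and that an action is then chosen with a softmax over negative regrets. The function names, the temperature parameter, and the reward values are all hypothetical.

```python
import math


def estimate_regrets(estimated_rewards):
    """Assumed regret definition: gap between the best estimated reward
    and the estimated reward of each intent action."""
    best = max(estimated_rewards)
    return [best - r for r in estimated_rewards]


def selection_probs(regrets, temperature=1.0):
    """Softmax over negative regrets, so low-regret intent actions
    receive more probability mass (an illustrative choice)."""
    logits = [-g / temperature for g in regrets]
    shift = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - shift) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical estimated rewards for three intent actions, as if each
# came from one branch of a parallel policy structure.
estimated_rewards = [1.0, 3.0, 2.0]
regrets = estimate_regrets(estimated_rewards)
probs = selection_probs(regrets)

print("regrets:", regrets)
print("selection probabilities:", [round(p, 3) for p in probs])
```

In this sketch the second intent action has zero regret and therefore the highest selection probability; a learned model would instead update both the policies and the regret estimates during training.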

research
09/20/2021

Promoting Coordination Through Electing First-move Agent in Multi-Agent Reinforcement Learning

Learning to coordinate among multiple agents is an essential problem in ...
research
04/03/2019

Robust Multi-agent Counterfactual Prediction

We consider the problem of using logged data to make predictions about w...
research
04/01/2020

Counterfactual Multi-Agent Reinforcement Learning with Graph Convolution Communication

We consider a fully cooperative multi-agent system where agents cooperat...
research
09/27/2019

Counterfactual States for Atari Agents via Generative Deep Learning

Although deep reinforcement learning agents have produced impressive res...
research
12/29/2018

Learn to Interpret Atari Agents

Deep Reinforcement Learning (DeepRL) models surpass human-level performa...
research
05/19/2023

Counterfactual Fairness Filter for Fair-Delay Multi-Robot Navigation

Multi-robot navigation is the task of finding trajectories for a team of...
research
06/23/2021

Evolving Hierarchical Memory-Prediction Machines in Multi-Task Reinforcement Learning

A fundamental aspect of behaviour is the ability to encode salient featu...
