To be a fast adaptive learner: using game history to defeat opponents

05/17/2021
by   Guangzhao Cheng, et al.
0

In many real-world games, such as traders repeatedly bargaining with customers, it is very hard for a single AI trader to make good deals with various customers in a few turns, since customers may adopt different strategies even the strategies they choose are quite simple. In this paper, we model this problem as fast adaptive learning in the finitely repeated games. We believe that past game history plays a vital role in such a learning procedure, and therefore we propose a novel framework (named, F3) to fuse the past and current game history with an Opponent Action Estimator (OAE) module that uses past game history to estimate the opponent's future behaviors. The experiments show that the agent trained by F3 can quickly defeat opponents who adopt unknown new strategies. The F3 trained agent obtains more rewards in a fixed number of turns than the agents that are trained by deep reinforcement learning. Further studies show that the OAE module in F3 contains meta-knowledge that can even be transferred across different games.

READ FULL TEXT
research
01/23/2019

Hierarchical Reinforcement Learning for Multi-agent MOBA Game

Although deep reinforcement learning has achieved great success recently...
research
04/08/2019

Creating Pro-Level AI for Real-Time Fighting Game with Deep Reinforcement Learning

Reinforcement learning combined with deep neural networks has performed ...
research
05/10/2022

On the Verge of Solving Rocket League using Deep Reinforcement Learning and Sim-to-sim Transfer

Autonomously trained agents that are supposed to play video games reason...
research
08/30/2021

Learning Meta Representations for Agents in Multi-Agent Reinforcement Learning

In multi-agent reinforcement learning, the behaviors that agents learn i...
research
02/19/2020

How To Avoid Being Eaten By a Grue: Exploration Strategies for Text-Adventure Agents

Text-based games – in which an agent interacts with the world through te...
research
07/05/2022

CEN : Cooperatively Evolving Networks

A finitely repeated game is a dynamic game in which a simultaneous game ...
research
05/21/2023

ToxBuster: In-game Chat Toxicity Buster with BERT

Detecting toxicity in online spaces is challenging and an ever more pres...

Please sign up or login with your details

Forgot password? Click here to reset