Dota 2 with Large Scale Deep Reinforcement Learning

12/13/2019
by   OpenAI, et al.
13

On April 13th, 2019, OpenAI Five became the first AI system to defeat the world champions at an esports game. The game of Dota 2 presents novel challenges for AI systems such as long time horizons, imperfect information, and complex, continuous state-action spaces, all challenges which will become increasingly central to more capable AI systems. OpenAI Five leveraged existing reinforcement learning techniques, scaled to learn from batches of approximately 2 million frames every 2 seconds. We developed a distributed training system and tools for continual training which allowed us to train OpenAI Five for 10 months. By defeating the Dota 2 world champion (Team OG), OpenAI Five demonstrates that self-play reinforcement learning can achieve superhuman performance on a difficult task.

READ FULL TEXT

page 36

page 37

page 39

page 41

research
11/25/2020

Towards Playing Full MOBA Games with Deep Reinforcement Learning

MOBA games, e.g., Honor of Kings, League of Legends, and Dota 2, pose gr...
research
02/25/2022

Building a 3-Player Mahjong AI using Deep Reinforcement Learning

Mahjong is a popular multi-player imperfect-information game developed i...
research
02/15/2021

ScrofaZero: Mastering Trick-taking Poker Game Gongzhu by Deep Reinforcement Learning

People have made remarkable progress in game AIs, especially in domain o...
research
11/08/2018

Modular Architecture for StarCraft II with Deep Reinforcement Learning

We present a novel modular architecture for StarCraft II AI. The archite...
research
07/05/2021

Winning at Any Cost – Infringing the Cartel Prohibition With Reinforcement Learning

Pricing decisions are increasingly made by AI. Thanks to their ability t...
research
10/20/2021

Playing 2048 With Reinforcement Learning

The game of 2048 is a highly addictive game. It is easy to learn the gam...
research
05/25/2023

Lucy-SKG: Learning to Play Rocket League Efficiently Using Deep Reinforcement Learning

A successful tactic that is followed by the scientific community for adv...

Please sign up or login with your details

Forgot password? Click here to reset