ELF OpenGo: An Analysis and Open Reimplementation of AlphaZero

02/12/2019
by   Yuandong Tian, et al.

The AlphaGo, AlphaGo Zero, and AlphaZero series of algorithms are a remarkable demonstration of deep reinforcement learning's capabilities, achieving superhuman performance in the complex game of Go with progressively increasing autonomy. However, many obstacles remain to the research community's understanding and use of these promising approaches. To help resolve open questions and facilitate future research, we propose ELF OpenGo, an open-source reimplementation of the AlphaZero algorithm. ELF OpenGo is the first open-source Go AI to convincingly demonstrate superhuman performance, with a perfect (20:0) record against global top professionals. We apply ELF OpenGo to conduct extensive ablation studies, and to identify and analyze numerous interesting phenomena in both the model training and the gameplay inference procedures. Our code, models, selfplay datasets, and auxiliary data are publicly available.

