Improving Model and Search for Computer Go

02/06/2021
by   Tristan Cazenave, et al.
0

The standard for Deep Reinforcement Learning in games, following Alpha Zero, is to use residual networks and to increase the depth of the network to get better results. We propose to improve mobile networks as an alternative to residual networks and experimentally show the playing strength of the networks according to both their width and their depth. We also propose a generalization of the PUCT search algorithm that improves on PUCT.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/23/2020

Mobile Networks for Computer Go

The architecture of the neural networks used in Deep Reinforcement Learn...
research
08/30/2018

ExpIt-OOS: Towards Learning from Planning in Imperfect Information Games

The current state of the art in playing many important perfect informati...
research
08/30/2018

ExIt-OOS: Towards Learning from Planning in Imperfect Information Games

The current state of the art in playing many important perfect informati...
research
10/07/2019

Algorithm-Dependent Generalization Bounds for Overparameterized Deep Residual Networks

The skip-connections used in residual networks have become a standard ar...
research
03/26/2023

Exploring Novel Quality Diversity Methods For Generalization in Reinforcement Learning

The Reinforcement Learning field is strong on achievements and weak on r...
research
05/03/2019

Deep Residual Reinforcement Learning

We revisit residual algorithms in both model-free and model-based reinfo...
research
09/28/2018

Depth Reconstruction of Translucent Objects from a Single Time-of-Flight Camera using Deep Residual Networks

We propose a novel approach to recovering the translucent objects from a...

Please sign up or login with your details

Forgot password? Click here to reset