Adversarial Policies Beat Professional-Level Go AIs

11/01/2022
by   Tony Tong Wang, et al.
0

We attack the state-of-the-art Go-playing AI system, KataGo, by training an adversarial policy that plays against a frozen KataGo victim. Our attack achieves a >99 when KataGo uses enough search to be near-superhuman. To the best of our knowledge, this is the first successful end-to-end attack against a Go AI playing at the level of a top human professional. Notably, the adversary does not win by learning to play Go better than KataGo – in fact, the adversary is easily beaten by human amateurs. Instead, the adversary wins by tricking KataGo into ending the game prematurely at a point that is favorable to the adversary. Our results demonstrate that even professional-level AI systems may harbor surprising failure modes. See https://goattack.alignmentfund.org/ for example games.

READ FULL TEXT

page 2

page 18

page 19

page 20

research
11/25/2020

Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings

We present JueWu-SL, the first supervised-learning-based artificial inte...
research
10/06/2020

Human-Level Performance in No-Press Diplomacy via Equilibrium Search

Prior AI breakthroughs in complex games have focused on either the purel...
research
09/19/2018

TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game

Starcraft II (SCII) is widely considered as the most challenging Real Ti...
research
12/22/2022

Adversarial Machine Learning and Defense Game for NextG Signal Classification with Deep Learning

This paper presents a game-theoretic framework to study the interactions...
research
11/03/2022

The ProfessionAl Go annotation datasEt (PAGE)

The game of Go has been highly under-researched due to the lack of game ...
research
08/20/2019

Playing magic tricks to deep neural networks untangles human deception

Magic is the art of producing in the spectator an illusion of impossibil...
research
09/05/2019

The Impact of Complex and Informed Adversarial Behavior in Graphical Coordination Games

How does system-level information impact the ability of an adversary to ...

Please sign up or login with your details

Forgot password? Click here to reset