Understanding Game-Playing Agents with Natural Language Annotations

04/15/2022
by   Nicholas Tomlin, et al.
4

We present a new dataset containing 10K human-annotated games of Go and show how these natural language annotations can be used as a tool for model interpretability. Given a board state and its associated comment, our approach uses linear probing to predict mentions of domain-specific terms (e.g., ko, atari) from the intermediate state representations of game-playing agents like AlphaGo Zero. We find these game concepts are nontrivially encoded in two distinct policy networks, one trained via imitation learning and another trained via reinforcement learning. Furthermore, mentions of domain-specific terms are most easily predicted from the later layers of both models, suggesting that these policy networks encode high-level abstractions similar to those used in the natural language annotations.

READ FULL TEXT

page 8

page 10

page 11

research
11/26/2022

Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex

AlphaZero, an approach to reinforcement learning that couples neural net...
research
08/29/2018

Decoupling Strategy and Generation in Negotiation Dialogues

We consider negotiation settings in which two agents use natural languag...
research
10/02/2019

Natural Language State Representation for Reinforcement Learning

Recent advances in Reinforcement Learning have highlighted the difficult...
research
11/10/2018

Playing by the Book: Towards Agent-based Narrative Understanding through Role-playing and Simulation

Understanding procedural text requires tracking entities, actions and ef...
research
06/28/2019

No-boarding buses: Agents allowed to cooperate or defect

We study a bus system with a no-boarding policy, where a "slow" bus may ...
research
12/04/2015

Reuse of Neural Modules for General Video Game Playing

A general approach to knowledge transfer is introduced in which an agent...
research
05/02/2023

FIREBALL: A Dataset of Dungeons and Dragons Actual-Play with Structured Game State Information

Dungeons Dragons (D D) is a tabletop roleplaying game with complex...

Please sign up or login with your details

Forgot password? Click here to reset