Learn to Interpret Atari Agents

12/29/2018
by   Zhao Yang, et al.
8

Deep Reinforcement Learning (DeepRL) models surpass human-level performance in a multitude of tasks. Standing in stark contrast to the stellar performance is the obscure nature of the learned policies. The direct mapping from states to actions makes it hard to interpret the rationale behind the decision making of agents. In contrast to previous a-posteriori methods of visualising DeepRL policies, we propose an end-to-end trainable framework based on Rainbow, a representative Deep Q-Network (DQN) agent. Our method automatically detects important regions in the input domain, which enables characterization of general strategy and explanation for non-intuitive behaviors. Hence, we call it Region Sensitive Rainbow (RS-Rainbow). RS-Rainbow utilises a simple yet effective mechanism to incorporate innate visualisation ability into the learning model, not only improving the interpretability, but enabling the agent to leverage enhanced state representations for improved performance. Without extra supervision, specialised feature detectors focusing on distinct aspects of gameplay can be learned. Extensive experiments on the challenging platform of Atari 2600 demonstrates the superiority of RS-Rainbow. In particular, our agent achieves state of the art at just 25 parallel training.

READ FULL TEXT

page 3

page 4

page 6

page 7

page 8

research
11/03/2019

Online Robustness Training for Deep Reinforcement Learning

In deep reinforcement learning (RL), adversarial attacks can trick an ag...
research
10/23/2020

Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning

Priority dispatching rule (PDR) is widely used for solving real-world Jo...
research
08/13/2019

Competitive Multi-Agent Deep Reinforcement Learning with Counterfactual Thinking

Counterfactual thinking describes a psychological phenomenon that people...
research
09/17/2022

Intrinsically Motivated Reinforcement Learning based Recommendation with Counterfactual Data Augmentation

Deep reinforcement learning (DRL) has been proven its efficiency in capt...
research
12/11/2019

Jason-RS, a Collaboration between Agents and an IoT Platform

In this article we start from the observation that REST services are the...
research
12/20/2022

A Comparison Between Tsetlin Machines and Deep Neural Networks in the Context of Recommendation Systems

Recommendation Systems (RSs) are ubiquitous in modern society and are on...

Please sign up or login with your details

Forgot password? Click here to reset