Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents

04/02/2019
by   Christian Rupprecht, et al.
0

As deep reinforcement learning driven by visual perception becomes more widely used there is a growing need to better understand and probe the learned agents. Understanding the decision making process and its relationship to visual inputs can be very valuable to identify problems in learned behavior. However, this topic has been relatively under-explored in the research community. In this work we present a method for synthesizing visual inputs of interest for a trained agent. Such inputs or states could be situations in which specific actions are necessary. Further, critical states in which a very high or a very low reward can be achieved are often interesting to understand the situational awareness of the system as they can correspond to risky states. To this end, we learn a generative model over the state space of the environment and use its latent space to optimize a target function for the state of interest. In our experiments we show that this method can generate insights for a variety of environments and reinforcement learning methods. We explore results in the standard Atari benchmark games as well as in an autonomous driving simulator. Based on the efficiency with which we have been able to identify behavioural weaknesses with this technique, we believe this general approach could serve as an important tool for AI safety applications.

READ FULL TEXT

page 4

page 5

page 6

page 8

page 11

page 12

page 13

page 14

research
08/27/2021

WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving

Urban autonomous driving is an open and challenging problem to solve as ...
research
06/15/2023

Semantic HELM: An Interpretable Memory for Reinforcement Learning

Reinforcement learning agents deployed in the real world often have to c...
research
02/27/2020

Training Adversarial Agents to Exploit Weaknesses in Deep Control Policies

Deep learning has become an increasingly common technique for various co...
research
01/23/2020

Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning

Unlike popular modularized framework, end-to-end autonomous driving seek...
research
05/10/2019

Do Autonomous Agents Benefit from Hearing?

Mapping states to actions in deep reinforcement learning is mainly based...
research
04/07/2020

How Do You Act? An Empirical Study to Understand Behavior of Deep Reinforcement Learning Agents

The demand for more transparency of decision-making processes of deep re...
research
02/02/2021

Towards a reinforcement learning de novo genome assembler

The use of reinforcement learning has proven to be very promising for so...

Please sign up or login with your details

Forgot password? Click here to reset