Am I Building a White Box Agent or Interpreting a Black Box Agent?

07/02/2020
by   Tom Bewley, et al.
2

The rule extraction literature contains the notion of a fidelity-accuracy dilemma: when building an interpretable model of a black box function, optimising for fidelity is likely to reduce performance on the underlying task, and vice versa. We reassert the relevance of this dilemma for the modern field of explainable artificial intelligence, and highlight how it is compounded when the black box is an agent interacting with a dynamic environment. We then discuss two independent research directions - building white box agents and interpreting black box agents - which are both coherent and worthy of attention, but must not be conflated by researchers embarking on projects in the domain of agent interpretability.

READ FULL TEXT
research
09/10/2020

TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments

In explainable artificial intelligence, there is increasing interest in ...
research
03/15/2022

One Agent To Rule Them All: Towards Multi-agent Conversational AI

The increasing volume of commercially available conversational agents (C...
research
11/06/2022

ProtoX: Explaining a Reinforcement Learning Agent via Prototyping

While deep reinforcement learning has proven to be successful in solving...
research
06/18/2020

IReEn: Iterative Reverse-Engineering of Black-Box Functions via Neural Program Synthesis

In this work, we investigate the problem of revealing the functionality ...
research
05/20/2019

The Twin-System Approach as One Generic Solution for XAI: An Overview of ANN-CBR Twins for Explaining Deep Learning

The notion of twin systems is proposed to address the eXplainable AI (XA...
research
02/09/2023

Symbolic Metamodels for Interpreting Black-boxes Using Primitive Functions

One approach for interpreting black-box machine learning models is to fi...
research
03/10/2020

Metafeatures-based Rule-Extraction for Classifiers on Behavioral and Textual Data

Machine learning using behavioral and text data can result in highly acc...

Please sign up or login with your details

Forgot password? Click here to reset