Unwrapping The Black Box of Deep ReLU Networks: Interpretability, Diagnostics, and Simplification

11/08/2020
by   Agus Sudjianto, et al.
55

The deep neural networks (DNNs) have achieved great success in learning complex patterns with strong predictive power, but they are often thought of as "black box" models without a sufficient level of transparency and interpretability. It is important to demystify the DNNs with rigorous mathematics and practical tools, especially when they are used for mission-critical applications. This paper aims to unwrap the black box of deep ReLU networks through local linear representation, which utilizes the activation pattern and disentangles the complex network into an equivalent set of local linear models (LLMs). We develop a convenient LLM-based toolkit for interpretability, diagnostics, and simplification of a pre-trained deep ReLU network. We propose the local linear profile plot and other visualization methods for interpretation and diagnostics, and an effective merging strategy for network simplification. The proposed methods are demonstrated by simulation examples, benchmark datasets, and a real case study in home lending credit risk assessment.

READ FULL TEXT

page 7

page 14

page 15

page 17

page 24

page 31

research
11/02/2021

Designing Inherently Interpretable Machine Learning Models

Interpretable machine learning (IML) becomes increasingly important in h...
research
09/23/2019

Model-Agnostic Linear Competitors – When Interpretable Models Compete and Collaborate with Black-Box Models

Driven by an increasing need for model interpretability, interpretable m...
research
10/13/2021

Clustering-Based Interpretation of Deep ReLU Network

Amongst others, the adoption of Rectified Linear Units (ReLUs) is regard...
research
10/19/2020

A Framework to Learn with Interpretation

With increasingly widespread use of deep neural networks in critical dec...
research
05/05/2016

Not Just a Black Box: Learning Important Features Through Propagating Activation Differences

Note: This paper describes an older version of DeepLIFT. See https://arx...
research
05/18/2021

Self-interpretable Convolutional Neural Networks for Text Classification

Deep learning models for natural language processing (NLP) are inherentl...
research
06/01/2023

Learning Prescriptive ReLU Networks

We study the problem of learning optimal policy from a set of discrete t...

Please sign up or login with your details

Forgot password? Click here to reset