Bort: Towards Explainable Neural Networks with Bounded Orthogonal Constraint

12/18/2022
by   Borui Zhang, et al.
0

Deep learning has revolutionized human society, yet the black-box nature of deep neural networks hinders further application to reliability-demanded industries. In the attempt to unpack them, many works observe or impact internal variables to improve the model's comprehensibility and transparency. However, existing methods rely on intuitive assumptions and lack mathematical guarantees. To bridge this gap, we introduce Bort, an optimizer for improving model explainability with boundedness and orthogonality constraints on model parameters, derived from the sufficient conditions of model comprehensibility and transparency. We perform reconstruction and backtracking on the model representations optimized by Bort and observe an evident improvement in model explainability. Based on Bort, we are able to synthesize explainable adversarial samples without additional parameters and training. Surprisingly, we find Bort constantly improves the classification accuracy of various architectures including ResNet and DeiT on MNIST, CIFAR-10, and ImageNet.

READ FULL TEXT

page 1

page 8

page 9

page 18

page 19

research
02/12/2020

Explainable Deep Modeling of Tabular Data using TableGraphNet

The vast majority of research on explainability focuses on post-explaina...
research
03/24/2022

Explainable Artificial Intelligence for Exhaust Gas Temperature of Turbofan Engines

Data-driven modeling is an imperative tool in various industrial applica...
research
01/12/2019

Enhancing Explainability of Neural Networks through Architecture Constraints

Prediction accuracy and model explainability are the two most important ...
research
11/27/2022

Foiling Explanations in Deep Neural Networks

Deep neural networks (DNNs) have greatly impacted numerous fields over t...
research
10/07/2022

Utilizing Explainable AI for improving the Performance of Neural Networks

Nowadays, deep neural networks are widely used in a variety of fields th...
research
09/15/2022

Visual Recognition with Deep Nearest Centroids

We devise deep nearest centroids (DNC), a conceptually elegant yet surpr...
research
09/01/2021

Towards Learning a Vocabulary of Visual Concepts and Operators using Deep Neural Networks

Deep neural networks have become the default choice for many application...

Please sign up or login with your details

Forgot password? Click here to reset