Closed-Form Expressions for Global and Local Interpretation of Tsetlin Machines with Applications to Explaining High-Dimensional Data

07/27/2020
by   Christian D. Blakely, et al.
16

Tsetlin Machines (TMs) capture patterns using conjunctive clauses in propositional logic, thus facilitating interpretation. However, recent TM-based approaches mainly rely on inspecting the full range of clauses individually. Such inspection does not necessarily scale to complex prediction problems that require a large number of clauses. In this paper, we propose closed-form expressions for understanding why a TM model makes a specific prediction (local interpretability). Additionally, the expressions capture the most important features of the model overall (global interpretability). We further introduce expressions for measuring the importance of feature value ranges for continuous features. The expressions are formulated directly from the conjunctive clauses of the TM, making it possible to capture the role of features in real-time, also during the learning process as the model evolves. Additionally, from the closed-form expressions, we derive a novel data clustering algorithm for visualizing high-dimensional data in three dimensions. Finally, we compare our proposed approach against SHAP and state-of-the-art interpretable machine learning techniques. For both classification and regression, our evaluation show correspondence with SHAP as well as competitive prediction accuracy in comparison with XGBoost, Explainable Boosting Machines, and Neural Additive Models.

READ FULL TEXT

page 9

page 12

page 13

page 14

page 15

page 18

page 19

research
01/10/2020

Explaining the Explainer: A First Theoretical Analysis of LIME

Machine learning is used more and more often for sensitive applications,...
research
03/08/2022

Logic-based AI for Interpretable Board Game Winner Prediction with Tsetlin Machine

Hex is a turn-based two-player connection game with a high branching fac...
research
03/25/2020

Boosting Ridge Regression for High Dimensional Data Classification

Ridge regression is a well established regression estimator which can co...
research
07/08/2020

Model-based Clustering using Automatic Differentiation: Confronting Misspecification and High-Dimensional Data

We study two practically important cases of model based clustering using...
research
09/02/2021

Inferring feature importance with uncertainties in high-dimensional data

Estimating feature importance is a significant aspect of explaining data...
research
04/16/2019

Discriminative Regression Machine: A Classifier for High-Dimensional Data or Imbalanced Data

We introduce a discriminative regression approach to supervised classifi...
research
12/25/2020

An analytic physically motivated model of the mammalian cochlea

We develop an analytic model of the mammalian cochlea. We use a mixed ph...

Please sign up or login with your details

Forgot password? Click here to reset