Hypothesis Testing and Machine Learning: Interpreting Variable Effects in Deep Artificial Neural Networks using Cohen's f2

02/02/2023
by   Wolfgang Messner, et al.
0

Deep artificial neural networks show high predictive performance in many fields, but they do not afford statistical inferences and their black-box operations are too complicated for humans to comprehend. Because positing that a relationship exists is often more important than prediction in scientific experiments and research models, machine learning is far less frequently used than inferential statistics. Additionally, statistics calls for improving the test of theory by showing the magnitude of the phenomena being studied. This article extends current XAI methods and develops a model agnostic hypothesis testing framework for machine learning. First, Fisher's variable permutation algorithm is tweaked to compute an effect size measure equivalent to Cohen's f2 for OLS regression models. Second, the Mann-Kendall test of monotonicity and the Theil-Sen estimator is applied to Apley's accumulated local effect plots to specify a variable's direction of influence and statistical significance. The usefulness of this approach is demonstrated on an artificial data set and a social survey with a Python sandbox implementation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/29/2019

Interpreting Black Box Models with Statistical Guarantees

While many methods for interpreting machine learning models have been pr...
research
01/26/2023

Permutation-based Hypothesis Testing for Neural Networks

Neural networks are powerful predictive models, but they provide little ...
research
05/28/2021

Generalized Permutation Framework for Testing Model Variable Significance

A common problem in machine learning is determining if a variable signif...
research
10/20/2016

Reasoning with Memory Augmented Neural Networks for Language Comprehension

Hypothesis testing is an important cognitive process that supports human...
research
05/03/2022

A Falsificationist Account of Artificial Neural Networks

Machine learning operates at the intersection of statistics and computer...
research
04/16/2019

Scalable and Efficient Hypothesis Testing with Random Forests

Throughout the last decade, random forests have established themselves a...
research
07/14/2022

From Shapley back to Pearson: Hypothesis Testing via the Shapley Value

Machine learning models, in particular artificial neural networks, are i...

Please sign up or login with your details

Forgot password? Click here to reset