Ranking variables and interactions using predictive uncertainty measures

10/17/2019
by   Topi Paananen, et al.
29

For complex nonlinear supervised learning models, assessing the relevance of input variables or their interactions is not straightforward due to the lack of a direct measure of relevance, such as the regression coefficients in generalized linear models. One can assess the relevance of input variables locally by using the mean prediction or its derivative, but this disregards the predictive uncertainty. In this work, we present a Bayesian method for identifying relevant input variables with main effects and interactions by differentiating the Kullback-Leibler divergence of predictive distributions. The method averages over local measures of relevance and has a conservative property that takes into account the uncertainty in the predictive distribution. Our empirical results on simulated and real data sets with nonlinearities demonstrate accurate and efficient identification of relevant main effects and interactions compared to alternative methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/13/2019

Understanding complex predictive models with Ghost Variables

We propose a procedure for assigning a relevance measure to each explana...
research
09/16/2022

Detection of Interacting Variables for Generalized Linear Models via Neural Networks

The quality of generalized linear models (GLMs), frequently used by insu...
research
12/15/2022

Robustness Evaluation of Regression Tasks with Skewed Domain Preferences

In natural phenomena, data distributions often deviate from normality. O...
research
04/27/2021

On dependent generalized sensitivity indices and asymptotic distributions

In this paper, we propose a novel methodology for better performing unce...
research
01/30/2020

TCMI: a non-parametric mutual-dependence estimator for multivariate continuous distributions

The identification of relevant features, i.e., the driving variables tha...
research
06/21/2020

Learned Uncertainty-Aware (LUNA) Bases for Bayesian Regression using Multi-Headed Auxiliary Networks

Neural Linear Models (NLM) are deep models that produce predictive uncer...
research
04/06/2021

Balancing Predictive Relevance of Ligand Biochemical Activities

In this paper, we present a technique for balancing predictive relevance...

Please sign up or login with your details

Forgot password? Click here to reset