Gaussian linear approximation for the estimation of the Shapley effects

by Baptiste Broto, et al.

In this paper, we address the estimation of the sensitivity indices called "Shapley effects". These sensitivity indices can handle dependent input variables. The Shapley effects are generally difficult to estimate, but they are easily computable in the Gaussian linear framework. The aim of this work is to use the values of the Shapley effects in an approximated Gaussian linear framework as estimators of the true Shapley effects corresponding to a non-linear model. First, we assume that the input variables are Gaussian with small variances. We provide rates of convergence of the estimated Shapley effects to the true Shapley effects. Then, we focus on the case where the inputs are given by a non-Gaussian empirical mean. We prove that, under some mild assumptions, when the number of terms in the empirical mean increases, the difference between the true Shapley effects and the estimated Shapley effects given by the Gaussian linear approximation converges to 0. Our theoretical results are supported by numerical studies, showing that the Gaussian linear approximation is accurate and significantly decreases the computational time.
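To illustrate why the Gaussian linear framework is convenient, here is a minimal sketch, not the authors' implementation. It assumes a model Y = beta^T X with X Gaussian and an invertible covariance matrix Sigma, in which case Var(E[Y | X_u]) = Cov(Y, X_u) Sigma_uu^{-1} Cov(X_u, Y), and it plugs these closed variances into the standard Shapley value formula. The function names `closed_variance` and `shapley_effects_gaussian_linear` are hypothetical, and the brute-force enumeration of subsets is only practical for a small number of inputs.

```python
from itertools import combinations
from math import comb

import numpy as np


def closed_variance(beta, sigma, subset):
    """Var(E[Y | X_u]) for Y = beta^T X, X ~ N(mu, sigma), u = subset."""
    if not subset:
        return 0.0
    idx = list(subset)
    # Cov(X_u, Y) is the restriction of sigma @ beta to the indices in u.
    cov_xy = (sigma @ beta)[idx]
    # Gaussian conditioning: Cov(Y, X_u) Sigma_uu^{-1} Cov(X_u, Y).
    return float(cov_xy @ np.linalg.solve(sigma[np.ix_(idx, idx)], cov_xy))


def shapley_effects_gaussian_linear(beta, sigma):
    """Shapley effects of each input, by exhaustive enumeration of subsets."""
    beta = np.asarray(beta, dtype=float)
    sigma = np.asarray(sigma, dtype=float)
    p = beta.size
    var_y = float(beta @ sigma @ beta)  # Var(Y) = beta^T Sigma beta
    effects = np.zeros(p)
    for i in range(p):
        others = [j for j in range(p) if j != i]
        for size in range(p):
            # Shapley weight for subsets u of size `size` not containing i.
            weight = 1.0 / (p * comb(p - 1, size))
            for u in combinations(others, size):
                gain = (closed_variance(beta, sigma, u + (i,))
                        - closed_variance(beta, sigma, u))
                effects[i] += weight * gain
    return effects / var_y


if __name__ == "__main__":
    beta = [1.0, 1.0, 0.5]
    sigma = [[1.0, 0.5, 0.2],
             [0.5, 2.0, 0.3],
             [0.2, 0.3, 1.0]]
    print(shapley_effects_gaussian_linear(beta, sigma))
```

The effects sum to 1 by construction, which gives a quick sanity check. To use this as an approximation for a non-linear model, one possible choice (an assumption here, not stated in the abstract) is to take beta as the gradient of the model at the mean of the inputs.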

