Sparse dimension reduction based on energy and ball statistics

12/12/2020
by Emmanuel Jordy Menvouta, et al.

As its name suggests, sufficient dimension reduction (SDR) aims to estimate, from data, a subspace that contains all the information needed to explain a dependent variable. Many approaches to SDR exist, and some of the most recent rely on minimal to no model assumptions. These are defined through an optimization criterion that maximizes a nonparametric measure of association. The original estimators are nonsparse, meaning that all variables contribute to the model. In many practical applications, however, a sparse SDR technique may be called for, one that intrinsically performs sufficient variable selection (SVS). This paper examines how such a sparse SDR estimator can be constructed. Three variants are investigated, each based on a different measure of association: distance covariance, martingale difference divergence, and ball covariance. A simulation study shows that each of these estimators can achieve correct variable selection in highly nonlinear contexts, yet each is sensitive to outliers and computationally intensive. The study sheds light on the subtle differences between the methods. Two examples illustrate how these new estimators can be applied in practice, with a slight preference for the option based on martingale difference divergence in the bioinformatics example.
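
To make the idea of a criterion built on a nonparametric measure of association more concrete, the following is a minimal Python sketch of one such measure, the squared sample distance covariance, evaluated on data projected onto a candidate direction. It is an illustration only and not the estimator from the paper: the function names (dcov2, projected_dcov2), the toy data, and the comparison of two hand-picked sparse directions are assumptions made for this example, and no sparsity penalty or optimization routine is included.

```python
import numpy as np


def _centered_distances(z):
    """Double-centered Euclidean distance matrix of the rows of z."""
    z = np.asarray(z, dtype=float)
    if z.ndim == 1:
        z = z[:, None]
    diff = z[:, None, :] - z[None, :, :]
    d = np.sqrt((diff ** 2).sum(axis=-1))
    # Subtract row and column means, add back the grand mean.
    return (d - d.mean(axis=0, keepdims=True)
              - d.mean(axis=1, keepdims=True) + d.mean())


def dcov2(x, y):
    """Squared sample distance covariance (V-statistic form)."""
    a = _centered_distances(x)
    b = _centered_distances(y)
    return float((a * b).mean())


def projected_dcov2(X, y, beta):
    """Association between the projection X @ beta and the response y."""
    return dcov2(X @ beta, y)


# Toy data (assumed for illustration): y depends nonlinearly on the
# first variable only, so a sparse direction should suffice.
rng = np.random.default_rng(0)
n, p = 300, 5
X = rng.standard_normal((n, p))
y = X[:, 0] ** 2 + 0.1 * rng.standard_normal(n)

beta_signal = np.zeros(p)
beta_signal[0] = 1.0   # loads only on the informative variable
beta_noise = np.zeros(p)
beta_noise[1] = 1.0    # loads only on an uninformative variable

score_signal = projected_dcov2(X, y, beta_signal)
score_noise = projected_dcov2(X, y, beta_noise)
print(score_signal, score_noise)
```

In this toy setting the direction that loads on the informative variable should receive the larger score, even though the dependence is purely nonlinear. A sparse SDR estimator of the kind described above would search over directions rather than compare two fixed ones, with a mechanism that drives most loadings to zero.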


research
12/13/2019

MM Algorithms for Distance Covariance based Sufficient Dimension Reduction and Sufficient Variable Selection

Sufficient dimension reduction (SDR) using distance covariance (DCOV) wa...
research
10/22/2022

Model-free variable selection in sufficient dimension reduction via FDR control

Simultaneously identifying contributory variables and controlling the fa...
research
04/20/2021

Sparse Sliced Inverse Regression via Cholesky Matrix Penalization

We introduce a new sparse sliced inverse regression estimator called Cho...
research
08/05/2015

Direct Estimation of the Derivative of Quadratic Mutual Information with Application in Supervised Dimension Reduction

A typical goal of supervised dimension reduction is to find a low-dimens...
research
05/22/2023

Variable selection in multivariate regression model for spatially dependent data

This paper deals with variable selection in multivariate linear regressi...
research
01/15/2013

An Efficient Sufficient Dimension Reduction Method for Identifying Genetic Variants of Clinical Significance

Fast and cheaper next generation sequencing technologies will generate u...
research
02/02/2021

Divergence of an integral of a process with small ball estimate

The paper contains sufficient conditions on the function f and the stoch...
