The Kendall Interaction Filter for Variable Interaction Screening in Ultra High Dimensional Classification Problems

10/13/2020
by   Youssef Anzarmou, et al.
0

Accounting for important interaction effects can improve prediction of many statistical learning models. Identification of relevant interactions, however, is a challenging issue owing to their ultrahigh-dimensional nature. Interaction screening strategies can alleviate such issues. However, due to heavier tail distribution and complex dependence structure of interaction effects, innovative robust and/or model-free methods for screening interactions are required to better scale analysis of complex and high-throughput data. In this work, we develop a new model-free interaction screening method, termed Kendall Interaction Filter (KIF), for the classification in high-dimensional settings. The KIF method suggests a weighted-sum measure, which compares the overall to the within-cluster Kendall's τ of pairs of predictors, to select interactive couples of features. The proposed KIF measure captures relevant interactions for the clusters response-variable, handles continuous, categorical or a mixture of continuous-categorical features, and is invariant under monotonic transformations. We show that the KIF measure enjoys the sure screening property in the high-dimensional setting under mild conditions, without imposing sub-exponential moment assumptions on the features' distributions. We illustrate the favorable behavior of the proposed methodology compared to the methods in the same category using simulation studies, and we conduct real data analyses to demonstrate its utility.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/05/2015

Innovated interaction screening for high-dimensional nonlinear classification

This paper is concerned with the problems of interaction screening and n...
research
04/17/2023

Grouped feature screening for ultrahigh-dimensional classification via Gini distance correlation

Gini distance correlation (GDC) was recently proposed to measure the dep...
research
02/10/2019

BOLT-SSI: A Statistical Approach to Screening Interaction Effects for Ultra-High Dimensional Data

Detecting interaction effects is a crucial step in various applications....
research
11/12/2021

Epistasis Detection Via the Joint Cumulant

Selecting influential nonlinear interactive features from ultrahigh dime...
research
10/14/2022

Variable Importance Based Interaction Modeling with an Application on Initial Spread of COVID-19 in China

Interaction selection for linear regression models with both continuous ...
research
06/22/2023

Feature screening for clustering analysis

In this paper, we consider feature screening for ultrahigh dimensional c...
research
05/28/2016

Interaction Pursuit with Feature Screening and Selection

Understanding how features interact with each other is of paramount impo...

Please sign up or login with your details

Forgot password? Click here to reset