Selection and Estimation Optimality in High Dimensions with the TWIN Penalty

06/05/2018
by   Xiaowu Dai, et al.
0

We introduce a novel class of variable selection penalties called TWIN, which provides sensible data-adaptive penalization. Under a linear sparsity regime and random Gaussian designs we show that penalties in the TWIN class have a high probability of selecting the correct model and furthermore result in minimax optimal estimators. The general shape of penalty functions in the TWIN class is the key ingredient to its desirable properties and results in improved theoretical and empirical performance over existing penalties. In this work we introduce two examples of TWIN penalties that admit simple and efficient coordinate descent algorithms, making TWIN practical in large data settings. We demonstrate in challenging and realistic simulation settings with high correlations between active and inactive variables that TWIN has high power in variable selection while controlling the number of false discoveries, outperforming standard penalties.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/26/2019

A High-dimensional M-estimator Framework for Bi-level Variable Selection

In high-dimensional data analysis, bi-level sparsity is often assumed wh...
research
12/14/2021

Variable Selection and Regularization via Arbitrary Rectangle-range Generalized Elastic Net

We introduce the arbitrary rectangle-range generalized elastic net penal...
research
02/26/2018

Scalable kernel-based variable selection with sparsistency

Variable selection is central to high-dimensional data analysis, and var...
research
09/08/2023

Generalized Variable Selection Algorithms for Gaussian Process Models by LASSO-like Penalty

With the rapid development of modern technology, massive amounts of data...
research
08/22/2023

Nonparametric Assessment of Variable Selection and Ranking Algorithms

Selecting from or ranking a set of candidates variables in terms of thei...
research
04/27/2018

Sequential Optimization in Locally Important Dimensions

Optimizing a black-box function is challenging when the underlying funct...
research
05/20/2014

Sequential Advantage Selection for Optimal Treatment Regimes

Variable selection for optimal treatment regime in a clinical trial or a...

Please sign up or login with your details

Forgot password? Click here to reset