Signed Support Recovery for Single Index Models in High-Dimensions

11/07/2015
by   Matey Neykov, et al.
0

In this paper we study the support recovery problem for single index models Y=f(X^β,ε), where f is an unknown link function, X∼ N_p(0,I_p) and β is an s-sparse unit vector such that β_i∈{±1/√(s),0}. In particular, we look into the performance of two computationally inexpensive algorithms: (a) the diagonal thresholding sliced inverse regression (DT-SIR) introduced by Lin et al. (2015); and (b) a semi-definite programming (SDP) approach inspired by Amini & Wainwright (2008). When s=O(p^1-δ) for some δ>0, we demonstrate that both procedures can succeed in recovering the support of β as long as the rescaled sample size κ=n/s(p-s) is larger than a certain critical threshold. On the other hand, when κ is smaller than a critical value, any algorithm fails to recover the support with probability at least 1/2 asymptotically. In other words, we demonstrate that both DT-SIR and the SDP approach are optimal (up to a scalar) for recovering the support of β in terms of sample size. We provide extensive simulations, as well as a real dataset application to help verify our theoretical observations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/25/2015

L1-Regularized Least Squares for Support Recovery of High Dimensional Single Index Models with Gaussian Designs

It is known that for a certain class of single index models (SIMs) Y = f...
research
07/30/2013

Sharp Threshold for Multivariate Multi-Response Linear Regression via Block Regularized Lasso

In this paper, we investigate a multivariate multi-response (MVMR) linea...
research
08/17/2018

Fitting Probabilistic Index Models on Large Datasets

Recently, Thas et al. (2012) introduced a new statistical model for the ...
research
06/16/2013

Do semidefinite relaxations solve sparse PCA up to the information limit?

Estimating the leading principal components of data, assuming they are s...
research
02/19/2019

A Semiparametric Effect Size Index

Effect size indices are useful tools in study design and reporting becau...
research
05/04/2018

Global testing under the sparse alternatives for single index models

For the single index model y=f(β^τx,ϵ) with Gaussian design, and β is a...
research
11/14/2019

Concordance probability in a big data setting: application in non-life insurance

The concordance probability or C-index is a popular measure to capture t...

Please sign up or login with your details

Forgot password? Click here to reset