Parallel subgroup analysis of high-dimensional data via M-regression

05/01/2020
by   Chao Cheng, et al.
0

It becomes an interesting problem to identify subgroup structures in data analysis as populations are probably heterogeneous in practice. In this paper, we consider M-estimators together with both concave and pairwise fusion penalties, which can deal with high-dimensional data containing some outliers. The penalties are applied both on covariates and treatment effects, where the estimation is expected to achieve both variable selection and data clustering simultaneously. An algorithm is proposed to process relatively large datasets based on parallel computing. We establish the convergence analysis of the proposed algorithm, the oracle property of the penalized M-estimators, and the selection consistency of the proposed criterion. Our numerical study demonstrates that the proposed method is promising to efficiently identify subgroups hidden in high-dimensional data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/26/2019

A High-dimensional M-estimator Framework for Bi-level Variable Selection

In high-dimensional data analysis, bi-level sparsity is often assumed wh...
research
06/04/2008

High dimensional gaussian classification

High dimensional data analysis is known to be as a challenging problem. ...
research
09/14/2019

Higher Order Refinements by Bootstrap in Lasso and other Penalized Regression Methods

Selection of important covariates and to drop the unimportant ones from ...
research
05/26/2021

An algorithm-based multiple detection influence measure for high dimensional regression using expectile

The identification of influential observations is an important part of d...
research
10/15/2015

Robust Learning for Optimal Treatment Decision with NP-Dimensionality

In order to identify important variables that are involved in making opt...
research
03/22/2021

If You Must Choose Among Your Children, Pick the Right One

Given a simplicial complex K and an injective function f from the vertic...
research
06/02/2021

On Selection of Semiparametric Spatial Regression Models

In this paper, we focus on the variable selection techniques for a class...

Please sign up or login with your details

Forgot password? Click here to reset