Semiparametric Expectile Regression for High-dimensional Heavy-tailed and Heterogeneous Data

08/18/2019
by Jun Zhao, et al.

Recently, high-dimensional heterogeneous data have attracted a lot of attention and discussion. Under heterogeneity, semiparametric regression is a popular choice for modeling data in statistics. In this paper, we exploit the advantages of expectile regression in computation and in the analysis of heterogeneity, and propose regularized partially linear additive expectile regression with a nonconvex penalty, such as SCAD or MCP, for such high-dimensional heterogeneous data. We focus on a more realistic scenario: the regression error is heavy-tailed and has only finite moments, which violates the classical sub-Gaussian assumption but is more common in practice. Under some regularity conditions, we show that, with probability tending to one, the oracle estimator is one of the local minima of our optimization problem. The theoretical study indicates that the dimensionality of the linear covariates that our procedure can handle is essentially restricted by the moment condition on the regression error. For computation, since the corresponding optimization problem is nonconvex and nonsmooth, we derive a two-step algorithm to solve it. Finally, we demonstrate through Monte Carlo simulation studies and a real data example that the proposed method performs well in estimation accuracy and model selection. Moreover, by taking different expectile weights α, we are able to detect heterogeneity and explore the entire conditional distribution of the response variable, which indicates the usefulness of our proposed method for analyzing high-dimensional heterogeneous data.
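
To make the ingredients named in the abstract concrete, the following is a minimal sketch of the standard asymmetric squared (expectile) loss with weight α, a simple fixed-point routine for a sample expectile, and the textbook SCAD and MCP penalty functions. It is not the paper's two-step algorithm or its partially linear additive model. All function names are hypothetical, and the default shape parameters (a = 3.7 for SCAD, a = 3 for MCP) are conventional choices assumed here, not values taken from the paper.

```python
def expectile_loss(residual, alpha):
    """Asymmetric squared loss: |alpha - 1{r < 0}| * r**2, with 0 < alpha < 1."""
    weight = alpha if residual >= 0 else 1.0 - alpha
    return weight * residual ** 2


def sample_expectile(values, alpha, tol=1e-10, max_iter=1000):
    """alpha-expectile of a sample, found by iteratively reweighted averaging.

    The minimizer m of sum(expectile_loss(v - m, alpha)) satisfies
    m = sum(w_i * v_i) / sum(w_i), where w_i = alpha if v_i > m else 1 - alpha.
    """
    m = sum(values) / len(values)  # start from the mean (the 0.5-expectile)
    for _ in range(max_iter):
        weights = [alpha if v > m else 1.0 - alpha for v in values]
        new_m = sum(w * v for w, v in zip(weights, values)) / sum(weights)
        if abs(new_m - m) < tol:
            return new_m
        m = new_m
    return m


def scad_penalty(t, lam, a=3.7):
    """SCAD penalty at coefficient t (a = 3.7 is an assumed, conventional default)."""
    t = abs(t)
    if t <= lam:
        return lam * t                      # L1-like region near zero
    if t <= a * lam:
        return (2 * a * lam * t - t ** 2 - lam ** 2) / (2 * (a - 1))
    return lam ** 2 * (a + 1) / 2           # flat: large coefficients are not shrunk further


def mcp_penalty(t, lam, a=3.0):
    """MCP penalty at coefficient t (a = 3 is an assumed default)."""
    t = abs(t)
    if t <= a * lam:
        return lam * t - t ** 2 / (2 * a)
    return a * lam ** 2 / 2                 # flat beyond a * lam


if __name__ == "__main__":
    data = [1.0, 2.0, 3.0, 4.0, 10.0]       # made-up, right-skewed toy sample
    for alpha in (0.1, 0.5, 0.9):
        print(alpha, sample_expectile(data, alpha))
```

Varying α in `sample_expectile` moves the estimate through the lower and upper parts of the sample, which is the same mechanism the abstract relies on to detect heterogeneity. Both penalties become constant for large coefficients, unlike the lasso penalty, which keeps growing linearly.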


research
09/20/2019

Robust Estimation and Shrinkage in Ultrahigh Dimensional Expectile Regression with Heavy Tails and Variance Heterogeneity

High-dimensional data subject to heavy-tailed phenomena and heterogeneit...
research
01/01/2015

Statistical consistency and asymptotic normality for high-dimensional robust M-estimators

We study theoretical properties of regularized robust M-estimators, appl...
research
11/06/2018

Scale calibration for high-dimensional robust regression

We present a new method for high-dimensional linear regression when a sc...
research
01/06/2018

High Dimensional Elliptical Sliced Inverse Regression in non-Gaussian Distributions

Sliced inverse regression (SIR) is the most widely-used sufficient dimen...
research
07/07/2021

Robust Variable Selection and Estimation Via Adaptive Elastic Net S-Estimators for Linear Regression

Heavy-tailed error distributions and predictors with anomalous values ar...
research
10/30/2020

Enveloped Huber Regression

Huber regression (HR) is a popular robust alternative to the least squar...
research
07/09/2019

Nonconvex Regularized Robust Regression with Oracle Properties in Polynomial Time

This paper investigates tradeoffs among optimization errors, statistical...
