Sparse Regression for Extreme Values

07/08/2020
by Andersen Chang, et al.

We study the problem of selecting features associated with extreme values in high-dimensional linear regression. In linear modeling, abnormal extreme values or outliers are normally treated as anomalies that should either be removed from the data or handled with robust regression methods. In many situations, however, the extreme values are not outliers but the signals of interest; consider, for example, traces from spiking neurons, volatility in finance, or extreme events in climate science. In this paper, we propose a new method for sparse high-dimensional linear regression for extreme values, motivated by the Subbotin, or generalized normal, distribution. This leads us to use an ℓ_p norm loss where p is an even integer greater than two; we demonstrate that this loss increases the weight on extreme values. We prove consistency and variable selection consistency for ℓ_p norm regression with a Lasso penalty, which we term the Extreme Lasso. Through simulation studies and real-world data examples, we show that this method outperforms other methods currently used in the literature for selecting features associated with extreme values in high-dimensional regression.
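As a rough illustration of the kind of estimator the abstract describes, the sketch below minimizes an ℓ_p loss (p even, greater than two) plus a Lasso penalty with proximal gradient descent and backtracking. The abstract does not give the exact objective or the solver, so the 1/(pn) scaling, the function names (`extreme_lasso`, `lp_loss`, `soft_threshold`), and the choice of algorithm are all assumptions made here for illustration, not the authors' implementation.

```python
import numpy as np


def soft_threshold(z, t):
    """Proximal operator of t * ||.||_1 (elementwise soft-thresholding)."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)


def lp_loss(X, y, beta, p):
    """Assumed loss: (1 / (p * n)) * sum_i (y_i - x_i' beta)^p, with p even."""
    r = y - X @ beta
    return np.sum(r ** p) / (p * len(y))


def extreme_lasso(X, y, lam, p=4, n_iter=500, step=1.0, shrink=0.5, tol=1e-10):
    """Proximal gradient with backtracking for lp_loss + lam * ||beta||_1.

    Hypothetical solver: the l_p loss has no global Lipschitz constant for
    p > 2, so the step size is found by backtracking at every iteration.
    """
    if p % 2 != 0 or p <= 2:
        raise ValueError("p must be an even integer greater than two")
    n, d = X.shape
    beta = np.zeros(d)
    for _ in range(n_iter):
        r = y - X @ beta
        # p - 1 is odd, so r ** (p - 1) keeps the sign of each residual.
        grad = -(X.T @ (r ** (p - 1))) / n
        f_old = lp_loss(X, y, beta, p)
        t = step
        while True:
            cand = soft_threshold(beta - t * grad, t * lam)
            diff = cand - beta
            # Standard sufficient-decrease test for the smooth part.
            bound = f_old + grad @ diff + (diff @ diff) / (2.0 * t)
            if lp_loss(X, y, cand, p) <= bound:
                break
            t *= shrink
        beta = cand
        if np.max(np.abs(diff)) < tol:
            break
    return beta


if __name__ == "__main__":
    # Synthetic data, purely for demonstration.
    rng = np.random.default_rng(0)
    X = rng.standard_normal((200, 20))
    beta_true = np.zeros(20)
    beta_true[:2] = [3.0, -2.0]
    y = X @ beta_true + 0.1 * rng.standard_normal(200)
    print(np.round(extreme_lasso(X, y, lam=0.1, p=4), 3))
```

Raising residuals to the power p gives large residuals far more influence on the gradient than a squared-error loss would, which is the sense in which the loss puts more weight on extreme values. Setting p = 2 would recover the ordinary Lasso objective, although the guard above rejects it to match the abstract's condition on p.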
