Robust Nonparametric Regression via Sparsity Control with Application to Load Curve Data Cleansing

04/04/2011
by Gonzalo Mateos, et al.

Nonparametric methods are widely applicable to statistical inference problems, since they rely on few modeling assumptions. In this context, the fresh look advocated here brings ideas from variable selection and compressive sampling to bear on robustifying nonparametric regression against outliers, that is, data markedly deviating from the postulated models. A variational counterpart to least-trimmed squares regression is shown to be closely related to an L0-(pseudo)norm-regularized estimator that encourages sparsity in a vector explicitly modeling the outliers. This connection suggests efficient solvers based on convex relaxation, which lead naturally to a variational M-type estimator equivalent to the least-absolute shrinkage and selection operator (Lasso). Outliers are identified by judiciously tuning regularization parameters, which amounts to controlling the sparsity of the outlier vector along the whole robustification path of Lasso solutions. Replacing the L0-(pseudo)norm with a nonconvex surrogate yields an improved estimator with reduced bias and enhanced generalization capability. The novel robust spline-based smoother is adopted to cleanse load curve data, a key task aiding operational decisions in the envisioned smart grid system. Computer simulations and tests on real load curve data corroborate the effectiveness of the novel sparsity-controlling robust estimators.
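
The outlier-sparsity idea summarized above can be illustrated with a minimal sketch. It is not the paper's spline-based smoother: for brevity it assumes a plain linear model y = Xb + o + noise, where o is the sparse outlier vector, and minimizes ||y - Xb - o||^2 + lam * ||o||_1 by alternating a least-squares step in b with a soft-thresholding step in o. The function names, the synthetic data, and the values of lam are all assumptions made for illustration.

```python
import numpy as np

def soft_threshold(r, t):
    """Entrywise soft-thresholding: shrink r toward zero by t."""
    return np.sign(r) * np.maximum(np.abs(r) - t, 0.0)

def robust_fit(X, y, lam, n_iter=200):
    """Alternating minimization of ||y - X b - o||^2 + lam * ||o||_1.

    Hypothetical helper: a linear model stands in for the
    nonparametric (spline) fit discussed in the abstract.
    """
    o = np.zeros(len(y))
    X_pinv = np.linalg.pinv(X)
    for _ in range(n_iter):
        b = X_pinv @ (y - o)                       # least squares on outlier-compensated data
        o = soft_threshold(y - X @ b, lam / 2.0)   # exact minimizer over o for fixed b
    return b, o

# Synthetic data (assumed): line with intercept 1, slope 2, three gross outliers.
rng = np.random.default_rng(0)
n = 50
x = np.linspace(0.0, 1.0, n)
X = np.column_stack([np.ones(n), x])
y = 1.0 + 2.0 * x + 0.1 * rng.standard_normal(n)
y[[5, 20, 40]] += 10.0

b_hat, o_hat = robust_fit(X, y, lam=2.0)
print("estimated coefficients:", b_hat)
print("flagged outlier indices:", np.flatnonzero(o_hat))

# A coarse "robustification path": larger lam flags fewer samples as outliers.
for lam in [0.1, 0.5, 2.0, 10.0, 100.0]:
    _, o_path = robust_fit(X, y, lam)
    print(f"lam={lam}: {np.count_nonzero(o_path)} nonzero outlier entries")
```

Sweeping lam as in the final loop is a crude analogue of tracing the robustification path: the regularization parameter controls how many entries of the outlier vector are nonzero, and hence how many samples are treated as outliers.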

research
11/08/2011

Robust PCA as Bilinear Decomposition with Outlier-Sparsity Regularization

Principal component analysis (PCA) is widely used for dimensionality red...
research
04/18/2023

The Adaptive τ-Lasso: Its Robustness and Oracle Properties

This paper introduces a new regularized version of the robust τ-regressi...
research
12/28/2020

High-dimensional inference robust to outliers with l1-norm penalization

This paper studies inference in the high-dimensional linear regression m...
research
06/04/2019

Inference robust to outliers with l1-norm penalization

This paper considers the problem of inference in a linear regression mod...
research
07/11/2013

Minimum Distance Estimation for Robust High-Dimensional Regression

We propose a minimum distance estimation method for robust regression in...
research
04/26/2018

GEP-MSCRA for computing the group zero-norm regularized least squares estimator

This paper concerns with the group zero-norm regularized least squares e...
research
05/16/2019

How Entropic Regression Beats the Outliers Problem in Nonlinear System Identification

System identification (SID) is central in science and engineering applic...