Robust Multivariate Estimation Based On Statistical Data Depth Filters

09/10/2019
by Giovanni Saraceno, et al.

In classical contamination models, such as the gross-error model (the Huber and Tukey contamination model, or case-wise contamination), observations are the units to be identified as outliers or not. This model is very useful when the number of variables is moderately small. Alqallaf et al. [2009] showed the limits of this approach for a larger number of variables and introduced the independent contamination model (cell-wise contamination), in which the cells are the units to be identified as outliers or not. One approach to dealing with both types of contamination at the same time is to filter out the contaminated cells from the data set and then apply a robust procedure able to handle case-wise outliers and missing values. Here we develop a general framework for building filters in any dimension based on statistical data depth functions. We show that previous approaches, e.g., Agostinelli et al. [2015a] and Leung et al. [2017], are special cases. We illustrate our method using the half-space depth.
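To make the filtering idea concrete, here is a minimal sketch of a univariate cell filter built on the empirical half-space (Tukey) depth. It is an illustration under simplifying assumptions, not the filter developed in the paper: the function names `halfspace_depth_1d` and `filter_cells`, the `min_depth` parameter, and the fixed-threshold rule are all hypothetical choices made for this example.

```python
import numpy as np


def halfspace_depth_1d(column):
    """Empirical univariate half-space depth of each value within its own column.

    For a value x this is min(#{x_i <= x}, #{x_i >= x}) / n.
    """
    x = np.asarray(column, dtype=float)
    n = x.size
    sorted_x = np.sort(x)
    # Count of observations <= each value and >= each value.
    n_le = np.searchsorted(sorted_x, x, side="right")
    n_ge = n - np.searchsorted(sorted_x, x, side="left")
    return np.minimum(n_le, n_ge) / n


def filter_cells(data, min_depth=0.05):
    """Replace cells whose column-wise depth is below `min_depth` with NaN.

    `min_depth` is an illustrative fixed threshold (an assumption of this
    sketch), not a calibrated rule from the paper.
    """
    data = np.asarray(data, dtype=float)
    filtered = data.copy()
    for j in range(data.shape[1]):
        depth = halfspace_depth_1d(data[:, j])
        filtered[depth < min_depth, j] = np.nan
    return filtered


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    sample = rng.normal(size=(200, 3))
    sample[0, 1] = 50.0  # plant one cell-wise outlier
    cleaned = filter_cells(sample, min_depth=0.01)
    print("flagged cells:", int(np.isnan(cleaned).sum()))
```

A fixed depth threshold like this one also flags the most extreme cells of clean data, so it only conveys the mechanics. The flagged cells become missing values, which a robust procedure that handles case-wise outliers and missing data could then process.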
