Doubts on the efficacy of outliers correction methods

07/23/2019
by   Marjorie Fonnesu, et al.
0

While the utilisation of different methods of outliers correction has been shown to counteract the inferential error produced by the presence of contaminating data not belonging to the studied population; the effects produced by their utilisation when samples do not contain contaminating outliers are less clear. Here a simulation approach shows that the most popular methods of outliers correction (2 Sigma, 3 Sigma, MAD, IQR, Grubbs and winsorizing) worsen the inferential evaluation of the studied population in this condition, in particular producing an inflation of Type I error and increasing the error committed in estimating the population mean and STD. We show that those methods that have the highest efficacy in counteract the inflation of Type I and Type II errors in the presence of contaminating outliers also produce the stronger increase of false positive results in their absence, suggesting that the systematic utilisation of methods for outliers correction risk to produce more harmful than beneficial effect on statistical inference. We finally propose that the safest way to deal with the presence of outliers for statistical comparisons is the utilisation of non-parametric tests

READ FULL TEXT

page 6

page 9

research
04/24/2018

Differences of Type I error rates using SAS and SPSS for repeated measures designs

We examined Type I error rates of Multilevel Linear Models (MLM) and rep...
research
07/23/2018

Outliers and The Ostensibly Heavy Tails

The aim of the paper is to show that the presence of one possible type o...
research
10/27/2019

Kernel Stein Tests for Multiple Model Comparison

We address the problem of non-parametric multiple model comparison: give...
research
07/08/2022

Outliers, Dynamics, and the Independence Postulate

We show that outliers occur almost surely in computable dynamics over in...
research
10/01/2020

Estimation of copulas via Maximum Mean Discrepancy

This paper deals with robust inference for parametric copula models. Est...
research
07/16/2019

Outliers in meta-analysis: an asymmetric trimmed-mean approach

The adaptive asymmetric trimmed mean is a known way of estimating centra...
research
08/04/2021

An autoregressive model for a censored data denoising method robust to outliers with application to the Obépine SARS-Cov-2 monitoring

A sentinel network, Obépine, has been designed to monitor SARS-CoV-2 vir...

Please sign up or login with your details

Forgot password? Click here to reset