Estimating minimum effect with outlier selection

09/21/2018
by Alexandra Carpentier, et al.

We introduce one-sided versions of Huber's contamination model, in which corrupted samples tend to take larger values than uncorrupted ones. Two intertwined problems are addressed: estimating the mean of the uncorrupted samples (the minimum effect) and selecting the corrupted samples (the outliers). For minimum effect estimation, we derive the minimax risks and introduce estimators that adapt to the unknown number of contaminations. Interestingly, the optimal convergence rate differs markedly from that in the classical Huber contamination model. Our analysis also uncovers the effect of particular structural assumptions on the distribution of the contaminated samples. For outlier selection, we formulate the problem in a multiple testing framework in which the location and scaling of the null hypotheses are unknown. We rigorously prove that the null distribution can be estimated while maintaining a theoretical guarantee on the amount of falsely selected outliers, through either false discovery rate (FDR) control or post hoc bounds. As a by-product, we address a long-standing open issue on FDR control under equi-correlation, which reinforces the interest of removing dependence when performing multiple testing.
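
To make the setting concrete, here is a minimal Python sketch of a one-sided contamination experiment followed by outlier selection with the standard Benjamini-Hochberg step-up procedure. It is an illustration under stated assumptions, not the estimator or the procedure analysed in the paper: the null is assumed Gaussian with known unit scale, the corrupted samples are assumed to be shifted by a fixed amount, and the sample median stands in as a crude plug-in for the minimum effect. All function names and parameter values are hypothetical.

```python
import random
import statistics
from math import erfc, sqrt


def simulate_one_sided(n, k, theta=0.0, shift=3.0, seed=0):
    """Draw n samples: n - k from N(theta, 1), k from N(theta + shift, 1).

    The fixed upward shift is an assumption made for illustration only.
    """
    rng = random.Random(seed)
    is_outlier = [i < k for i in range(n)]
    rng.shuffle(is_outlier)
    x = [rng.gauss(theta + (shift if o else 0.0), 1.0) for o in is_outlier]
    return x, is_outlier


def upper_tail_pvalue(value, loc, scale=1.0):
    """One-sided p-value P(Z >= value) for Z ~ N(loc, scale^2)."""
    return 0.5 * erfc((value - loc) / (scale * sqrt(2.0)))


def benjamini_hochberg(pvalues, alpha):
    """Standard BH step-up procedure; returns sorted indices of rejections."""
    n = len(pvalues)
    order = sorted(range(n), key=lambda i: pvalues[i])
    k_hat = 0
    for rank, i in enumerate(order, start=1):
        if pvalues[i] <= alpha * rank / n:
            k_hat = rank  # keep the largest rank passing the threshold
    return sorted(order[:k_hat])


x, is_outlier = simulate_one_sided(n=1000, k=100)

# Crude plug-in for the minimum effect (assumption: NOT the paper's estimator).
theta_hat = statistics.median(x)

# Test each sample against the estimated null N(theta_hat, 1).
pvals = [upper_tail_pvalue(v, theta_hat) for v in x]
selected = benjamini_hochberg(pvals, alpha=0.1)

# False discovery proportion of this single run.
false_selections = sum(1 for i in selected if not is_outlier[i])
fdp = false_selections / max(len(selected), 1)
print(f"theta_hat={theta_hat:.3f}, selected={len(selected)}, FDP={fdp:.3f}")
```

Because the corrupted samples only push values upward, any location estimate built from the whole sample is biased upward, which in turn inflates the p-values. Plugging an estimated null into a multiple testing procedure therefore needs its own guarantee, which is the question the abstract raises.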

research
12/06/2019

On using empirical null distributions in Benjamini-Hochberg procedure

When performing multiple testing, adjusting the distribution of the null...
research
12/06/2019

On using empirical null distribution in Benjamini-Hochberg procedure

When performing multiple testing, adjusting the distribution of the null...
research
08/10/2023

Rank tests for outlier detection

In novelty detection, the objective is to determine whether the test sam...
research
02/17/2020

Estimating the number and effect sizes of non-null hypotheses

We study the problem of estimating the distribution of effect sizes (the...
research
12/30/2017

Gaining power in multiple testing of interval hypotheses via conditionalization

In this paper we introduce a novel procedure for improving multiple test...
research
05/11/2021

Trimmed Minimum Error Entropy for Robust Online Regression

In this paper, online linear regression in environments corrupted by non...
research
07/04/2018

Post hoc false positive control for spatially structured hypotheses

In a high dimensional multiple testing framework, we present new confide...
