Robust High-dimensional Tuning Free Multiple Testing

11/22/2022
by   Jianqing Fan, et al.
0

A stylized feature of high-dimensional data is that many variables have heavy tails, and robust statistical inference is critical for valid large-scale statistical inference. Yet, the existing developments such as Winsorization, Huberization and median of means require the bounded second moments and involve variable-dependent tuning parameters, which hamper their fidelity in applications to large-scale problems. To liberate these constraints, this paper revisits the celebrated Hodges-Lehmann (HL) estimator for estimating location parameters in both the one- and two-sample problems, from a non-asymptotic perspective. Our study develops Berry-Esseen inequality and Cramér type moderate deviation for the HL estimator based on newly developed non-asymptotic Bahadur representation, and builds data-driven confidence intervals via a weighted bootstrap approach. These results allow us to extend the HL estimator to large-scale studies and propose tuning-free and moment-free high-dimensional inference procedures for testing global null and for large-scale multiple testing with false discovery proportion control. It is convincingly shown that the resulting tuning-free and moment-free methods control false discovery proportion at a prescribed level. The simulation studies lend further support to our developed theory.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2017

A New Perspective on Robust M-Estimation: Finite Sample Theory and Applications to Dependence-Adjusted Multiple Testing

Heavy-tailed errors impair the accuracy of the least squares estimate, w...
research
01/09/2023

Statistical Inference for Ultrahigh Dimensional Location Parameter Based on Spatial Median

Motivated by the widely used geometric median-of-means estimator in mach...
research
11/29/2022

On Large-Scale Multiple Testing Over Networks: An Asymptotic Approach

This work concerns developing communication- and computation-efficient m...
research
08/17/2022

Two-Stage Robust and Sparse Distributed Statistical Inference for Large-Scale Data

In this paper, we address the problem of conducting statistical inferenc...
research
03/14/2023

Robust Multiple Testing under High-dimensional Dynamic Factor Model

Large-scale multiple testing under static factor models is commonly used...
research
11/15/2017

FARM-Test: Factor-Adjusted Robust Multiple Testing with False Discovery Control

Large-scale multiple testing with correlated and heavy-tailed data arise...
research
10/30/2015

A Unified Theory of Confidence Regions and Testing for High Dimensional Estimating Equations

We propose a new inferential framework for constructing confidence regio...

Please sign up or login with your details

Forgot password? Click here to reset