Distributed Bootstrap for Simultaneous Inference Under High Dimensionality

02/19/2021
by Yang Yu, et al.

We propose a distributed bootstrap method for simultaneous inference on high-dimensional massive data that are stored and processed across many machines. The method produces an ℓ_∞-norm confidence region based on a communication-efficient de-biased lasso, and we propose an efficient cross-validation approach to tune the method at every iteration. We theoretically prove a lower bound on the number of communication rounds, τ_min, that warrants statistical accuracy and efficiency. Furthermore, τ_min increases only logarithmically with the number of workers and the intrinsic dimensionality, while remaining nearly invariant to the nominal dimensionality. We test our theory with extensive simulation studies and a variable screening task on a semi-synthetic dataset based on the US Airline On-time Performance dataset. The code to reproduce the numerical results is available on GitHub: https://github.com/skchao74/Distributed-bootstrap.
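
To make the idea of an ℓ_∞-norm confidence region concrete, here is a minimal Python sketch of a multiplier bootstrap run on worker-level summaries. It is not the paper's algorithm: it targets a low-dimensional mean vector rather than a de-biased lasso estimate, uses a single communication round, and assumes every worker holds a shard of the same size. The function name `linf_confidence_region` and all of its parameters are hypothetical and chosen only for illustration.

```python
import numpy as np


def linf_confidence_region(shards, alpha=0.05, n_boot=2000, seed=0):
    """Simultaneous (sup-norm) confidence region for a mean vector.

    shards : list of (n, d) arrays, one per worker (equal n is assumed).
    Returns (center, radius); the region is {mu : max_l |center_l - mu_l| <= radius}.
    """
    rng = np.random.default_rng(seed)

    # One communication round: each worker sends only its local mean.
    local_means = np.stack([s.mean(axis=0) for s in shards])  # shape (k, d)
    k = local_means.shape[0]
    n = shards[0].shape[0]
    total = n * k

    # The master averages the local means to get the global estimate.
    center = local_means.mean(axis=0)

    # Centered, rescaled worker summaries; each row has covariance close
    # to that of a single observation.
    scaled = np.sqrt(n) * (local_means - center)  # shape (k, d)

    # Multiplier bootstrap: perturb the k summaries with Gaussian weights
    # and record the sup-norm of each perturbed sum.
    eps = rng.standard_normal((n_boot, k))
    boot = eps @ scaled / np.sqrt(k)  # shape (n_boot, d)
    sup_stats = np.abs(boot).max(axis=1)

    # The (1 - alpha) quantile of the sup-norm gives the half-width.
    radius = np.quantile(sup_stats, 1 - alpha) / np.sqrt(total)
    return center, radius


# Example: 50 simulated workers, 200 observations each, 5 coordinates.
demo_rng = np.random.default_rng(1)
demo_shards = [demo_rng.standard_normal((200, 5)) for _ in range(50)]
center, radius = linf_confidence_region(demo_shards)
print(center, radius)
```

Because the bootstrap here resamples only the k worker summaries, its accuracy in this sketch depends on the number of workers as well as the total sample size.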

research
02/19/2020

Simultaneous Inference for Massive Data: Distributed Bootstrap

In this paper, we propose a bootstrap method applied to massive data pro...
research
05/06/2017

Comments on `High-dimensional simultaneous inference with the bootstrap'

We provide comments on the article "High-dimensional simultaneous infere...
research
11/09/2017

Debiasing the Debiased Lasso with Bootstrap

In this paper, we prove that under proper conditions, bootstrap can furt...
research
11/02/2021

High-dimensional Simultaneous Inference on Non-Gaussian VAR Model via De-biased Estimator

Simultaneous inference for high-dimensional non-Gaussian time series is ...
research
06/13/2018

LASSO-Driven Inference in Time and Space

We consider the estimation and inference in a system of high-dimensional...
research
01/31/2022

Fast Distributed k-Means with a Small Number of Rounds

We propose a new algorithm for k-means clustering in a distributed setti...
research
07/19/2022

ReBoot: Distributed statistical learning via refitting Bootstrap samples

In this paper, we study a one-shot distributed learning algorithm via re...
