Quantile Regression Under Memory Constraint

10/18/2018
by   Xi Chen, et al.
0

This paper studies the inference problem in quantile regression (QR) for a large sample size n but under a limited memory constraint, where the memory can only store a small batch of data of size m. A natural method is the naïve divide-and-conquer approach, which splits data into batches of size m, computes the local QR estimator for each batch, and then aggregates the estimators via averaging. However, this method only works when n=o(m^2) and is computationally expensive. This paper proposes a computationally efficient method, which only requires an initial QR estimator on a small batch of data and then successively refines the estimator via multiple rounds of aggregations. Theoretically, as long as n grows polynomially in m, we establish the asymptotic normality for the obtained estimator and show that our estimator with only a few rounds of aggregations achieves the same efficiency as the QR estimator computed on all the data. Moreover, our result allows the case that the dimensionality p goes to infinity. The proposed method can also be applied to address the QR problem under distributed computing environment (e.g., in a large-scale sensor network) or for real-time streaming data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/13/2019

A Pooled Quantile Estimator for Parallel Simulations

Quantile is an important risk measure quantifying the stochastic system ...
research
12/09/2020

Smoothed Quantile Regression with Large-Scale Inference

Quantile regression is a powerful tool for learning the relationship bet...
research
06/30/2021

Adaptive Capped Least Squares

This paper proposes the capped least squares regression with an adaptive...
research
08/18/2022

Optimal One-pass Nonparametric Estimation Under Memory Constraint

For nonparametric regression in the streaming setting, where data consta...
research
04/17/2020

A Survey of Approximate Quantile Computation on Large-scale Data (Technical Report)

As data volume grows extensively, data profiling helps to extract metada...
research
10/15/2022

Distributed Estimation and Inference for Semi-parametric Binary Response Models

The development of modern technology has enabled data collection of unpr...
research
04/16/2019

A Global Bias-Correction DC Method for Biased Estimation under Memory Constraint

This paper introduces a global bias-correction divide-and-conquer (GBC-D...

Please sign up or login with your details

Forgot password? Click here to reset