Global Bias-Corrected Divide-and-Conquer by Quantile-Matched Composite for General Nonparametric Regressions

01/29/2022
by   Yan Chen, et al.
0

The issues of bias-correction and robustness are crucial in the strategy of divide-and-conquer (DC), especially for asymmetric nonparametric models with massive data. It is known that quantile-based methods can achieve the robustness, but the quantile estimation for nonparametric regression has non-ignorable bias when the error distribution is asymmetric. This paper explores a global bias-corrected DC by quantile-matched composite for nonparametric regressions with general error distributions. The proposed strategies can achieve the bias-correction and robustness, simultaneously. Unlike common DC quantile estimations that use an identical quantile level to construct a local estimator by each local machine, in the new methodologies, the local estimators are obtained at various quantile levels for different data batches, and then the global estimator is elaborately constructed as a weighted sum of the local estimators. In the weighted sum, the weights and quantile levels are well-matched such that the bias of the global estimator is corrected significantly, especially for the case where the error distribution is asymmetric. Based on the asymptotic properties of the global estimator, the optimal weights are attained, and the corresponding algorithms are then suggested. The behaviors of the new methods are further illustrated by various numerical examples from simulation experiments and real data analyses. Compared with the competitors, the new methods have the favorable features of estimation accuracy, robustness, applicability and computational efficiency.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/16/2019

A Global Bias-Correction DC Method for Biased Estimation under Memory Constraint

This paper introduces a global bias-correction divide-and-conquer (GBC-D...
research
09/26/2022

Renewable Composite Quantile Method and Algorithm for Nonparametric Models with Streaming Data

We are interested in renewable estimations and algorithms for nonparamet...
research
11/27/2019

A race-DC in Big Data

The strategy of divide-and-combine (DC) has been widely used in the area...
research
06/12/2020

Detangling robustness in high dimensions: composite versus model-averaged estimation

Robust methods, though ubiquitous in practice, are yet to be fully under...
research
11/05/2020

Conditional quantile estimators: A small sample theory

This paper studies the small sample properties and bias of just-identifi...
research
03/29/2020

Naive linkage error corrected dual system estimation

The utility of capture-recapture methods is offset by strong underlying ...

Please sign up or login with your details

Forgot password? Click here to reset