A debiased distributed estimation for sparse partially linear models in diverging dimensions

08/18/2017
by   Shaogao Lv, et al.
0

We consider a distributed estimation of the double-penalized least squares approach for high dimensional partial linear models, where the sample with a total of N data points is randomly distributed among m machines and the parameters of interest are calculated by merging their m individual estimators. This paper primarily focuses on the investigation of the high dimensional linear components in partial linear models, which is often of more interest. We propose a new debiased averaging estimator of parametric coefficients on the basis of each individual estimator, and establish new non-asymptotic oracle results in high dimensional and distributed settings, provided that m≤√(N/ p) and other mild conditions are satisfied, where p is the linear coefficient dimension. We also provide an experimental evaluation of the proposed method, indicating the numerical effectiveness on simulated data. Even under the classical non-distributed setting, we give the optimal rates of the parametric estimator with a looser tuning parameter limitation, which is required for our error analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/29/2021

Support Recovery with Stochastic Gates: Theory and Application for Linear Models

We analyze the problem of simultaneous support recovery and estimation o...
research
07/10/2014

On the Optimality of Averaging in Distributed Statistical Learning

A common approach to statistical learning with big-data is to randomly s...
research
10/16/2017

Sparse Linear Isotonic Models

In machine learning and data mining, linear models have been widely used...
research
07/21/2023

Sandwich Boosting for Accurate Estimation in Partially Linear Models for Grouped Data

We study partially linear models in settings where observations are arra...
research
02/28/2019

Distributed Learning with Sublinear Communication

In distributed statistical learning, N samples are split across m machin...
research
02/28/2020

Modelling High-Dimensional Categorical Data Using Nonconvex Fusion Penalties

We propose a method for estimation in high-dimensional linear models wit...
research
09/09/2019

Robust testing in generalized linear models by sign-flipping score contributions

Generalized linear models are often misspecified due to overdispersion, ...

Please sign up or login with your details

Forgot password? Click here to reset