Distributed Estimation and Inference for Semi-parametric Binary Response Models

10/15/2022
by Xi Chen, et al.

The development of modern technology has enabled data collection of unprecedented size, which poses new challenges to many statistical estimation and inference problems. This paper studies the maximum score estimator of a semi-parametric binary choice model under a distributed computing environment without pre-specifying the noise distribution. An intuitive divide-and-conquer estimator is computationally expensive and restricted by a non-regular constraint on the number of machines, due to the highly non-smooth nature of the objective function. We propose (1) a one-shot divide-and-conquer estimator after smoothing the objective to relax the constraint, and (2) a multi-round estimator to completely remove the constraint via iterative smoothing. We specify an adaptive choice of kernel smoother with a sequentially shrinking bandwidth to achieve the superlinear improvement of the optimization error over the multiple iterations. The improved statistical accuracy per iteration is derived, and a quadratic convergence up to the optimal statistical error rate is established. We further provide two generalizations to handle the heterogeneity of datasets with covariate shift and high-dimensional problems where the parameter of interest is sparse.
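As a rough illustration of idea (1), the sketch below smooths a maximum-score-type objective and averages the local maximizers across machines. It is a toy under stated assumptions, not the paper's procedure: the two-covariate model with the first coefficient normalized to 1, the logistic smoother, the fixed bandwidth `h`, the grid search, and all function names (`smooth_indicator`, `smoothed_score`, `local_estimate`, `one_shot_dc`, `simulate`) are hypothetical choices made for this example only.

```python
import math
import random


def smooth_indicator(t):
    """Logistic CDF written via tanh; an assumed smooth stand-in for 1{t >= 0}."""
    return 0.5 * (1.0 + math.tanh(0.5 * t))


def max_score(b, data):
    """Non-smooth score: average of (2y - 1) * 1{x1 + b*x2 >= 0}."""
    return sum((2 * y - 1) * (x1 + b * x2 >= 0) for x1, x2, y in data) / len(data)


def smoothed_score(b, data, h):
    """Same score with the indicator replaced by smooth_indicator(. / h)."""
    return sum((2 * y - 1) * smooth_indicator((x1 + b * x2) / h)
               for x1, x2, y in data) / len(data)


def local_estimate(data, h, grid):
    """Grid-search maximizer of the smoothed score on one machine."""
    return max(grid, key=lambda b: smoothed_score(b, data, h))


def one_shot_dc(batches, h, grid):
    """One-shot divide-and-conquer: average the local maximizers."""
    local = [local_estimate(batch, h, grid) for batch in batches]
    return sum(local) / len(local)


def simulate(n, b_true, rng):
    """Toy data: y = 1{x1 + b_true*x2 + noise >= 0} with median-zero noise."""
    data = []
    for _ in range(n):
        x1, x2 = rng.gauss(0, 1), rng.gauss(0, 1)
        noise = rng.gauss(0, 0.5)
        data.append((x1, x2, int(x1 + b_true * x2 + noise >= 0)))
    return data


rng = random.Random(0)
batches = [simulate(500, 0.5, rng) for _ in range(4)]  # 4 simulated "machines"
grid = [-1.0 + 0.05 * k for k in range(61)]            # candidate values in [-1, 2]
estimate = one_shot_dc(batches, h=0.3, grid=grid)
print(round(estimate, 3))
```

As `h` shrinks, `smoothed_score` approaches the non-smooth `max_score`. A multi-round variant in the spirit of idea (2) would re-solve with a smaller bandwidth at each iteration rather than keeping `h` fixed.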


