Distributed Semi-Supervised Sparse Statistical Inference

06/17/2023
by   Jiyuan Tu, et al.
0

This paper is devoted to studying the semi-supervised sparse statistical inference in a distributed setup. An efficient multi-round distributed debiased estimator, which integrates both labeled and unlabelled data, is developed. We will show that the additional unlabeled data helps to improve the statistical rate of each round of iteration. Our approach offers tailored debiasing methods for M-estimation and generalized linear model according to the specific form of the loss function. Our method also applies to a non-smooth loss like absolute deviation loss. Furthermore, our algorithm is computationally efficient since it requires only one estimation of a high-dimensional inverse covariance matrix. We demonstrate the effectiveness of our method by presenting simulation studies and real data applications that highlight the benefits of incorporating unlabeled data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/16/2018

Semi-supervised Inference for Explained Variance in High-dimensional Linear Regression and Its Applications

We consider statistical inference for the explained variance β^Σβ under ...
research
11/28/2020

Optimal Semi-supervised Estimation and Inference for High-dimensional Linear Regression

There are many scenarios such as the electronic health records where the...
research
06/02/2020

A generalized linear joint trained framework for semi-supervised learning of sparse features

The elastic-net is among the most widely used types of regularization al...
research
10/15/2022

Distributed Estimation and Inference for Semi-parametric Binary Response Models

The development of modern technology has enabled data collection of unpr...
research
09/14/2020

Semi-supervised learning and the question of true versus estimated propensity scores

A straightforward application of semi-supervised machine learning to the...
research
05/03/2016

Efficient Distributed Estimation of Inverse Covariance Matrices

In distributed systems, communication is a major concern due to issues s...
research
11/25/2018

Generalized R^2 Measures for a Mixture of Bivariate Linear Dependences

Motivated by the pressing needs for capturing complex but interperetable...

Please sign up or login with your details

Forgot password? Click here to reset