High-dimensional semi-supervised learning: in search for optimal inference of the mean

02/02/2019
by Yuqian Zhang, et al.

We provide a high-dimensional semi-supervised inference framework focused on the mean and variance of the response. Our data comprise an extensive set of observations of the covariate vectors alone and a much smaller set of labeled observations in which we observe both the response and the covariates. We allow the dimension of the covariates to be much larger than the sample size and impose only weak conditions on the statistical form of the data. We provide new estimators of the mean and variance of the response that extend some recent results for low-dimensional models. In particular, in some cases we do not require consistent estimation of the functional form of the data. Along with estimates of the population mean and variance, we provide their asymptotic distributions and confidence intervals, and we showcase gains in efficiency compared to the sample mean and variance. With minor modifications, our procedure is then shown to contribute to inference about average treatment effects. We also investigate the robustness of estimation and coverage and demonstrate the broad applicability and generality of the proposed method.
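
To make the setting concrete, here is a minimal sketch of a regression-adjusted mean estimate that uses both labeled and unlabeled covariates. It is not the estimator proposed in the paper: the ridge fit, the function names `fit_ridge` and `semi_supervised_mean`, the `penalty` parameter, and the simulated data are all assumptions chosen for illustration, and the example is deliberately low-dimensional.

```python
import numpy as np

def fit_ridge(X, y, penalty=1.0):
    """Ridge regression with an unpenalized intercept (illustrative stand-in)."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    p = X.shape[1]
    beta = np.linalg.solve(Xc.T @ Xc + penalty * np.eye(p), Xc.T @ yc)
    intercept = y_mean - x_mean @ beta
    return intercept, beta

def semi_supervised_mean(X_lab, y_lab, X_unlab, penalty=1.0):
    """Average fitted values over all covariates, plus the mean labeled residual."""
    intercept, beta = fit_ridge(X_lab, y_lab, penalty)
    X_all = np.vstack([X_lab, X_unlab])
    pred_all = intercept + X_all @ beta
    resid_lab = y_lab - (intercept + X_lab @ beta)
    # The residual term corrects for bias in the fitted regression function.
    return pred_all.mean() + resid_lab.mean()

# Simulated data (hypothetical): few labeled rows, many unlabeled rows.
rng = np.random.default_rng(0)
n_lab, n_unlab, p = 100, 5000, 20
beta_true = np.zeros(p)
beta_true[:3] = [2.0, -1.0, 0.5]
X_lab = rng.normal(loc=1.0, size=(n_lab, p))
X_unlab = rng.normal(loc=1.0, size=(n_unlab, p))
y_lab = X_lab @ beta_true + rng.normal(size=n_lab)

estimate = semi_supervised_mean(X_lab, y_lab, X_unlab)
sample_mean = y_lab.mean()  # supervised baseline using labels only
print(f"semi-supervised: {estimate:.3f}, sample mean: {sample_mean:.3f}")
```

In this simulation the true mean of the response is 1.5 (every covariate has mean 1, and the nonzero coefficients sum to 1.5). With no unlabeled rows the function returns exactly the labeled sample mean, so any efficiency gain comes from the extra covariate observations.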

research
06/23/2016

Semi-supervised Inference: General Theory and Estimation of Means

We propose a general semi-supervised inference framework focused on the ...
research
01/03/2022

A General Framework for Treatment Effect Estimation in Semi-Supervised and High Dimensional Settings

In this article, we aim to provide a general and complete understanding ...
research
01/25/2022

Semi-Supervised Quantile Estimation: Robust and Efficient Inference in High Dimensional Settings

We consider quantile estimation in a semi-supervised setting, characteri...
research
06/16/2018

Semi-supervised Inference for Explained Variance in High-dimensional Linear Regression and Its Applications

We consider statistical inference for the explained variance β^⊤Σβ under ...
research
07/02/2020

High-dimensional MANOVA via Bootstrapping and its Application to Functional and Sparse Count Data

We propose a new approach to the problem of high-dimensional multivariat...
research
11/12/2021

Dynamic treatment effects: high-dimensional inference under model misspecification

This paper considers the inference for heterogeneous treatment effects i...
research
09/04/2023

Challenges of the inconsistency regime: Novel debiasing methods for missing data models

We study semi-parametric estimation of the population mean when data is ...
