High Dimensional M-Estimation with Missing Outcomes: A Semi-Parametric Framework

11/26/2019
by   Abhishek Chakrabortty, et al.
17

We consider high dimensional M-estimation in settings where the response Y is possibly missing at random and the covariates X∈R^p can be high dimensional compared to the sample size n. The parameter of interest θ_0 ∈R^d is defined as the minimizer of the risk of a convex loss, under a fully non-parametric model, and θ_0 itself is high dimensional which is a key distinction from existing works. Standard high dimensional regression and series estimation with possibly misspecified models and missing Y are included as special cases, as well as their counterparts in causal inference using 'potential outcomes'. Assuming θ_0 is s-sparse (s ≪ n), we propose an L_1-regularized debiased and doubly robust (DDR) estimator of θ_0 based on a high dimensional adaptation of the traditional double robust (DR) estimator's construction. Under mild tail assumptions and arbitrarily chosen (working) models for the propensity score (PS) and the outcome regression (OR) estimators, satisfying only some high-level conditions, we establish finite sample performance bounds for the DDR estimator showing its (optimal) L_2 error rate to be √(s (log d)/ n) when both models are correct, and its consistency and DR properties when only one of them is correct. Further, when both the models are correct, we propose a desparsified version of our DDR estimator that satisfies an asymptotic linear expansion and facilitates inference on low dimensional components of θ_0. Finally, we discuss various of choices of high dimensional parametric/semi-parametric working models for the PS and OR estimators. All results are validated via detailed simulations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/05/2023

Debiased Regression Adjustment in Completely Randomized Experiments with Moderately High-dimensional Covariates

Completely randomized experiment is the gold standard for causal inferen...
research
12/06/2021

On the computation of a non-parametric estimator by convex optimization

Estimation of linear functionals from observed data is an important task...
research
06/05/2018

High-Dimensional Econometrics and Regularized GMM

This chapter presents key concepts and theoretical results for analyzing...
research
09/26/2017

On Stein's Identity and Near-Optimal Estimation in High-dimensional Index Models

We consider estimating the parametric components of semi-parametric mult...
research
11/11/2020

Learning a high-dimensional classification rule using auxiliary outcomes

Correlated outcomes are common in many practical problems. Based on a de...
research
09/02/2019

Asymptotic linear expansion of regularized M-estimators

Parametric high-dimensional regression analysis requires the usage of re...

Please sign up or login with your details

Forgot password? Click here to reset