Efficient Truncated Linear Regression with Unknown Noise Variance

08/25/2022
by Constantinos Daskalakis, et al.

Truncated linear regression is a classical challenge in statistics, wherein a label, y = w^T x + ε, and its corresponding feature vector, x ∈ ℝ^k, are observed only if the label falls in some subset S ⊆ ℝ; otherwise, the very existence of the pair (x, y) is hidden from observation. In its general form, linear regression with truncated observations has remained a challenge since the early works of <cit.>. When the error is normally distributed with known variance, recent work of <cit.> provides computationally and statistically efficient estimators of the linear model, w. In this paper, we provide the first computationally and statistically efficient estimators for truncated linear regression when the noise variance is unknown, estimating both the linear model and the variance of the noise. Our estimator is based on an efficient implementation of Projected Stochastic Gradient Descent on the negative log-likelihood of the truncated sample. Importantly, we show that the error of our estimates is asymptotically normal, and we use this to provide explicit confidence regions for our estimates.
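
To make the estimation idea concrete, the following is a minimal sketch of projected stochastic gradient descent on the truncated negative log-likelihood. It is an illustration under simplifying assumptions, not the paper's algorithm: the truncation set is assumed to be a half-line S = [a, ∞), so the probability of landing in S has a closed form; the parameters are taken to be (w, σ) directly; and the projection set (a ball for w, an interval for σ) and the step-size schedule are arbitrary choices. All function names and default values are hypothetical. The paper's own parameterization, gradient estimator, and projection set are not given in this abstract and may differ.

```python
import math
import random

SQRT2 = math.sqrt(2.0)


def _hazard_and_log_survival(alpha):
    """Return (phi(alpha) / (1 - Phi(alpha)), log(1 - Phi(alpha)))."""
    # Clamp the survival probability to avoid division by zero on underflow
    # (a crude numerical guard, adequate only for a sketch).
    survival = max(0.5 * math.erfc(alpha / SQRT2), 1e-300)
    density = math.exp(-0.5 * alpha * alpha) / math.sqrt(2.0 * math.pi)
    return density / survival, math.log(survival)


def nll(w, sigma, x, y, a):
    """Negative log-density of y given x, conditioned on y >= a (assumed S)."""
    mu = sum(wi * xi for wi, xi in zip(w, x))
    z = (y - mu) / sigma
    _, log_surv = _hazard_and_log_survival((a - mu) / sigma)
    return 0.5 * z * z + math.log(sigma) + log_surv + 0.5 * math.log(2.0 * math.pi)


def nll_grad(w, sigma, x, y, a):
    """Gradient of nll with respect to (w, sigma) for one observed pair."""
    mu = sum(wi * xi for wi, xi in zip(w, x))
    z = (y - mu) / sigma
    alpha = (a - mu) / sigma
    h, _ = _hazard_and_log_survival(alpha)
    d_mu = (h - z) / sigma
    d_sigma = (1.0 - z * z + h * alpha) / sigma
    return [d_mu * xi for xi in x], d_sigma


def project(w, sigma, radius, sigma_min, sigma_max):
    """Project w onto an L2 ball and clamp sigma to [sigma_min, sigma_max]."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    if norm > radius:
        w = [wi * radius / norm for wi in w]
    return w, min(max(sigma, sigma_min), sigma_max)


def psgd(data, a, k, steps=20000, lr=0.05, radius=5.0,
         sigma_min=0.1, sigma_max=5.0, seed=0):
    """Projected SGD: one random observation per step, step size lr / sqrt(t)."""
    rng = random.Random(seed)
    w, sigma = [0.0] * k, 1.0
    for t in range(1, steps + 1):
        x, y = rng.choice(data)
        gw, gs = nll_grad(w, sigma, x, y, a)
        eta = lr / math.sqrt(t)
        w = [wi - eta * gi for wi, gi in zip(w, gw)]
        w, sigma = project(w, sigma - eta * gs, radius, sigma_min, sigma_max)
    return w, sigma


def sample_truncated(w_true, sigma_true, a, n, seed=0):
    """Draw n pairs (x, y) by rejection: pairs with y < a are never observed."""
    rng = random.Random(seed)
    data = []
    while len(data) < n:
        x = [rng.gauss(0.0, 1.0) for _ in w_true]
        y = sum(wi * xi for wi, xi in zip(w_true, x)) + rng.gauss(0.0, sigma_true)
        if y >= a:
            data.append((x, y))
    return data
```

As a usage illustration, one might call `data = sample_truncated([1.0, -0.5], 1.0, 0.0, 5000)` followed by `psgd(data, 0.0, 2)` and compare the returned (w, σ) with the generating values. This sketch makes no claim about convergence rates or confidence regions; those are the subject of the paper's analysis.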

