
Online Debiased Lasso

by Ruijian Han, et al.

We propose an online debiased lasso (ODL) method for statistical inference in high-dimensional linear models with streaming data. The proposed ODL consists of an efficient computational algorithm for streaming data and approximately normal estimators of the regression coefficients. At each stage of the analysis, its implementation requires only the current data batch in the stream and sufficient statistics of the historical data. A new dynamic procedure selects and updates the tuning parameters upon the arrival of each new data batch, so that the amount of regularization is adjusted adaptively along the data stream. The asymptotic normality of the ODL estimator is established under conditions similar to those in the offline setting, together with mild conditions on the sizes of the data batches in the stream; this provides theoretical justification for the proposed online statistical inference procedure. We conduct extensive numerical experiments to evaluate the performance of ODL. These experiments demonstrate the effectiveness of our algorithm and support the theoretical results. An air quality dataset is analyzed to illustrate the application of the proposed method.
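To make the idea of working from the current batch plus sufficient statistics concrete, here is a minimal Python sketch. It is not the ODL algorithm from the paper. It assumes the running sums X'X and X'y serve as the sufficient statistics, solves the lasso by coordinate descent on those sums, and applies a standard nodewise-lasso debiasing step that likewise needs only the sums. All names (`lasso_from_gram`, `debias_from_gram`, `StreamingDebiasedLasso`) are hypothetical, and the tuning rule sqrt(log p / n) is a common textbook rate used as a stand-in for the paper's dynamic procedure.

```python
import numpy as np

def soft_threshold(z, t):
    """Scalar soft-thresholding operator."""
    return np.sign(z) * max(abs(z) - t, 0.0)

def lasso_from_gram(G, c, lam, beta0=None, n_sweeps=200):
    """Coordinate descent for min_b 0.5*b'Gb - c'b + lam*||b||_1.

    With G = X'X/n and c = X'y/n this is the usual lasso objective,
    so the raw data are not needed.
    """
    p = G.shape[0]
    beta = np.zeros(p) if beta0 is None else beta0.copy()
    for _ in range(n_sweeps):
        for j in range(p):
            if G[j, j] <= 0.0:
                beta[j] = 0.0
                continue
            # Correlation with the partial residual that excludes coordinate j.
            r = c[j] - G[j] @ beta + G[j, j] * beta[j]
            beta[j] = soft_threshold(r, lam) / G[j, j]
    return beta

def debias_from_gram(G, c, beta, lam_node):
    """Debias each coordinate via nodewise lasso, using only G and c."""
    p = G.shape[0]
    out = np.empty(p)
    for j in range(p):
        idx = np.delete(np.arange(p), j)
        # Regress column j on the other columns (nodewise lasso).
        gamma = lasso_from_gram(G[np.ix_(idx, idx)], G[idx, j], lam_node)
        # w encodes the residual direction z_j = X w = x_j - X_{-j} gamma.
        w = np.zeros(p)
        w[j] = 1.0
        w[idx] = -gamma
        # z_j'(y - X beta)/n = w'(c - G beta) and z_j'x_j/n = w'G[:, j].
        out[j] = beta[j] + w @ (c - G @ beta) / (w @ G[:, j])
    return out

class StreamingDebiasedLasso:
    """Keeps only running sums; each batch can be discarded after update()."""

    def __init__(self, p):
        self.n = 0
        self.S = np.zeros((p, p))  # running X'X
        self.u = np.zeros(p)       # running X'y
        self.beta = np.zeros(p)    # latest lasso estimate (warm start)

    def update(self, X, y):
        self.n += X.shape[0]
        self.S += X.T @ X
        self.u += X.T @ y
        G, c = self.S / self.n, self.u / self.n
        # Assumed tuning rate, NOT the paper's dynamic selection procedure.
        lam = np.sqrt(np.log(len(c)) / self.n)
        self.beta = lasso_from_gram(G, c, lam, beta0=self.beta)
        return debias_from_gram(G, c, self.beta, lam)
```

A caller would construct `StreamingDebiasedLasso(p)` once and call `update(X_batch, y_batch)` as each batch arrives; the return value is the current debiased estimate. Storing the p-by-p matrix of sums costs O(p^2) memory, which is one reason this is only an illustration of the principle rather than a recipe for very large p.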




Statistical Inference in High-dimensional Generalized Linear Models with Streaming Data

In this paper we develop an online statistical inference approach for hi...

Inference on High-dimensional Single-index Models with Streaming Data

Traditional statistical methods are faced with new challenges due to str...

Statistical Inference on Partially Linear Panel Model under Unobserved Linearity

A new statistical procedure, based on a modified spline basis, is propos...

Adaptive Estimation and Statistical Inference for High-Dimensional Graph-Based Linear Models

We consider adaptive estimation and statistical inference for high-dimen...

Real-Time Regression Analysis of Streaming Clustered Data With Possible Abnormal Data Batches

This paper develops an incremental learning algorithm based on quadratic...

Scalable inference in functional linear regression with streaming data

Traditional static functional data analysis is facing new challenges due...

Statistical Inference for Streamed Longitudinal Data

Modern longitudinal data, for example from wearable devices, measures bi...