Online Debiased Lasso

06/10/2021
by Ruijian Han, et al.

We propose an online debiased lasso (ODL) method for statistical inference in high-dimensional linear models with streaming data. The proposed ODL consists of an efficient computational algorithm for streaming data and approximately normal estimators of the regression coefficients. At each stage of the analysis, its implementation requires only the current data batch in the stream and sufficient statistics of the historical data. A new dynamic procedure selects and updates the tuning parameters upon the arrival of each new data batch, so that the amount of regularization can be adjusted adaptively along the data stream. The asymptotic normality of the ODL estimator is established under conditions similar to those in the offline setting, together with mild conditions on the size of the data batches in the stream. This provides theoretical justification for the proposed online statistical inference procedure. We conduct extensive numerical experiments to evaluate the performance of ODL. These experiments demonstrate the effectiveness of our algorithm and support the theoretical results. An air quality dataset is analyzed to illustrate the application of the proposed method.
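
To make the "current batch plus sufficient statistics" idea concrete, here is a minimal sketch of a lasso fit that never revisits past batches. It is not the ODL algorithm from the paper: it omits the debiasing step and the adaptive tuning-parameter selection, and it assumes the sufficient statistics are the running sums X^T X and X^T y. The class name `StreamingLasso`, its methods, and the fixed penalty `lam` are hypothetical names chosen for illustration.

```python
# Illustrative sketch only, not the authors' ODL procedure.
# Assumption: the historical data are summarized by the running sums
# X^T X and X^T y, which is all the lasso objective
# (1/(2n)) * ||y - X b||^2 + lam * ||b||_1 needs (up to a constant).


def soft_threshold(z, t):
    """Soft-thresholding operator used in lasso coordinate descent."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


class StreamingLasso:
    """Hypothetical helper: lasso from accumulated sufficient statistics."""

    def __init__(self, p):
        self.p = p
        self.n = 0                                  # observations seen so far
        self.gram = [[0.0] * p for _ in range(p)]   # running X^T X
        self.xty = [0.0] * p                        # running X^T y

    def update(self, X_batch, y_batch):
        """Fold one batch into the running sums; the batch can then be dropped."""
        for x, y in zip(X_batch, y_batch):
            self.n += 1
            for j in range(self.p):
                self.xty[j] += x[j] * y
                for k in range(self.p):
                    self.gram[j][k] += x[j] * x[k]

    def fit(self, lam, n_iter=200):
        """Cyclic coordinate descent using only gram, xty and n."""
        beta = [0.0] * self.p
        for _ in range(n_iter):
            for j in range(self.p):
                if self.gram[j][j] == 0.0:
                    continue  # column with no signal; leave coefficient at 0
                # Correlation of feature j with the partial residual.
                rho = self.xty[j] - sum(
                    self.gram[j][k] * beta[k] for k in range(self.p) if k != j
                )
                beta[j] = soft_threshold(rho / self.n, lam) / (
                    self.gram[j][j] / self.n
                )
        return beta


# Two batches arrive in sequence; only the summaries are retained.
model = StreamingLasso(p=2)
model.update([[1.0, 0.0], [0.0, 1.0]], [3.0, 0.0])  # first batch
model.update([[1.0, 0.0], [0.0, 1.0]], [3.0, 0.0])  # second batch
beta = model.fit(lam=0.5)
```

In this toy example the second coefficient is shrunk exactly to zero and the first is shrunk toward zero by the penalty. That shrinkage bias is what a debiasing step is meant to correct before confidence intervals are formed.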


research
08/10/2021

Statistical Inference in High-dimensional Generalized Linear Models with Streaming Data

In this paper we develop an online statistical inference approach for hi...
research
10/03/2022

Inference on High-dimensional Single-index Models with Streaming Data

Traditional statistical methods are faced with new challenges due to str...
research
11/20/2019

Statistical Inference on Partially Linear Panel Model under Unobserved Linearity

A new statistical procedure, based on a modified spline basis, is propos...
research
01/29/2020

Adaptive Estimation and Statistical Inference for High-Dimensional Graph-Based Linear Models

We consider adaptive estimation and statistical inference for high-dimen...
research
06/30/2021

Real-Time Regression Analysis of Streaming Clustered Data With Possible Abnormal Data Batches

This paper develops an incremental learning algorithm based on quadratic...
research
08/04/2022

Statistical Inference for Streamed Longitudinal Data

Modern longitudinal data, for example from wearable devices, measures bi...
research
02/05/2023

Scalable inference in functional linear regression with streaming data

Traditional static functional data analysis is facing new challenges due...
