Inference on High-dimensional Single-index Models with Streaming Data

10/03/2022
by   Dongxiao Han, et al.
0

Traditional statistical methods are faced with new challenges due to streaming data. The major challenge is the rapidly growing volume and velocity of data, which makes storing such huge datasets in memory impossible. The paper presents an online inference framework for regression parameters in high-dimensional semiparametric single-index models with unknown link functions. The proposed online procedure updates only the current data batch and summary statistics of historical data instead of re-accessing the entire raw data set. At the same time, we do not need to estimate the unknown link function, which is a highly challenging task. In addition, a generalized convex loss function is used in the proposed inference procedure. To illustrate the proposed method, we use the Huber loss function and the logistic regression model's negative log-likelihood. In this study, the asymptotic normality of the proposed online debiased Lasso estimators and the bounds of the proposed online Lasso estimators are investigated. To evaluate the performance of the proposed method, extensive simulation studies have been conducted. We provide applications to Nasdaq stock prices and financial distress datasets.

READ FULL TEXT

page 26

page 27

research
08/10/2021

Statistical Inference in High-dimensional Generalized Linear Models with Streaming Data

In this paper we develop an online statistical inference approach for hi...
research
06/10/2021

Online Debiased Lasso

We propose an online debiased lasso (ODL) method for statistical inferen...
research
02/05/2023

Scalable inference in functional linear regression with streaming data

Traditional static functional data analysis is facing new challenges due...
research
10/11/2022

Renewable Learning for Multiplicative Regression with Streaming Datasets

When large amounts of data continuously arrive in streams, online updati...
research
04/14/2022

Observable adjustments in single-index models for regularized M-estimators

We consider observations (X,y) from single index models with unknown lin...
research
06/30/2021

Real-Time Regression Analysis of Streaming Clustered Data With Possible Abnormal Data Batches

This paper develops an incremental learning algorithm based on quadratic...
research
09/08/2019

Inference In General Single-Index Models Under High-dimensional Symmetric Designs

We consider the problem of statistical inference for a finite number of ...

Please sign up or login with your details

Forgot password? Click here to reset