Optimal One-pass Nonparametric Estimation Under Memory Constraint

08/18/2022
by   Mingxue Quan, et al.
0

For nonparametric regression in the streaming setting, where data constantly flow in and require real-time analysis, a main challenge is that data are cleared from the computer system once processed due to limited computer memory and storage. We tackle the challenge by proposing a novel one-pass estimator based on penalized orthogonal basis expansions and developing a general framework to study the interplay between statistical efficiency and memory consumption of estimators. We show that, the proposed estimator is statistically optimal under memory constraint, and has asymptotically minimal memory footprints among all one-pass estimators of the same estimation quality. Numerical studies demonstrate that the proposed one-pass estimator is nearly as efficient as its non-streaming counterpart that has access to all historical data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2019

Communication and Memory Efficient Testing of Discrete Distributions

We study distribution testing with communication and memory constraints ...
research
02/05/2023

Scalable inference in functional linear regression with streaming data

Traditional static functional data analysis is facing new challenges due...
research
10/18/2018

Quantile Regression Under Memory Constraint

This paper studies the inference problem in quantile regression (QR) for...
research
09/05/2015

A commentary on "The now-or-never bottleneck: a fundamental constraint on language", by Christiansen and Chater (2016)

In a recent article, Christiansen and Chater (2016) present a fundamenta...
research
04/16/2019

A Global Bias-Correction DC Method for Biased Estimation under Memory Constraint

This paper introduces a global bias-correction divide-and-conquer (GBC-D...
research
04/13/2023

Subsampling and Jackknifing: A Practically Convenient Solution for Large Data Analysis with Limited Computational Resources

Modern statistical analysis often encounters datasets with large sizes. ...
research
09/12/2022

On Nonparametric Estimation in Online Problems

Offline estimators are often inadequate for real-time applications. Neve...

Please sign up or login with your details

Forgot password? Click here to reset