Functional L-Optimality Subsampling for Massive Data

04/08/2021
by   Hua Liu, et al.
0

Massive data bring the big challenges of memory and computation for analysis. These challenges can be tackled by taking subsamples from the full data as a surrogate. For functional data, it is common to collect multiple measurements over their domains, which require even more memory and computation time when the sample size is large. The computation would be much more intensive when statistical inference is required through bootstrap samples. To the best of our knowledge, this article is the first attempt to study the subsampling method for the functional linear model. We propose an optimal subsampling method based on the functional L-optimality criterion. When the response is a discrete or categorical variable, we further extend our proposed functional L-optimality subsampling (FLoS) method to the functional generalized linear model. We establish the asymptotic properties of the estimators by the FLoS method. The finite sample performance of our proposed FLoS method is investigated by extensive simulation studies. The FLoS method is further demonstrated by analyzing two large-scale datasets: the global climate data and the kidney transplant data. The analysis results on these data show that the FLoS method is much better than the uniform subsampling approach and can well approximate the results based on the full data while dramatically reducing the computation time and memory.

READ FULL TEXT
research
10/03/2021

A Sequential Addressing Subsampling Method for Massive Data Analysis under Memory Constraint

The emergence of massive data in recent years brings challenges to autom...
research
02/05/2023

Scalable inference in functional linear regression with streaming data

Traditional static functional data analysis is facing new challenges due...
research
01/03/2023

Least product relative error estimation for functional multiplicative model and optimal subsampling

In this paper, we study the functional linear multiplicative model based...
research
05/28/2019

Sparse Estimation of Historical Functional Linear Models with a Nested Group Bridge Approach

The conventional historical functional linear model relates the current ...
research
02/04/2022

Model Averaging for Generalized Linear Models in Fragmentary Data Prediction

Fragmentary data is becoming more and more popular in many areas which b...
research
07/03/2020

Unified statistical inference for a novel nonlinear dynamic functional/longitudinal data model

In light of recent work studying massive functional/longitudinal data, s...

Please sign up or login with your details

Forgot password? Click here to reset