Independence-Encouraging Subsampling for Nonparametric Additive Models

02/26/2023
by   Yi Zhang, et al.
0

The additive model is a popular nonparametric regression method due to its ability to retain modeling flexibility while avoiding the curse of dimensionality. The backfitting algorithm is an intuitive and widely used numerical approach for fitting additive models. However, its application to large datasets may incur a high computational cost and is thus infeasible in practice. To address this problem, we propose a novel approach called independence-encouraging subsampling (IES) to select a subsample from big data for training additive models. Inspired by the minimax optimality of an orthogonal array (OA) due to its pairwise independent predictors and uniform coverage for the range of each predictor, the IES approach selects a subsample that approximates an OA to achieve the minimax optimality. Our asymptotic analyses demonstrate that an IES subsample converges to an OA and that the backfitting algorithm over the subsample converges to a unique solution even if the predictors are highly dependent in the original big data. The proposed IES method is also shown to be numerically appealing via simulations and a real data application.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/09/2016

Sparse additive Gaussian process with soft interactions

Additive nonparametric regression models provide an attractive tool for ...
research
06/17/2019

An Optimal Test for the Additive Model with Discrete or Categorical Predictors

In multivariate nonparametric regression the additive models are very us...
research
01/31/2016

Additive Approximations in High Dimensional Nonparametric Regression via the SALSA

High dimensional nonparametric regression is an inherently difficult pro...
research
11/01/2018

Sparse Model Identification and Learning for Ultra-high-dimensional Additive Partially Linear Models

The additive partially linear model (APLM) combines the flexibility of n...
research
05/04/2021

Semiparametric Spatiotemporal Model with Mixed Frequencies

In modelling time series data coming from different sources, frequencies...
research
12/04/2022

Classification by sparse additive models

We consider (nonparametric) sparse additive models (SpAM) for classifica...

Please sign up or login with your details

Forgot password? Click here to reset