Distributed Feature Selection for High-dimensional Additive Models

05/16/2022
by   Yifan He, et al.
0

Distributed statistical learning is a common strategy for handling massive data where we divide the learning task into multiple local machines and aggregate the results afterward. However, most existing work considers the case where the samples are divided. In this work, we propose a new algorithm, DDAC-SpAM, that divides features under the high-dimensional sparse additive model. The new algorithm contains three steps: divide, decorrelate, and conquer. We show that after the decorrelation operation, every local estimator can recover the sparsity pattern for each additive component consistently without imposing strict constraints to the correlation structure among variables. Theoretical analysis of the aggregated estimator and empirical results on synthetic and real data illustrate that the DDAC-SpAM algorithm is effective and competitive in fitting sparse additive models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/15/2016

Communication-efficient Distributed Sparse Linear Discriminant Analysis

We propose a communication-efficient distributed estimation method for s...
research
06/25/2008

High-dimensional additive modeling

We propose a new sparsity-smoothness penalty for high-dimensional genera...
research
05/01/2017

Convex-constrained Sparse Additive Modeling and Its Extensions

Sparse additive modeling is a class of effective methods for performing ...
research
12/19/2018

Max-Diversity Distributed Learning: Theory and Algorithms

We study the risk performance of distributed learning for the regulariza...
research
03/10/2015

Post-Regularization Confidence Bands for High Dimensional Nonparametric Models with Local Sparsity

We propose a novel high dimensional nonparametric model named ATLAS whic...
research
12/07/2021

SHRIMP: Sparser Random Feature Models via Iterative Magnitude Pruning

Sparse shrunk additive models and sparse random feature models have been...
research
02/14/2013

Locally epistatic genomic relationship matrices for genomic association, prediction and selection

As the amount and complexity of genetic information increases it is nece...

Please sign up or login with your details

Forgot password? Click here to reset