Algorithms for Learning Sparse Additive Models with Interactions in High Dimensions

05/02/2016
by   Hemant Tyagi, et al.
A function f: R^d → R is a Sparse Additive Model (SPAM) if it is of the form f(x) = ∑_{l ∈ S} ϕ_l(x_l), where S ⊂ [d] and |S| ≪ d. Assuming the ϕ_l's and S to be unknown, there exists extensive work on estimating f from its samples. In this work, we consider a generalized version of SPAMs that also allows for a sparse number of second-order interaction terms. For some S_1 ⊂ [d] and S_2 ⊂ \binom{[d]}{2} (pairs of distinct elements of [d]), with |S_1| ≪ d and |S_2| ≪ d^2, the function f is now assumed to be of the form f(x) = ∑_{p ∈ S_1} ϕ_p(x_p) + ∑_{(l,l') ∈ S_2} ϕ_{(l,l')}(x_l, x_{l'}). Assuming we have the freedom to query f anywhere in its domain, we derive efficient algorithms that provably recover S_1 and S_2 with finite sample bounds. Our analysis covers the noiseless setting, where exact samples of f are obtained, and also extends to the noisy setting, where the queries are corrupted with noise. For the noisy setting, we consider two noise models: i.i.d. Gaussian noise, and arbitrary but bounded noise. Our main methods for identifying S_2 essentially rely on the estimation of sparse Hessian matrices, for which we provide two novel compressed-sensing-based schemes. Once S_1 and S_2 are known, we show how the individual components ϕ_p and ϕ_{(l,l')} can be estimated via additional queries of f, with uniform error bounds. Lastly, we provide simulation results on synthetic data that validate our theoretical findings.
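To make the model concrete, the following is a minimal Python sketch, not the algorithm of the paper. It builds a toy f of the form above and identifies S_2 by checking which off-diagonal Hessian entries are nonzero, using plain mixed finite differences over all pairs. The dimension d, the supports, the component functions, and the helper names `mixed_difference` and `estimate_pairs` are all assumptions chosen for illustration. This naive scan queries every one of the d(d-1)/2 pairs, whereas the abstract describes compressed-sensing-based schemes that exploit the sparsity of the Hessian.

```python
import itertools
import numpy as np

d = 6  # ambient dimension (hypothetical toy value)

def f(x):
    """Toy SPAM with interactions; supports and components are made up.

    S_1 = {0, 3}, S_2 = {(1, 2), (4, 5)} in 0-based indexing.
    """
    univariate = np.sin(x[0]) + x[3] ** 2
    bivariate = x[1] * x[2] + np.cos(x[4] + x[5])
    return univariate + bivariate

def mixed_difference(func, x, i, j, h=1e-2):
    """Finite-difference estimate of the Hessian entry d^2 f / dx_i dx_j."""
    e_i = np.zeros_like(x)
    e_j = np.zeros_like(x)
    e_i[i] = h
    e_j[j] = h
    return (func(x + e_i + e_j) - func(x + e_i) - func(x + e_j) + func(x)) / h**2

def estimate_pairs(func, dim, n_points=5, tol=1e-6, seed=0):
    """Return pairs (i, j), i < j, whose mixed difference is nonzero somewhere.

    For a function of the additive form above, the mixed difference vanishes
    (up to rounding) for every pair that does not share a bivariate component.
    """
    rng = np.random.default_rng(seed)
    points = rng.uniform(-1.0, 1.0, size=(n_points, dim))
    found = []
    for i, j in itertools.combinations(range(dim), 2):
        if any(abs(mixed_difference(func, x, i, j)) > tol for x in points):
            found.append((i, j))
    return found

recovered_S2 = estimate_pairs(f, d)
print(recovered_S2)
```

The check is repeated at several random points because a single Hessian entry of an interacting pair can happen to vanish at one particular location. The threshold `tol` is only suitable for exact (noiseless) queries; the noisy settings mentioned in the abstract would need a step size and threshold chosen with the noise level in mind.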


