Learning Sparse Additive Models with Interactions in High Dimensions

04/18/2016
by   Hemant Tyagi, et al.
0

A function f: R^d →R is referred to as a Sparse Additive Model (SPAM), if it is of the form f(x) = ∑_l ∈Sϕ_l(x_l), where S⊂ [d], |S| ≪ d. Assuming ϕ_l's and S to be unknown, the problem of estimating f from its samples has been studied extensively. In this work, we consider a generalized SPAM, allowing for second order interaction terms. For some S_1 ⊂ [d], S_2 ⊂[d] 2, the function f is assumed to be of the form: f(x) = ∑_p ∈S_1ϕ_p (x_p) + ∑_(l,l^') ∈S_2ϕ_(l,l^') (x_l,x_l^'). Assuming ϕ_p,ϕ_(l,l^'), S_1 and, S_2 to be unknown, we provide a randomized algorithm that queries f and exactly recovers S_1,S_2. Consequently, this also enables us to estimate the underlying ϕ_p, ϕ_(l,l^'). We derive sample complexity bounds for our scheme and also extend our analysis to include the situation where the queries are corrupted with noise -- either stochastic, or arbitrary but bounded. Lastly, we provide simulation results on synthetic data, that validate our theoretical findings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/02/2016

Algorithms for Learning Sparse Additive Models with Interactions in High Dimensions

A function f: R^d →R is a Sparse Additive Model (SPAM), if it is of the ...
research
09/09/2022

Sample Complexity Bounds for Learning High-dimensional Simplices in Noisy Regimes

In this paper, we propose a sample complexity bound for learning a simpl...
research
01/13/2023

Non-Stochastic CDF Estimation Using Threshold Queries

Estimating the empirical distribution of a scalar-valued data set is a b...
research
05/08/2019

Multi-target Detection with an Arbitrary Spacing Distribution

Motivated by the structure reconstruction problem in cryo-electron micro...
research
02/24/2019

Testing Preferential Domains Using Sampling

A preferential domain is a collection of sets of preferences which are l...
research
03/05/2015

High Dimensional Bayesian Optimisation and Bandits via Additive Models

Bayesian Optimisation (BO) is a technique used in optimising a D-dimensi...

Please sign up or login with your details

Forgot password? Click here to reset