Tree-Projected Gradient Descent for Estimating Gradient-Sparse Parameters on Graphs

05/31/2020
by   Sheng Xu, et al.
0

We study estimation of a gradient-sparse parameter vector θ^* ∈ℝ^p, having strong gradient-sparsity s^*:=∇_G θ^*_0 on an underlying graph G. Given observations Z_1,…,Z_n and a smooth, convex loss function ℒ for which θ^* minimizes the population risk 𝔼[ℒ(θ;Z_1,…,Z_n)], we propose to estimate θ^* by a projected gradient descent algorithm that iteratively and approximately projects gradient steps onto spaces of vectors having small gradient-sparsity over low-degree spanning trees of G. We show that, under suitable restricted strong convexity and smoothness assumptions for the loss, the resulting estimator achieves the squared-error risk s^*/nlog (1+p/s^*) up to a multiplicative constant that is independent of G. In contrast, previous polynomial-time algorithms have only been shown to achieve this guarantee in more specialized settings, or under additional assumptions for G and/or the sparsity pattern of ∇_G θ^*. As applications of our general framework, we apply our results to the examples of linear models and generalized linear models with random design.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/07/2012

Learning Model-Based Sparsity via Projected Gradient Descent

Several convex formulation methods have been proposed previously for sta...
research
11/29/2022

Estimating the minimizer and the minimum value of a regression function under passive design

We propose a new method for estimating the minimizer x^* and the minimum...
research
05/29/2020

Agnostic Learning of a Single Neuron with Gradient Descent

We consider the problem of learning the best-fitting single neuron as me...
research
08/02/2022

A Note on Zeroth-Order Optimization on the Simplex

We construct a zeroth-order gradient estimator for a smooth function def...
research
03/02/2023

Tight Risk Bounds for Gradient Descent on Separable Data

We study the generalization properties of unregularized gradient methods...
research
02/09/2022

Improving Computational Complexity in Statistical Models with Second-Order Information

It is known that when the statistical models are singular, i.e., the Fis...
research
08/12/2017

Sparse Coding and Autoencoders

In "Dictionary Learning" one tries to recover incoherent matrices A^* ∈R...

Please sign up or login with your details

Forgot password? Click here to reset