Learning with latent group sparsity via heat flow dynamics on networks

01/20/2022
by Subhroshekhar Ghosh, et al.

Group or cluster structure on explanatory variables in machine learning problems is a very general phenomenon, which has attracted broad interest from practitioners and theoreticians alike. In this work we contribute an approach to learning under such group structure that does not require prior information on the group identities. Our paradigm is motivated by the Laplacian geometry of an underlying network with a related community structure, and proceeds by directly incorporating this geometry into a penalty that is effectively computed via heat-flow-based local network dynamics. In fact, we demonstrate a procedure to construct such a network from the available data. Notably, we dispense with computationally intensive pre-processing involving clustering of variables, spectral or otherwise. Our technique is underpinned by rigorous theorems that guarantee its effective performance and provide bounds on its sample complexity. In particular, in a wide range of settings, it provably suffices to run the heat flow dynamics for a time that is only logarithmic in the problem dimensions. We explore in detail the interfaces of our approach with key statistical physics models in network science, such as the Gaussian Free Field and the Stochastic Block Model. We validate our approach by successful applications to real-world data from a wide array of application domains, including computer science, genetics, climatology and economics. Our work raises the possibility of applying similar diffusion-based techniques to classical learning tasks, exploiting the interplay between geometric, dynamical and stochastic structures underlying the data.
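
To make the notion of heat flow on a network concrete, the following is a minimal illustrative sketch, not the penalty or algorithm of the paper. It assumes a small hypothetical graph (two 4-node cliques joined by one edge), the combinatorial Laplacian L = D - A, and an exact solution of du/dt = -Lu through an eigendecomposition. The helper names `two_block_laplacian` and `heat_flow` are invented for this example.

```python
import numpy as np

def two_block_laplacian(block_size=4):
    """Laplacian L = D - A of two cliques joined by a single bridge edge
    (a hypothetical toy graph with an obvious two-community structure)."""
    n = 2 * block_size
    A = np.zeros((n, n))
    A[:block_size, :block_size] = 1.0   # first clique
    A[block_size:, block_size:] = 1.0   # second clique
    np.fill_diagonal(A, 0.0)            # no self-loops
    # single bridge edge between the two communities
    A[block_size - 1, block_size] = 1.0
    A[block_size, block_size - 1] = 1.0
    return np.diag(A.sum(axis=1)) - A

def heat_flow(L, u0, t):
    """Solve du/dt = -L u exactly: u(t) = V exp(-t * diag(w)) V^T u0."""
    w, V = np.linalg.eigh(L)            # L is symmetric
    return V @ (np.exp(-t * w) * (V.T @ u0))

L = two_block_laplacian()
u0 = np.zeros(8)
u0[0] = 1.0                             # unit of heat placed on node 0

u_short = heat_flow(L, u0, 1.0)         # short time: heat mostly in node 0's block
u_long = heat_flow(L, u0, 200.0)        # long time: heat spread uniformly
```

In this toy setting, heat equalizes quickly inside a densely connected block and leaks only slowly across the bridge, so the short-time profile reflects the community structure without any explicit clustering step. Total heat is conserved throughout because the rows of L sum to zero.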


