Semi-supervised Learning in Network-Structured Data via Total Variation Minimization

01/28/2019
by   Alexander Jung, et al.
18

We propose and analyze a method for semi-supervised learning from partially-labeled network-structured data. Our approach is based on a graph signal recovery interpretation under a clustering hypothesis that labels of data points belonging to the same well-connected subset (cluster) are similar valued. This lends naturally to learning the labels by total variation (TV) minimization, which we solve by applying a recently proposed primal-dual method for non-smooth convex optimization. The resulting algorithm allows for a highly scalable implementation using message passing over the underlying empirical graph, which renders the algorithm suitable for big data applications. By applying tools of compressed sensing, we derive a sufficient condition on the underlying network structure such that TV minimization recovers clusters in the empirical graph of the data. In particular, we show that the proposed primal-dual method amounts to maximizing network flows over the empirical graph of the dataset. Moreover, the learning accuracy of the proposed algorithm is linked to the set of network flows between data points having known labels. The effectiveness and scalability of our approach is verified by numerical experiments.

READ FULL TEXT
research
03/26/2019

Classifying Partially Labeled Networked Data via Logistic Network Lasso

We apply the network Lasso to classify partially labeled data points whi...
research
11/03/2019

Clustering in Partially Labeled Stochastic Block Models via Total Variation Minimization

A main task in data analysis is to organize data points into coherent gr...
research
05/11/2017

The Network Nullspace Property for Compressed Sensing of Big Data over Networks

We adapt the nullspace property of compressed sensing for sparse vectors...
research
08/21/2018

Faster PET Reconstruction with Non-Smooth Priors by Randomization and Preconditioning

Uncompressed clinical data from modern positron emission tomography (PET...
research
05/22/2019

Learning Networked Exponential Families with Network Lasso

The data arising in many important big-data applications, ranging from s...
research
08/22/2018

Analysis of Network Lasso For Semi-Supervised Regression

We characterize the statistical properties of network Lasso for semi-sup...
research
02/07/2022

A Least Square Approach to Semi-supervised Local Cluster Extraction

A least square semi-supervised local clustering algorithm based on the i...

Please sign up or login with your details

Forgot password? Click here to reset