Optimizing Diffusion Rate and Label Reliability in a Graph-Based Semi-supervised Classifier

Semi-supervised learning has attracted attention because it exploits the structure of unlabeled data to achieve competitive classification results with far fewer labels than supervised approaches. The Local and Global Consistency (LGC) algorithm is one of the best-known graph-based semi-supervised learning (GSSL) classifiers. Notably, its solution can be written as a linear combination of the known labels. The coefficients of this linear combination depend on a parameter α, which determines how the reward decays over time when labeled vertices are reached in a random walk. In this work, we discuss how removing the self-influence of a labeled instance may be beneficial, and how it relates to leave-one-out error. Moreover, we propose to minimize this leave-one-out loss with automatic differentiation. Within this framework, we propose methods to estimate label reliability and diffusion rate. Optimizing the diffusion rate is accomplished more efficiently with a spectral representation. Results show that the label reliability approach competes with robust L1-norm methods, and that removing diagonal entries reduces the risk of overfitting and leads to suitable criteria for parameter selection.
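
The role of the diagonal entries can be illustrated with a small NumPy sketch. It assumes the standard closed-form LGC solution F = (I - αS)^{-1} Y with S = D^{-1/2} W D^{-1/2}; the exact normalization used in the paper is not given in the abstract. The toy graph, the value α = 0.5, and the names `propagation_matrix` and `classify` are hypothetical choices made for illustration, not the authors' implementation.

```python
import numpy as np

def propagation_matrix(W, alpha):
    """Return P = (I - alpha * S)^{-1}, where S = D^{-1/2} W D^{-1/2}.

    Assumption: this is the standard closed-form LGC propagation matrix,
    so the soft labels F = P @ Y are a linear combination of known labels.
    """
    d_inv_sqrt = 1.0 / np.sqrt(W.sum(axis=1))
    S = d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    return np.linalg.inv(np.eye(W.shape[0]) - alpha * S)

def classify(P, Y, remove_self_influence=False):
    """Predict a class per vertex; optionally zero the diagonal of P so a
    labeled vertex cannot vote for its own label (leave-one-out style)."""
    if remove_self_influence:
        P = P - np.diag(np.diag(P))
    return np.argmax(P @ Y, axis=1)

# Hypothetical toy graph: two 5-cliques joined by one weak edge (4, 5).
n = 10
W = np.zeros((n, n))
W[:5, :5] = 1.0
W[5:, 5:] = 1.0
np.fill_diagonal(W, 0.0)
W[4, 5] = W[5, 4] = 0.1

# Vertices 4 and 5 are unlabeled; vertex 1 is deliberately mislabeled.
labeled = np.array([0, 1, 2, 3, 6, 7, 8, 9])
given = np.array([0, 1, 0, 0, 1, 1, 1, 1])
Y = np.zeros((n, 2))
Y[labeled, given] = 1.0

P = propagation_matrix(W, alpha=0.5)  # alpha chosen arbitrarily for the demo

fitted = classify(P, Y)[labeled]
loo = classify(P, Y, remove_self_influence=True)[labeled]

# Labeled vertices whose prediction disagrees with their given label.
fit_flagged = labeled[fitted != given]
loo_flagged = labeled[loo != given]
print("flagged with self-influence:   ", fit_flagged.tolist())
print("flagged without self-influence:", loo_flagged.tolist())
```

In this toy setting the full matrix reproduces every given label, including the wrong one at vertex 1, so no disagreement is reported. Once the diagonal is removed, vertex 1 is classified by its neighbors alone and is the only vertex flagged. This is only meant to make the overfitting argument concrete; the paper's procedure for optimizing label reliability and α with automatic differentiation is not reproduced here.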


