Non-iterative Label Propagation on Optimal Leading Forest

09/25/2017
by   Ji Xu, et al.
0

Graph based semi-supervised learning (GSSL) has intuitive representation and can be improved by exploiting the matrix calculation. However, it has to perform iterative optimization to achieve a preset objective, which usually leads to low efficiency. Another inconvenience lying in GSSL is that when new data come, the graph construction and the optimization have to be conducted all over again. We propose a sound assumption, arguing that: the neighboring data points are not in peer-to-peer relation, but in a partial-ordered relation induced by the local density and distance between the data; and the label of a center can be regarded as the contribution of its followers. Starting from the assumption, we develop a highly efficient non-iterative label propagation algorithm based on a novel data structure named as optimal leading forest (LaPOLeaF). The major weaknesses of the traditional GSSL are addressed by this study. We further scale LaPOLeaF to accommodate big data by utilizing block distance matrix technique, parallel computing, and Locality-Sensitive Hashing (LSH). Experiments on large datasets have shown the promising results of the proposed methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/18/2015

Pairwise Constraint Propagation on Multi-View Data

This paper presents a graph-based learning approach to pairwise constrai...
research
07/08/2016

Graph Construction with Label Information for Semi-Supervised Learning

In the literature, most existing graph-based semi-supervised learning (S...
research
01/16/2013

Kernelized Locality-Sensitive Hashing for Semi-Supervised Agglomerative Clustering

Large scale agglomerative clustering is hindered by computational burden...
research
10/01/2021

Label Propagation Through Optimal Transport

In this paper, we tackle the transductive semi-supervised learning probl...
research
03/22/2021

Forest Fire Clustering: Cluster-oriented Label Propagation Clustering and Monte Carlo Verification Inspired by Forest Fire Dynamics

Clustering methods group data points together and assign them group-leve...

Please sign up or login with your details

Forgot password? Click here to reset