Local Label Propagation for Large-Scale Semi-Supervised Learning

05/28/2019
by   Chengxu Zhuang, et al.
0

A significant issue in training deep neural networks to solve supervised learning tasks is the need for large numbers of labelled datapoints. The goal of semi-supervised learning is to leverage ubiquitous unlabelled data, together with small quantities of labelled data, to achieve high task performance. Though substantial recent progress has been made in developing semi-supervised algorithms that are effective for comparatively small datasets, many of these techniques do not scale readily to the large (unlaballed) datasets characteristic of real-world applications. In this paper we introduce a novel approach to scalable semi-supervised learning, called Local Label Propagation (LLP). Extending ideas from recent work on unsupervised embedding learning, LLP first embeds datapoints, labelled and otherwise, in a common latent space using a deep neural network. It then propagates pseudolabels from known to unknown datapoints in a manner that depends on the local geometry of the embedding, taking into account both inter-point distance and local data density as a weighting on propagation likelihood. The parameters of the deep embedding are then trained to simultaneously maximize pseudolabel categorization performance as well as a metric of the clustering of datapoints within each psuedo-label group, iteratively alternating stages of network training and label propagation. We illustrate the utility of the LLP method on the ImageNet dataset, achieving results that outperform previous state-of-the-art scalable semi-supervised learning algorithms by large margins, consistently across a wide variety of training regimes. We also show that the feature representation learned with LLP transfers well to scene recognition in the Places 205 dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 7

page 9

research
07/07/2022

FastHebb: Scaling Hebbian Training of Deep Neural Networks to ImageNet Level

Learning algorithms for Deep Neural Networks are typically based on supe...
research
08/22/2022

Semi-supervised classification using a supervised autoencoder for biomedical applications

In this paper we present a new approach to solve semi-supervised classif...
research
03/09/2020

Embedding Propagation: Smoother Manifold for Few-Shot Classification

Few-shot classification is challenging because the data distribution of ...
research
04/18/2021

Deep Clustering with Measure Propagation

Deep models have improved state-of-the-art for both supervised and unsup...
research
03/29/2019

Local Aggregation for Unsupervised Learning of Visual Embeddings

Unsupervised approaches to learning in neural networks are of substantia...
research
06/26/2021

Scalable Teacher Forcing Network for Semi-Supervised Large Scale Data Streams

The large-scale data stream problem refers to high-speed information flo...
research
03/25/2022

Digital Fingerprinting of Microstructures

Finding efficient means of fingerprinting microstructural information is...

Please sign up or login with your details

Forgot password? Click here to reset