Persistent Laplacian-enhanced Algorithm for Scarcely Labeled Data Classification

05/25/2023
by Gokul Bhusal, et al.

The success of many machine learning (ML) methods depends crucially on having large amounts of labeled data. However, obtaining enough labeled data can be expensive, time-consuming, and subject to ethical constraints for many applications. One approach that has shown tremendous value in addressing this challenge is semi-supervised learning (SSL). This technique uses both labeled and unlabeled data during training, typically with far less labeled data than unlabeled data, since unlabeled data is often relatively easy and inexpensive to obtain. SSL methods are particularly useful in applications where labeling data is especially expensive, such as medical analysis, natural language processing (NLP), or speech recognition. A subset of SSL methods that has achieved great success in various domains consists of algorithms that integrate graph-based techniques. These procedures are popular due to the vast amount of information provided by the graphical framework and the versatility of their applications. In this work, we propose an algebraic topology-based semi-supervised method called persistent Laplacian-enhanced graph MBO (PL-MBO), which integrates persistent spectral graph theory with the classical Merriman-Bence-Osher (MBO) scheme. Specifically, we use a filtration procedure to generate a sequence of chain complexes and associated families of simplicial complexes, from which we construct a family of persistent Laplacians. Overall, the procedure is very efficient, requires much less labeled data to perform well than many ML techniques, and can be adapted for both small and large datasets. We evaluate the performance of the proposed method on data classification, and the results indicate that the proposed technique outperforms other existing semi-supervised algorithms.
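To make the graph MBO idea concrete, the following is a minimal sketch of a generic graph-based MBO classifier: a diffusion step with a fidelity term that pulls labeled nodes toward their known class, followed by thresholding each node to a single class. It is not the PL-MBO method itself. As an assumption for illustration, an ordinary symmetric normalized graph Laplacian built from Gaussian weights stands in for the family of persistent Laplacians, and the function names and parameters (`sigma`, `dt`, `mu`, `n_iter`, `n_diffuse`) are hypothetical choices, not taken from the paper.

```python
import numpy as np


def gaussian_weights(X, sigma=1.0):
    """Dense Gaussian similarity matrix with a zero diagonal (assumed kernel)."""
    sq_dist = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    W = np.exp(-sq_dist / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    return W


def graph_mbo(X, labeled_idx, labeled_y, n_classes,
              sigma=1.0, dt=0.1, mu=10.0, n_iter=20, n_diffuse=5):
    """Generic graph MBO sketch; a standard Laplacian replaces the persistent one."""
    n = X.shape[0]
    W = gaussian_weights(X, sigma)
    d_inv_sqrt = 1.0 / np.sqrt(W.sum(axis=1))
    # Symmetric normalized graph Laplacian: I - D^{-1/2} W D^{-1/2}
    L = np.eye(n) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]

    # One-hot targets for labeled nodes; mask selects rows with known labels.
    Y = np.zeros((n, n_classes))
    Y[labeled_idx, labeled_y] = 1.0
    mask = np.zeros((n, 1))
    mask[labeled_idx] = 1.0

    # Start unlabeled nodes at the uniform distribution over classes.
    U = np.full((n, n_classes), 1.0 / n_classes)
    U[labeled_idx] = Y[labeled_idx]

    step = dt / n_diffuse
    A = np.eye(n) + step * L  # implicit (backward Euler) diffusion operator
    for _ in range(n_iter):
        # Diffusion with fidelity: solve (I + step*L) U_new = U - step*mu*mask*(U - Y)
        for _ in range(n_diffuse):
            U = np.linalg.solve(A, U - step * mu * mask * (U - Y))
        # Thresholding: project every row onto its closest one-hot vector.
        winners = U.argmax(axis=1)
        U = np.zeros_like(U)
        U[np.arange(n), winners] = 1.0
    return U.argmax(axis=1)
```

Calling `graph_mbo(X, labeled_idx, labeled_y, n_classes)` with a feature matrix `X` and a handful of labeled indices returns one predicted class per row. The dense solve is only practical for small datasets; a spectral truncation of the Laplacian is the usual route for larger ones.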

