Noisy multi-label semi-supervised dimensionality reduction

02/20/2019
by   Karl Øyvind Mikalsen, et al.
0

Noisy labeled data represent a rich source of information that often are easily accessible and cheap to obtain, but label noise might also have many negative consequences if not accounted for. How to fully utilize noisy labels has been studied extensively within the framework of standard supervised machine learning over a period of several decades. However, very little research has been conducted on solving the challenge posed by noisy labels in non-standard settings. This includes situations where only a fraction of the samples are labeled (semi-supervised) and each high-dimensional sample is associated with multiple labels. In this work, we present a novel semi-supervised and multi-label dimensionality reduction method that effectively utilizes information from both noisy multi-labels and unlabeled data. With the proposed Noisy multi-label semi-supervised dimensionality reduction (NMLSDR) method, the noisy multi-labels are denoised and unlabeled data are labeled simultaneously via a specially designed label propagation algorithm. NMLSDR then learns a projection matrix for reducing the dimensionality by maximizing the dependence between the enlarged and denoised multi-label space and the features in the projected space. Extensive experiments on synthetic data, benchmark datasets, as well as a real-world case study, demonstrate the effectiveness of the proposed algorithm and show that it outperforms state-of-the-art multi-label feature extraction algorithms.

READ FULL TEXT
research
02/18/2020

DivideMix: Learning with Noisy Labels as Semi-supervised Learning

Deep neural networks are known to be annotation-hungry. Numerous efforts...
research
02/20/2008

Classification Constrained Dimensionality Reduction

Dimensionality reduction is a topic of recent interest. In this paper, w...
research
06/18/2016

An Efficient Large-scale Semi-supervised Multi-label Classifier Capable of Handling Missing labels

Multi-label classification has received considerable interest in recent ...
research
08/10/2020

Feature Ranking for Semi-supervised Learning

The data made available for analysis are becoming more and more complex ...
research
06/18/2023

Semi-Supervised Learning for Multi-Label Cardiovascular Diseases Prediction:A Multi-Dataset Study

Electrocardiography (ECG) is a non-invasive tool for predicting cardiova...
research
09/09/2020

Semi-Supervised Active Learning for COVID-19 Lung Ultrasound Multi-symptom Classification

Ultrasound (US) is a non-invasive yet effective medical diagnostic imagi...
research
09/24/2020

Identifying noisy labels with a transductive semi-supervised leave-one-out filter

Obtaining data with meaningful labels is often costly and error-prone. I...

Please sign up or login with your details

Forgot password? Click here to reset