Adaptive Positive-Unlabelled Learning via Markov Diffusion

08/13/2021
by   Paola Stolfi, et al.
0

Positive-Unlabelled (PU) learning is the machine learning setting in which only a set of positive instances are labelled, while the rest of the data set is unlabelled. The unlabelled instances may be either unspecified positive samples or true negative samples. Over the years, many solutions have been proposed to deal with PU learning. Some techniques consider the unlabelled samples as negative ones, reducing the problem to a binary classification with a noisy negative set, while others aim to detect sets of possible negative examples to later apply a supervised machine learning strategy (two-step techniques). The approach proposed in this work falls in the latter category and works in a semi-supervised fashion: motivated and inspired by previous works, a Markov diffusion process with restart is used to assign pseudo-labels to unlabelled instances. Afterward, a machine learning model, exploiting the newly assigned classes, is trained. The principal aim of the algorithm is to identify a set of instances which are likely to contain positive instances that were originally unlabelled.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/26/2019

A method on selecting reliable samples based on fuzziness in positive and unlabeled learning

Traditional semi-supervised learning uses only labeled instances to trai...
research
08/02/2022

Binary Classification with Positive Labeling Sources

To create a large amount of training labels for machine learning models ...
research
08/14/2020

Negative Confidence-Aware Weakly Supervised Binary Classification for Effective Review Helpfulness Classification

The incompleteness of positive labels and the presence of many unlabelle...
research
11/28/2022

Semi-supervised binary classification with latent distance learning

Binary classification (BC) is a practical task that is ubiquitous in rea...
research
03/14/2023

PULSNAR – Positive unlabeled learning selected not at random: class proportion estimation when the SCAR assumption does not hold

Positive and Unlabeled (PU) learning is a type of semi-supervised binary...
research
03/21/2023

Dens-PU: PU Learning with Density-Based Positive Labeled Augmentation

This study proposes a novel approach for solving the PU learning problem...
research
09/18/2015

Evaluation of Protein-protein Interaction Predictors with Noisy Partially Labeled Data Sets

Protein-protein interaction (PPI) prediction is an important problem in ...

Please sign up or login with your details

Forgot password? Click here to reset