Theoretical Comparisons of Positive-Unlabeled Learning against Positive-Negative Learning

03/10/2016
by   Gang Niu, et al.
0

In PU learning, a binary classifier is trained from positive (P) and unlabeled (U) data without negative (N) data. Although N data is missing, it sometimes outperforms PN learning (i.e., ordinary supervised learning). Hitherto, neither theoretical nor experimental analysis has been given to explain this phenomenon. In this paper, we theoretically compare PU (and NU) learning against PN learning based on the upper bounds on estimation errors. We find simple conditions when PU and NU learning are likely to outperform PN learning, and we prove that, in terms of the upper bounds, either PU or NU learning (depending on the class-prior probability and the sizes of P and N data) given infinite U data will improve on PN learning. Our theoretical findings well agree with the experimental results on artificial and benchmark data even when the experimental setup does not match the theoretical assumptions exactly.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/26/2022

Learning From Positive and Unlabeled Data Using Observer-GAN

The problem of learning from positive and unlabeled data (A.K.A. PU lear...
research
02/24/2020

Learning from Positive and Unlabeled Data with Arbitrary Positive Shift

Positive-unlabeled (PU) learning trains a binary classifier using only p...
research
04/21/2020

Improving Positive Unlabeled Learning: Practical AUL Estimation and New Training Method for Extremely Imbalanced Data Sets

Positive Unlabeled (PU) learning is widely used in many applications, wh...
research
01/28/2019

An analytic formulation for positive-unlabeled learning via weighted integral probability metric

We consider the problem of learning a binary classifier from only positi...
research
10/01/2018

Classification from Positive, Unlabeled and Biased Negative Data

Positive-unlabeled (PU) learning addresses the problem of learning a bin...
research
06/03/2019

Discriminative adversarial networks for positive-unlabeled learning

As an important semi-supervised learning task, positive-unlabeled (PU) l...
research
01/08/2016

Nonparametric semi-supervised learning of class proportions

The problem of developing binary classifiers from positive and unlabeled...

Please sign up or login with your details

Forgot password? Click here to reset