DEDPUL: Method for Mixture Proportion Estimation and Positive-Unlabeled Classification based on Density Estimation

02/19/2019
by   Dmitry Ivanov, et al.
0

This paper studies Positive-Unlabeled Classification, the problem of semi-supervised binary classification in the case when Negative (N) class in the training set is contaminated with instances of Positive (P) class. We develop a novel method (DEDPUL) that simultaneously solves two problems concerning the contaminated Unlabeled (U) sample: estimates the proportions of the mixing components (P and N) in U, and classifies U. By conducting experiments on synthetic and real-world data we favorably compare DEDPUL with current state-of-the-art methods for both problems. We introduce an automatic procedure for DEDPUL hyperparameter optimization. Additionally, we improve two methods in the literature and achieve DEDPUL level of performance with one of them.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/14/2023

PULSNAR – Positive unlabeled learning selected not at random: class proportion estimation when the SCAR assumption does not hold

Positive and Unlabeled (PU) learning is a type of semi-supervised binary...
research
01/08/2016

Nonparametric semi-supervised learning of class proportions

The problem of developing binary classifiers from positive and unlabeled...
research
10/05/2010

A bagging SVM to learn from positive and unlabeled examples

We consider the problem of learning a binary classifier from a training ...
research
01/30/2018

Mixture Proportion Estimation for Positive--Unlabeled Learning via Classifier Dimension Reduction

Positive--unlabeled (PU) learning considers two samples, a positive set ...
research
10/01/2018

Classification from Positive, Unlabeled and Biased Negative Data

Positive-unlabeled (PU) learning addresses the problem of learning a bin...
research
04/22/2020

Quantifying With Only Positive Training Data

Quantification is the research field that studies the task of counting h...
research
03/08/2023

Automatic Debiased Learning from Positive, Unlabeled, and Exposure Data

We address the issue of binary classification from positive and unlabele...

Please sign up or login with your details

Forgot password? Click here to reset