Enhancing naive classifier for positive unlabeled data based on logistic regression approach

06/05/2023
by   Mateusz Płatek, et al.
0

We argue that for analysis of Positive Unlabeled (PU) data under Selected Completely At Random (SCAR) assumption it is fruitful to view the problem as fitting of misspecified model to the data. Namely, we show that the results on misspecified fit imply that in the case when posterior probability of the response is modelled by logistic regression, fitting the logistic regression to the observable PU data which does not follow this model, still yields the vector of estimated parameters approximately colinear with the true vector of parameters. This observation together with choosing the intercept of the classifier based on optimisation of analogue of F1 measure yields a classifier which performs on par or better than its competitors on several real data sets considered.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/08/2021

A Theoretical Analysis of Logistic Regression and Bayesian Classifiers

This study aims to show the fundamental difference between logistic regr...
research
03/03/2023

Estimation of logistic regression parameters for complex survey data: a real data based simulation study

In complex survey data, each sampled observation has assigned a sampling...
research
11/07/2018

Interpreting the Ising Model: The Input Matters

The Ising model is a widely used model for multivariate binary data. It ...
research
12/25/2020

Using the Naive Bayes as a discriminative classifier

For classification tasks, probabilistic models can be categorized into t...
research
05/21/2013

Robust Logistic Regression using Shift Parameters (Long Version)

Annotation errors can significantly hurt classifier performance, yet dat...
research
02/05/2023

Revisiting Discriminative vs. Generative Classifiers: Theory and Implications

A large-scale deep model pre-trained on massive labeled or unlabeled dat...
research
08/28/2020

Introduction to logistic regression

For random field theory based multiple comparison corrections In brain i...

Please sign up or login with your details

Forgot password? Click here to reset