Binary classification with ambiguous training data

11/05/2020
by Naoya Otani, et al.

In supervised learning, we often face ambiguous (A) samples that are difficult to label even for domain experts. In this paper, we consider a binary classification problem in the presence of such A samples. This problem is substantially different from semi-supervised learning, since unlabeled samples are not necessarily difficult samples. It is also different from 3-class classification with the positive (P), negative (N), and A classes, since we do not want to classify test samples into the A class. Our proposed method extends binary classification with a reject option, which trains a classifier and a rejector simultaneously using P and N samples based on the 0-1-c loss with rejection cost c. More specifically, we propose to train a classifier and a rejector under the 0-1-c-d loss using P, N, and A samples, where d is the misclassification penalty for ambiguous samples. In our practical implementation, we use a convex upper bound of the 0-1-c-d loss for computational tractability. Numerical experiments demonstrate that our method can successfully utilize the additional information provided by A training samples.
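
To make the two losses named in the abstract concrete, the following is a minimal sketch, not the authors' implementation. The 0-1-c loss follows the usual reject-option convention: 0 for a correct prediction, 1 for a misclassification, c for a rejection. The abstract does not spell out the 0-1-c-d loss, so the treatment of A samples here is an assumption: an A sample costs d when it is classified as P or N and 0 when it is rejected. The label encoding (+1, -1, 0), the `REJECT` marker, and all function names are hypothetical.

```python
# Illustrative encoding (an assumption, not from the paper):
# labels: +1 = positive (P), -1 = negative (N), 0 = ambiguous (A)
# decisions: +1, -1, or REJECT
REJECT = None


def zero_one_c_loss(label, decision, c):
    """0-1-c loss for a P or N sample: 0 if correct, 1 if wrong, c if rejected."""
    if decision is REJECT:
        return c
    return 0.0 if decision == label else 1.0


def zero_one_c_d_loss(label, decision, c, d):
    """Assumed reading of the 0-1-c-d loss.

    P and N samples are scored with the 0-1-c loss. For an A sample we
    assume a penalty of d when it is classified (not rejected) and 0 when
    it is rejected; the abstract only says that d is the misclassification
    penalty for ambiguous samples.
    """
    if label == 0:
        return 0.0 if decision is REJECT else d
    return zero_one_c_loss(label, decision, c)


def empirical_risk(labels, decisions, c, d):
    """Average 0-1-c-d loss over a set of samples."""
    losses = [zero_one_c_d_loss(y, r, c, d) for y, r in zip(labels, decisions)]
    return sum(losses) / len(losses)


if __name__ == "__main__":
    labels = [1, -1, 0, 0, 1]
    decisions = [1, 1, REJECT, -1, REJECT]
    print(empirical_risk(labels, decisions, c=0.2, d=0.5))
```

This discrete loss is not convex, which is why the abstract mentions a convex upper bound for the practical implementation; the sketch is only suitable for evaluating decisions, not for training.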

