Becoming More Robust to Label Noise with Classifier Diversity

03/07/2014
by Michael R. Smith, et al.

It is widely known in the machine learning community that class noise can be (and often is) detrimental to inducing a model of the data. Many current approaches use a single, often biased, measurement to determine if an instance is noisy. A biased measure may work well on certain data sets, but it can also be less effective on a broader set of data sets. In this paper, we present noise identification using classifier diversity (NICD) -- a method for deriving a less biased noise measurement and integrating it into the learning process. To lessen the bias of the noise measure, NICD selects a diverse set of classifiers (based on their predictions of novel instances) to determine which instances are noisy. We examine NICD as a technique for filtering, instance weighting, and selecting the base classifiers of a voting ensemble. We compare NICD with several other noise handling techniques that do not consider classifier diversity on a set of 54 data sets and 5 learning algorithms. NICD significantly increases the classification accuracy over the other considered approaches and is effective across a broad set of data sets and learning algorithms.
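The idea summarized above can be illustrated with a minimal sketch. The abstract does not say how diversity is measured or how the classifiers are selected, so everything specific here is an assumption for illustration: the pairwise disagreement measure, the greedy selection, the hypothetical helper names (`disagreement`, `select_diverse`, `noise_scores`), the toy predictions, and the 0.5 filtering threshold. It is not the authors' implementation.

```python
from itertools import combinations


def disagreement(preds_a, preds_b):
    """Fraction of instances on which two classifiers predict different labels."""
    return sum(a != b for a, b in zip(preds_a, preds_b)) / len(preds_a)


def select_diverse(predictions, k):
    """Greedily pick k (k >= 2) classifier names whose predictions disagree most.

    `predictions` maps a classifier name to its list of predicted labels.
    Assumption: diversity is measured by pairwise disagreement.
    """
    names = sorted(predictions)
    # Seed the set with the pair of classifiers that disagree most often.
    chosen = list(max(
        combinations(names, 2),
        key=lambda p: disagreement(predictions[p[0]], predictions[p[1]]),
    ))
    while len(chosen) < k:
        rest = [n for n in names if n not in chosen]
        # Add the classifier with the highest mean disagreement to the chosen set.
        best = max(
            rest,
            key=lambda n: sum(
                disagreement(predictions[n], predictions[c]) for c in chosen
            ) / len(chosen),
        )
        chosen.append(best)
    return chosen


def noise_scores(predictions, chosen, labels):
    """Per instance: fraction of chosen classifiers that contradict the given label."""
    return [
        sum(predictions[c][i] != y for c in chosen) / len(chosen)
        for i, y in enumerate(labels)
    ]


# Toy data (hypothetical): held-out predictions from four candidate classifiers.
labels = [0, 0, 1, 1, 0]
predictions = {
    "tree": [0, 0, 1, 1, 1],
    "knn":  [0, 0, 1, 1, 1],
    "nb":   [0, 1, 1, 1, 1],
    "svm":  [0, 0, 0, 1, 1],
}

chosen = select_diverse(predictions, k=3)
scores = noise_scores(predictions, chosen, labels)

# Filtering: keep instances that fewer than half of the chosen classifiers contradict.
keep = [i for i, s in enumerate(scores) if s < 0.5]
# Instance weighting: down-weight instances in proportion to their noise score.
weights = [1.0 - s for s in scores]
```

On this toy data the last instance is contradicted by every selected classifier, so it is dropped by the filter and receives weight 0 under the weighting variant. The same selected set could also serve as the base classifiers of a voting ensemble, the third use mentioned in the abstract.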

research
12/13/2013

An Extensive Evaluation of Filtering Misclassified Instances in Supervised Classification Tasks

Removing or filtering outliers and mislabeled instances prior to trainin...
research
06/01/2021

Analysis of classifiers robust to noisy labels

We explore contemporary robust classification algorithms for overcoming ...
research
04/20/2018

An Ensemble Generation Method Based on Instance Hardness

In Machine Learning, ensemble methods have been receiving a great deal o...
research
09/17/2018

From Same Photo: Cheating on Visual Kinship Challenges

With the propensity for deep learning models to learn unintended signals...
research
08/30/2019

Classifying single-qubit noise using machine learning

Quantum characterization, validation, and verification (QCVV) techniques...
research
06/07/2022

Inferring Unfairness and Error from Population Statistics in Binary and Multiclass Classification

We propose methods for making inferences on the fairness and accuracy of...