Analysis of the Effect of Unexpected Outliers in the Classification of Spectroscopy Data

06/14/2018
by   Frank G. Glavin, et al.
0

Multi-class classification algorithms are very widely used, but we argue that they are not always ideal from a theoretical perspective, because they assume all classes are characterized by the data, whereas in many applications, training data for some classes may be entirely absent, rare, or statistically unrepresentative. We evaluate one-sided classifiers as an alternative, since they assume that only one class (the target) is well characterized. We consider a task of identifying whether a substance contains a chlorinated solvent, based on its chemical spectrum. For this application, it is not really feasible to collect a statistically representative set of outliers, since that group may contain anything apart from the target chlorinated solvents. Using a new one-sided classification toolkit, we compare a One-Sided k-NN algorithm with two well-known binary classification algorithms, and conclude that the one-sided classifier is more robust to unexpected outliers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/12/2018

A One-Sided Classification Toolkit with Applications in the Analysis of Spectroscopy Data

This dissertation investigates the use of one-sided classification algor...
research
03/26/2020

Robust Classification of High-Dimensional Spectroscopy Data Using Deep Learning and Data Synthesis

This paper presents a new approach to classification of high dimensional...
research
11/30/2020

Binary Classification: Counterbalancing Class Imbalance by Applying Regression Models in Combination with One-Sided Label Shifts

In many real-world pattern recognition scenarios, such as in medical app...
research
11/05/2020

An Orthogonality Principle for Select-Maximum Estimation of Exponential Variables

It was recently proposed to encode the one-sided exponential source X in...
research
02/27/2020

Comparison of Multi-Class and Binary Classification Machine Learning Models in Identifying Strong Gravitational Lenses

Typically, binary classification lens-finding schemes are used to discri...
research
10/25/2021

Computing elements of certain form in ideals to prove properties of operators

Proving statements about linear operators expressed in terms of identiti...
research
10/30/2017

Continuous Authentication Using One-class Classifiers and their Fusion

While developing continuous authentication systems (CAS), we generally a...

Please sign up or login with your details

Forgot password? Click here to reset