Robust Classification of High-Dimensional Spectroscopy Data Using Deep Learning and Data Synthesis

03/26/2020
by   James Houston, et al.
0

This paper presents a new approach to classification of high dimensional spectroscopy data and demonstrates that it outperforms other current state-of-the art approaches. The specific task we consider is identifying whether samples contain chlorinated solvents or not, based on their Raman spectra. We also examine robustness to classification of outlier samples that are not represented in the training set (negative outliers). A novel application of a locally-connected neural network (NN) for the binary classification of spectroscopy data is proposed and demonstrated to yield improved accuracy over traditionally popular algorithms. Additionally, we present the ability to further increase the accuracy of the locally-connected NN algorithm through the use of synthetic training spectra and we investigate the use of autoencoder based one-class classifiers and outlier detectors. Finally, a two-step classification process is presented as an alternative to the binary and one-class classification paradigms. This process combines the locally-connected NN classifier, the use of synthetic training data, and an autoencoder based outlier detector to produce a model which is shown to both produce high classification accuracy, and be robust to the presence of negative outliers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2018

Analysis of the Effect of Unexpected Outliers in the Classification of Spectroscopy Data

Multi-class classification algorithms are very widely used, but we argue...
research
06/12/2018

A One-Sided Classification Toolkit with Applications in the Analysis of Spectroscopy Data

This dissertation investigates the use of one-sided classification algor...
research
04/22/2011

Robust Clustering Using Outlier-Sparsity Regularization

Notwithstanding the popularity of conventional clustering algorithms suc...
research
07/02/2020

Outlier Detection through Null Space Analysis of Neural Networks

Many machine learning classification systems lack competency awareness. ...
research
10/10/2020

Anomaly Detection based on Zero-Shot Outlier Synthesis and Hierarchical Feature Distillation

Anomaly detection suffers from unbalanced data since anomalies are quite...
research
08/16/2017

Visualizing and Exploring Dynamic High-Dimensional Datasets with LION-tSNE

T-distributed stochastic neighbor embedding (tSNE) is a popular and priz...
research
07/15/2020

Evaluation of Neural Network Classification Systems on Document Stream

One major drawback of state of the art Neural Networks (NN)-based approa...

Please sign up or login with your details

Forgot password? Click here to reset