Interpretable Signal Analysis with Knockoffs Enhances Classification of Bacterial Raman Spectra

06/08/2020
by   Charmaine Chia, et al.

Interpretability is important for many applications of machine learning to signal data, covering aspects such as how well a model fits the data, how accurately explanations are drawn from it, and how well these can be understood by people. Feature extraction and selection can improve model interpretability by identifying structures in the data that are both informative and intuitively meaningful. To this end, we propose a signal classification framework that combines feature extraction with feature selection using the knockoff filter, a method that provides guarantees on the false discovery rate (FDR) among the selected features. We apply this framework to a dataset of Raman spectroscopy measurements from bacterial samples. Using a wavelet-based feature representation of the data and a logistic regression classifier, our framework achieves significantly higher predictive accuracy than the same classifier using the original features as input. We also benchmarked against features obtained through principal components analysis, and against the original features input into a neural network-based classifier. Our proposed framework achieved better predictive performance than the former and performance comparable to the latter, while offering the advantage of a more compact and human-interpretable set of features.
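
To make the FDR guarantee mentioned in the abstract concrete, the selection step of a knockoff filter can be sketched as follows. This is a minimal illustration of the standard knockoff+ threshold rule, not the authors' implementation: the function names `knockoff_threshold` and `knockoff_select`, the target level `q`, and the example statistics `W` are all assumptions made here for illustration. The statistics `W` are assumed to have been computed already (for example, by comparing the coefficient magnitude of each feature with that of its knockoff copy), with large positive values suggesting a genuine signal.

```python
import numpy as np


def knockoff_threshold(W, q=0.1, offset=1):
    """Data-dependent threshold of the knockoff filter (hypothetical helper).

    W      : knockoff statistics, one per feature (assumed precomputed)
    q      : target false discovery rate
    offset : 1 gives the knockoff+ rule, 0 the more liberal variant
    """
    W = np.asarray(W, dtype=float)
    # Candidate thresholds are the distinct nonzero magnitudes of W.
    candidates = np.sort(np.unique(np.abs(W[W != 0])))
    for t in candidates:
        # Features with W <= -t serve as an estimate of the false discoveries.
        false_est = offset + np.sum(W <= -t)
        selected = max(1, np.sum(W >= t))
        if false_est / selected <= q:
            return t
    # No threshold meets the target level: select nothing.
    return np.inf


def knockoff_select(W, q=0.1, offset=1):
    """Indices of the features whose statistic reaches the threshold."""
    W = np.asarray(W, dtype=float)
    return np.flatnonzero(W >= knockoff_threshold(W, q, offset))


# Made-up statistics for ten features, purely for illustration.
W_example = [5, 4, 3, 2.5, 2, 1.5, -1, 0.5, -0.2, 0.1]
chosen = knockoff_select(W_example, q=0.25)
print(chosen)
```

With the made-up statistics above and `q=0.25`, the rule settles on the smallest threshold at which the estimated proportion of false discoveries drops to the target level, and only features at or above that threshold are kept. In a pipeline like the one the abstract describes, the selected indices would then pick out the wavelet features passed to the classifier.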


research
09/25/2022

Deep Feature Selection Using a Novel Complementary Feature Mask

Feature selection has drawn much attention over the last decades in mach...
research
05/09/2016

Why (and How) Avoid Orthogonal Procrustes in Regularized Multivariate Analysis

Multivariate Analysis (MVA) comprises a family of well-known methods for...
research
06/27/2015

A Novel Approach for Stable Selection of Informative Redundant Features from High Dimensional fMRI Data

Feature selection is among the most important components because it not ...
research
03/21/2019

Feature quantization for parsimonious and interpretable predictive models

For regulatory and interpretability reasons, logistic regression is stil...
research
02/14/2022

A Machine Learning Framework for Event Identification via Modal Analysis of PMU Data

Power systems are prone to a variety of events (e.g. line trips and gene...
