Benchmark of structured machine learning methods for microbial identification from mass-spectrometry data

06/24/2015
by   Kévin Vervier, et al.
0

Microbial identification is a central issue in microbiology, in particular in the fields of infectious diseases diagnosis and industrial quality control. The concept of species is tightly linked to the concept of biological and clinical classification where the proximity between species is generally measured in terms of evolutionary distances and/or clinical phenotypes. Surprisingly, the information provided by this well-known hierarchical structure is rarely used by machine learning-based automatic microbial identification systems. Structured machine learning methods were recently proposed for taking into account the structure embedded in a hierarchy and using it as additional a priori information, and could therefore allow to improve microbial identification systems. We test and compare several state-of-the-art machine learning methods for microbial identification on a new Matrix-Assisted Laser Desorption/Ionization Time-of-Flight mass spectrometry (MALDI-TOF MS) dataset. We include in the benchmark standard and structured methods, that leverage the knowledge of the underlying hierarchical structure in the learning process. Our results show that although some methods perform better than others, structured methods do not consistently perform better than their "flat" counterparts. We postulate that this is partly due to the fact that standard methods already reach a high level of accuracy in this context, and that they mainly confuse species close to each other in the tree, a case where using the known hierarchy is not helpful.

READ FULL TEXT
research
06/07/2021

Digital Taxonomist: Identifying Plant Species in Citizen Scientists' Photographs

Automatic identification of plant specimens from amateur photographs cou...
research
01/23/2017

Comparative study on supervised learning methods for identifying phytoplankton species

Phytoplankton plays an important role in marine ecosystem. It is defined...
research
06/22/2019

Deep learning approach to description and classification of fungi microscopic images

Diagnosis of fungal infections can rely on microscopic examination, howe...
research
03/29/2017

Hierarchical Classification for Spoken Arabic Dialect Identification using Prosody: Case of Algerian Dialects

In daily communications, Arabs use local dialects which are hard to iden...
research
02/18/2022

Churn modeling of life insurance policies via statistical and machine learning methods – Analysis of important features

Life assurance companies typically possess a wealth of data covering mul...
research
05/24/2020

Deep learning approach to describe and classify fungi microscopic images

Preliminary diagnosis of fungal infections can rely on microscopic exami...

Please sign up or login with your details

Forgot password? Click here to reset