Greedy Biomarker Discovery in the Genome with Applications to Antimicrobial Resistance

05/22/2015
by Alexandre Drouin, et al.

The Set Covering Machine (SCM) is a greedy learning algorithm that produces sparse classifiers. We extend the SCM to datasets that contain a huge number of features. The whole genetic material of living organisms is one such case, where the number of features exceeds 10^7. Three human pathogens were used to evaluate the performance of the SCM at predicting antimicrobial resistance. Our results show that the SCM compares favorably, in terms of sparsity and accuracy, with L1- and L2-regularized Support Vector Machines and CART decision trees. Moreover, the SCM was the only algorithm that could consider the full feature space. For all other algorithms, the feature space had to be filtered as a preprocessing step.


