RandomSCM: interpretable ensembles of sparse classifiers tailored for omics data

08/11/2022
by   Thibaud Godon, et al.
2

Background: Understanding the relationship between the Omics and the phenotype is a central problem in precision medicine. The high dimensionality of metabolomics data challenges learning algorithms in terms of scalability and generalization. Most learning algorithms do not produce interpretable models – Method: We propose an ensemble learning algorithm based on conjunctions or disjunctions of decision rules. – Results : Applications on metabolomics data shows that it produces models that achieves high predictive performances. The interpretability of the models makes them useful for biomarker discovery and patterns discovery in high dimensional data.

READ FULL TEXT
research
03/24/2020

Born-Again Tree Ensembles

The use of machine learning algorithms in finance, medicine, and crimina...
research
04/29/2020

Interpretable Random Forests via Rule Extraction

We introduce SIRUS (Stable and Interpretable RUle Set) for regression, a...
research
06/12/2020

A Brief Look at Generalization in Visual Meta-Reinforcement Learning

Due to the realization that deep reinforcement learning algorithms train...
research
12/25/2021

Pedagogical Rule Extraction for Learning Interpretable Models

Machine-learning models are ubiquitous. In some domains, for instance, i...
research
07/15/2022

Subgroup Discovery in Unstructured Data

Subgroup discovery is a descriptive and exploratory data mining techniqu...
research
06/12/2023

FIRE: An Optimization Approach for Fast Interpretable Rule Extraction

We present FIRE, Fast Interpretable Rule Extraction, an optimization-bas...
research
07/11/2021

Dual Optimization for Kolmogorov Model Learning Using Enhanced Gradient Descent

Data representation techniques have made a substantial contribution to a...

Please sign up or login with your details

Forgot password? Click here to reset