Improving generalization of machine learning-identified biomarkers with causal modeling: an investigation into immune receptor diagnostics

04/20/2022
by   Milena Pavlović, et al.
6

Machine learning is increasingly used to discover diagnostic and prognostic biomarkers from high-dimensional molecular data. However, a variety of factors related to experimental design may affect the ability to learn generalizable and clinically applicable diagnostics. Here, we argue that a causal perspective improves the identification of these challenges, and formalizes their relation to the robustness and generalization of machine learning-based diagnostics. To make for a concrete discussion, we focus on a specific, recently established high-dimensional biomarker - adaptive immune receptor repertoires (AIRRs). We discuss how the main biological and experimental factors of the AIRR domain may influence the learned biomarkers and provide easily adjustable simulations of such effects. In conclusion, we find that causal modeling improves machine learning-based biomarker robustness by identifying stable relations between variables and by guiding the adjustment of the relations and variables that vary between populations.

READ FULL TEXT

page 4

page 6

research
06/13/2022

iCITRIS: Causal Representation Learning for Instantaneous Temporal Effects

Causal representation learning is the task of identifying the underlying...
research
12/09/2022

Deep Learning of Causal Structures in High Dimensions

Recent years have seen rapid progress at the intersection between causal...
research
04/03/2019

OpBerg: Discovering causal sentences using optimal alignments

The biological literature is rich with sentences that describe causal re...
research
11/26/2021

Confounder Identification-free Causal Visual Feature Learning

Confounders in deep learning are in general detrimental to model's gener...
research
02/27/2022

Architectural Optimization and Feature Learning for High-Dimensional Time Series Datasets

As our ability to sense increases, we are experiencing a transition from...
research
12/21/2022

Interpretability and causal discovery of the machine learning models to predict the production of CBM wells after hydraulic fracturing

Machine learning approaches are widely studied in the production predict...
research
12/03/2018

Generalization in anti-causal learning

The ability to learn and act in novel situations is still a prerogative ...

Please sign up or login with your details

Forgot password? Click here to reset