Preventing dataset shift from breaking machine-learning biomarkers

07/21/2021
by   Jéroôme Dockès, et al.
0

Machine learning brings the hope of finding new biomarkers extracted from cohorts with rich biomedical measurements. A good biomarker is one that gives reliable detection of the corresponding condition. However, biomarkers are often extracted from a cohort that differs from the target population. Such a mismatch, known as a dataset shift, can undermine the application of the biomarker to new individuals. Dataset shifts are frequent in biomedical research, e.g. because of recruitment biases. When a dataset shift occurs, standard machine-learning techniques do not suffice to extract and validate biomarkers. This article provides an overview of when and how dataset shifts breaks machine-learning extracted biomarkers, as well as detection and correction strategies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/30/2021

SHIFT15M: Multiobjective Large-Scale Fashion Dataset with Distributional Shifts

Many machine learning algorithms assume that the training data and the t...
research
04/18/2021

Failing Conceptually: Concept-Based Explanations of Dataset Shift

Despite their remarkable performance on a wide range of visual tasks, ma...
research
05/27/2022

MIMII DG: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection for Domain Generalization Task

We present a machine sound dataset to benchmark domain generalization te...
research
06/28/2021

Ensembling Shift Detectors: an Extensive Empirical Evaluation

The term dataset shift refers to the situation where the data used to tr...
research
01/12/2016

Creativity in Machine Learning

Recent machine learning techniques can be modified to produce creative r...
research
06/17/2019

Dataset shift quantification for credit card fraud detection

Machine learning and data mining techniques have been used extensively i...
research
10/29/2018

Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift

We might hope that when faced with unexpected inputs, well-designed soft...

Please sign up or login with your details

Forgot password? Click here to reset