Evaluating Progress on Machine Learning for Longitudinal Electronic Healthcare Data

10/02/2020
by   David Bellamy, et al.
0

The Large Scale Visual Recognition Challenge based on the well-known Imagenet dataset catalyzed an intense flurry of progress in computer vision. Benchmark tasks have propelled other sub-fields of machine learning forward at an equally impressive pace, but in healthcare it has primarily been image processing tasks, such as in dermatology and radiology, that have experienced similar benchmark-driven progress. In the present study, we performed a comprehensive review of benchmarks in medical machine learning for structured data, identifying one based on the Medical Information Mart for Intensive Care (MIMIC-III) that allows the first direct comparison of predictive performance and thus the evaluation of progress on four clinical prediction tasks: mortality, length of stay, phenotyping, and patient decompensation. We find that little meaningful progress has been made over a 3 year period on these tasks, despite significant community engagement. Through our meta-analysis, we find that the performance of deep recurrent models is only superior to logistic regression on certain tasks. We conclude with a synthesis of these results, possible explanations, and a list of desirable qualities for future benchmarks in medical machine learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/23/2017

Benchmark of Deep Learning Models on Large Healthcare MIMIC Datasets

Deep learning models (aka Deep Neural Networks) have revolutionized many...
research
03/22/2017

Multitask Learning and Benchmarking with Clinical Time Series Data

Health care is one of the most exciting frontiers in data mining and mac...
research
10/02/2019

Benchmarking machine learning models on eICU critical care dataset

Progress of machine learning in critical care has been difficult to trac...
research
05/30/2022

FLICU: A Federated Learning Workflow for Intensive Care Unit Mortality Prediction

Although Machine Learning (ML) can be seen as a promising tool to improv...
research
07/05/2023

EHRSHOT: An EHR Benchmark for Few-Shot Evaluation of Foundation Models

While the general machine learning (ML) community has benefited from pub...
research
01/24/2019

ISeeU: Visually interpretable deep learning for mortality prediction inside the ICU

To improve the performance of Intensive Care Units (ICUs), the field of ...
research
11/30/2018

Rethinking clinical prediction: Why machine learning must consider year of care and feature aggregation

Machine learning for healthcare often trains models on de-identified dat...

Please sign up or login with your details

Forgot password? Click here to reset