Diagnosis Prevalence vs. Efficacy in Machine-learning Based Diagnostic Decision Support

06/24/2020
by Gil Alon, et al.

Many recent studies use machine learning to predict a small number of ICD-9-CM codes. In practice, however, physicians must consider a much broader range of diagnoses. This study aims to put these previously incongruent evaluation settings on a more equal footing by predicting ICD-9-CM codes from electronic health record properties and by demonstrating the relationship between diagnosis prevalence and system performance. We extracted patient features from the MIMIC-III dataset for each admission, then trained and evaluated 43 different machine learning classifiers. Among this pool, the most successful classifier was a Multi-Layer Perceptron. Consistent with general machine learning expectations, the F1 scores of all classifiers dropped as disease prevalence decreased. Scores fell from 0.28 for the 50 most prevalent ICD-9-CM codes to 0.03 for the 1000 most prevalent ICD-9-CM codes. Statistical analyses showed a moderate positive correlation (0.5866) between disease prevalence and efficacy.
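The evaluation described above can be illustrated with a minimal sketch. It is not the authors' code. It assumes that each admission carries a set of diagnosis codes, that scores over the top-k most prevalent codes are macro-averaged, and that the reported correlation is a Pearson coefficient; the abstract confirms none of these choices. The function names and the toy data are hypothetical.

```python
from collections import Counter
from math import sqrt


def per_code_f1(y_true, y_pred):
    """Return {code: F1} from parallel lists of per-admission code sets."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        for code in pred & truth:
            tp[code] += 1
        for code in pred - truth:
            fp[code] += 1
        for code in truth - pred:
            fn[code] += 1
    scores = {}
    for code in set(tp) | set(fp) | set(fn):
        denom = 2 * tp[code] + fp[code] + fn[code]
        scores[code] = 2 * tp[code] / denom if denom else 0.0
    return scores


def prevalence(y_true):
    """Fraction of admissions in which each code appears."""
    counts = Counter(code for truth in y_true for code in truth)
    return {code: n / len(y_true) for code, n in counts.items()}


def macro_f1_top_k(y_true, y_pred, k):
    """Macro-averaged F1 over the k most prevalent codes (assumed averaging)."""
    prev = prevalence(y_true)
    f1 = per_code_f1(y_true, y_pred)
    top = sorted(prev, key=prev.get, reverse=True)[:k]
    return sum(f1.get(code, 0.0) for code in top) / len(top)


def pearson(xs, ys):
    """Pearson correlation coefficient (assumed correlation measure)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


# Hypothetical toy data: four admissions, three made-up codes.
y_true = [{"A", "B"}, {"A"}, {"A", "C"}, {"A", "B"}]
y_pred = [{"A", "B"}, {"A"}, {"A"}, {"A"}]

prev = prevalence(y_true)
f1 = per_code_f1(y_true, y_pred)
codes = sorted(prev)
r = pearson([prev[c] for c in codes], [f1[c] for c in codes])

for k in (1, 2, 3):
    print(k, round(macro_f1_top_k(y_true, y_pred, k), 4))
print("correlation:", round(r, 4))
```

In this toy setting, widening the evaluation from the most prevalent code to all three lowers the averaged F1, which mirrors the trend the abstract reports when moving from the top 50 to the top 1000 codes.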
