A Bootstrap Machine Learning Approach to Identify Rare Disease Patients from Electronic Health Records

09/06/2016
by   Ravi Garg, et al.
0

Rare diseases are very difficult to identify among large number of other possible diagnoses. Better availability of patient data and improvement in machine learning algorithms empower us to tackle this problem computationally. In this paper, we target one such rare disease - cardiac amyloidosis. We aim to automate the process of identifying potential cardiac amyloidosis patients with the help of machine learning algorithms and also learn most predictive factors. With the help of experienced cardiologists, we prepared a gold standard with 73 positive (cardiac amyloidosis) and 197 negative instances. We achieved high average cross-validation F1 score of 0.98 using an ensemble machine learning classifier. Some of the predictive variables were: Age and Diagnosis of cardiac arrest, chest pain, congestive heart failure, hypertension, prim open angle glaucoma, and shoulder arthritis. Further studies are needed to validate the accuracy of the system across an entire health system and its generalizability for other diseases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/16/2023

Ensemble Framework for Cardiovascular Disease Prediction

Heart disease is the major cause of non-communicable and silent death wo...
research
07/03/2019

High-Throughput Machine Learning from Electronic Health Records

The widespread digitization of patient data via electronic health record...
research
11/19/2020

Novel Classification of Ischemic Heart Disease Using Artificial Neural Network

Ischemic heart disease (IHD), particularly in its chronic stable form, i...
research
06/18/2022

Tree-Guided Rare Feature Selection and Logic Aggregation with Electronic Health Records Data

Statistical learning with a large number of rare binary features is comm...
research
10/25/2022

Diagnostic Posture Control System for Seated-Style Echocardiography Robot

Purpose: Conventional robotic ultrasound systems were utilized with pati...
research
01/28/2022

Developing a Machine-Learning Algorithm to Diagnose Age-Related Macular Degeneration

Today, more than 12 million people over the age of 40 suffer from ocular...
research
08/23/2022

POPDx: An Automated Framework for Patient Phenotyping across 392,246 Individuals in the UK Biobank Study

Objective For the UK Biobank standardized phenotype codes are associated...

Please sign up or login with your details

Forgot password? Click here to reset