Machine Learning Methods for Identifying Atrial Fibrillation Cases and Their Predictors in Patients With Hypertrophic Cardiomyopathy: The HCM-AF-Risk Model

09/19/2021
by   Moumita Bhattacharya, et al.
8

Hypertrophic cardiomyopathy (HCM) patients have a high incidence of atrial fibrillation (AF) and increased stroke risk, even with low risk of congestive heart failure, hypertension, age, diabetes, previous stroke/transient ischemic attack scores. Hence, there is a need to understand the pathophysiology of AF and stroke in HCM. In this retrospective study, we develop and apply a data-driven, machine learning based method to identify AF cases, and clinical and imaging features associated with AF, using electronic health record data. HCM patients with documented paroxysmal/persistent/permanent AF (n = 191) were considered AF cases, and the remaining patients in sinus rhythm (n = 640) were tagged as No-AF. We evaluated 93 clinical variables and the most informative variables useful for distinguishing AF from No-AF cases were selected based on the 2-sample t test and the information gain criterion. We identified 18 highly informative variables that are positively (n = 11) and negatively (n = 7) correlated with AF in HCM. Next, patient records were represented via these 18 variables. Data imbalance resulting from the relatively low number of AF cases was addressed via a combination of oversampling and under-sampling strategies. We trained and tested multiple classifiers under this sampling approach, showing effective classification. Specifically, an ensemble of logistic regression and naive Bayes classifiers, trained based on the 18 variables and corrected for data imbalance, proved most effective for separating AF from No-AF cases (sensitivity = 0.74, specificity = 0.70, C-index = 0.80). Our model is the first machine learning based method for identification of AF cases in HCM. This model demonstrates good performance, addresses data imbalance, and suggests that AF is associated with a more severe cardiac HCM phenotype.

READ FULL TEXT

page 2

page 3

page 7

page 8

page 10

page 11

page 12

page 13

research
10/01/2019

Identifying Cancer Patients at Risk for Heart Failure Using Machine Learning Methods

Cardiotoxicity related to cancer therapies has become a serious issue, d...
research
01/06/2021

Risk markers by sex and age group for in-hospital mortality in patients with STEMI or NSTEMI: an approach based on machine learning

Machine learning (ML) has demonstrated promising results in the identifi...
research
03/07/2019

Development and validation of computable Phenotype to Identify and Characterize Kidney Health in Adult Hospitalized Patients

Background: Acute kidney injury (AKI) is one of the most common complica...
research
04/28/2022

Machine Learning for Violence Risk Assessment Using Dutch Clinical Notes

Violence risk assessment in psychiatric institutions enables interventio...
research
03/31/2019

Surrogate-guided sampling designs for classification of rare outcomes from electronic medical records data

Scalable and accurate identification of specific clinical outcomes has b...
research
07/13/2021

AutoScore-Imbalance: An interpretable machine learning tool for development of clinical scores with rare events data

Background: Medical decision-making impacts both individual and public h...

Please sign up or login with your details

Forgot password? Click here to reset