Log In Sign Up

Machine learning using longitudinal prescription and medical claims for the detection of nonalcoholic steatohepatitis (NASH)

by   Ozge Yasar, et al.

Objectives To develop and evaluate machine learning models to detect suspected undiagnosed nonalcoholic steatohepatitis (NASH) patients for diagnostic screening and clinical management. Methods In this retrospective observational noninterventional study using administrative medical claims data from 1,463,089 patients, gradient-boosted decision trees were trained to detect likely NASH patients from an at-risk patient population with a history of obesity, type 2 diabetes mellitus (T2DM), metabolic disorder, or nonalcoholic fatty liver (NAFL). Models were trained to detect likely NASH in all at-risk patients or in the subset without a prior NAFL diagnosis (non-NAFL at-risk patients). Models were trained and validated using retrospective medical claims data and assessed using area under precision recall and receiver operating characteristic curves (AUPRCs, AUROCs). Results The 6-month incidence of NASH in claims data was 1 per 1,437 at-risk patients and 1 per 2,127 non-NAFL at-risk patients. The model trained to detect NASH in all at-risk patients had an AUPRC of 0.0107 (95 an AUROC of 0.84. At 10 NASH incidence. The model trained to detect NASH in non-NAFL patients had an AUPRC of 0.003 (95 model precision was 1 Conclusion The low incidence of NASH in medical claims data corroborates the pattern of NASH underdiagnosis in clinical practice. Claims-based machine learning could facilitate the detection of probable NASH patients for diagnostic testing and disease management.


page 1

page 2

page 3

page 4


Rare Disease Detection by Sequence Modeling with Generative Adversarial Networks

Rare diseases affecting 350 million individuals are commonly associated ...

Suicide Risk Modeling with Uncertain Diagnostic Records

Motivated by the pressing need for suicide prevention through improving ...

The Atrial Fibrillation Risk Score for Hyperthyroidism Patients

Thyrotoxicosis (TT) is associated with an increase in both total and car...

Machine Learning with Abstention for Automated Liver Disease Diagnosis

This paper presents a novel approach for detection of liver abnormalitie...