Automatically Explaining Machine Learning Prediction Results: A Demonstration on Type 2 Diabetes Risk Prediction

12/06/2018
by   Gang Luo, et al.
0

Background: Predictive modeling is a key component of solutions to many healthcare problems. Among all predictive modeling approaches, machine learning methods often achieve the highest prediction accuracy, but suffer from a long-standing open problem precluding their widespread use in healthcare. Most machine learning models give no explanation for their prediction results, whereas interpretability is essential for a predictive model to be adopted in typical healthcare settings. Methods: This paper presents the first complete method for automatically explaining results for any machine learning predictive model without degrading accuracy. We did a computer coding implementation of the method. Using the electronic medical record data set from the Practice Fusion diabetes classification competition containing patient records from all 50 states in the United States, we demonstrated the method on predicting type 2 diabetes diagnosis within the next year. Results: For the champion machine learning model of the competition, our method explained prediction results for 87.4 diabetes diagnosis within the next year. Conclusions: Our demonstration showed the feasibility of automatically explaining results for any machine learning predictive model without degrading accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/23/2020

Probabilistic Machine Learning for Healthcare

Machine learning can be used to make sense of healthcare data. Probabili...
research
08/01/2017

Application of machine learning for hematological diagnosis

Quick and accurate medical diagnosis is crucial for the successful treat...
research
07/25/2022

MedML: Fusing Medical Knowledge and Machine Learning Models for Early Pediatric COVID-19 Hospitalization and Severity Prediction

The COVID-19 pandemic has caused devastating economic and social disrupt...
research
06/01/2017

Beyond Volume: The Impact of Complex Healthcare Data on the Machine Learning Pipeline

From medical charts to national census, healthcare has traditionally ope...
research
09/07/2021

Sequential Diagnosis Prediction with Transformer and Ontological Representation

Sequential diagnosis prediction on the Electronic Health Record (EHR) ha...
research
12/20/2022

Construction of extra-large scale screening tools for risks of severe mental illnesses using real world healthcare data

Importance: The prevalence of severe mental illnesses (SMIs) in the Unit...
research
03/09/2018

Competitive Machine Learning: Best Theoretical Prediction vs Optimization

Machine learning is often used in competitive scenarios: Participants le...

Please sign up or login with your details

Forgot password? Click here to reset