Global and Local Interpretation of black-box Machine Learning models to determine prognostic factors from early COVID-19 data

09/10/2021
by   Ananya Jana, et al.
5

The COVID-19 corona virus has claimed 4.1 million lives, as of July 24, 2021. A variety of machine learning models have been applied to related data to predict important factors such as the severity of the disease, infection rate and discover important prognostic factors. Often the usefulness of the findings from the use of these techniques is reduced due to lack of method interpretability. Some recent progress made on the interpretability of machine learning models has the potential to unravel more insights while using conventional machine learning models. In this work, we analyze COVID-19 blood work data with some of the popular machine learning models; then we employ state-of-the-art post-hoc local interpretability techniques(e.g.- SHAP, LIME), and global interpretability techniques(e.g. - symbolic metamodeling) to the trained black-box models to draw interpretable conclusions. In the gamut of machine learning algorithms, regressions remain one of the simplest and most explainable models with clear mathematical formulation. We explore one of the most recent techniques called symbolic metamodeling to find the mathematical expression of the machine learning models for COVID-19. We identify Acute Kidney Injury (AKI), initial Albumin level (ALBI), Aspartate aminotransferase (ASTI), Total Bilirubin initial(TBILI) and D-Dimer initial (DIMER) as major prognostic factors of the disease severity. Our contributions are- (i) uncover the underlying mathematical expression for the black-box models on COVID-19 severity prediction task (ii) we are the first to apply symbolic metamodeling to this task, and (iii) discover important features and feature interactions.

READ FULL TEXT
research
09/30/2020

Is AI Model Interpretable to Combat with COVID? An Empirical Study on Severity Prediction Task

Black-box nature hinders the deployment of many high-accuracy models in ...
research
04/12/2021

An Approach to Symbolic Regression Using Feyn

In this article we introduce the supervised machine learning tool called...
research
01/03/2021

Combining Graph Neural Networks and Spatio-temporal Disease Models to Predict COVID-19 Cases in Germany

During 2020, the infection rate of COVID-19 has been investigated by man...
research
01/18/2021

Interactive slice visualization for exploring machine learning models

Machine learning models fit complex algorithms to arbitrarily large data...
research
09/10/2020

Actionable Interpretation of Machine Learning Models for Sequential Data: Dementia-related Agitation Use Case

Machine learning has shown successes for complex learning problems in wh...
research
07/12/2022

Using Interpretable Machine Learning to Predict Maternal and Fetal Outcomes

Most pregnancies and births result in a good outcome, but complications ...
research
10/21/2021

Using NASA Satellite Data Sources and Geometric Deep Learning to Uncover Hidden Patterns in COVID-19 Clinical Severity

As multiple adverse events in 2021 illustrated, virtually all aspects of...

Please sign up or login with your details

Forgot password? Click here to reset