An introduction to causal reasoning in health analytics

05/10/2021
by Wenhao Zhang, et al.

A data science task can be viewed as making sense of data and/or testing a hypothesis about it. The conclusions inferred from data can guide us toward informed decisions. Big data, in conjunction with machine learning, has enabled countless prediction tasks, such as identifying patients at high risk of a certain disease so that preventive measures can be taken. However, healthcare practitioners are not content with mere predictions; they are also interested in the cause-effect relations between input features and clinical outcomes. Understanding such relations helps doctors treat patients and reduce risk effectively. Causality is typically established through randomized controlled trials. Often such trials are not feasible, in which case scientists and researchers turn to observational studies and attempt to draw inferences from them. However, observational studies may be affected by selection and/or confounding biases that can lead to wrong causal conclusions. In this chapter, we highlight some of the drawbacks that can arise when traditional machine learning and statistical approaches are used to analyze observational data, particularly in the healthcare data analytics domain. We discuss causal inference and ways to discover cause-effect relations from observational studies in the healthcare domain. Moreover, we demonstrate applications of causal inference to some common machine learning issues, such as missing data and model transportability. Finally, we discuss the possibility of integrating reinforcement learning with causality as a way to counter confounding bias.

research
05/23/2023

Statistical causal inference methods for observational research in PER: a primer

Recent critiques of Physics Education Research (PER) studies have revoic...
research
04/28/2018

Data science is science's second chance to get causal inference right: A classification of data science tasks

Causal inference from observational data is the goal of many health and ...
research
05/07/2021

Precise Unbiased Estimation in Randomized Experiments using Auxiliary Observational Data

Randomized controlled trials (RCTs) are increasingly prevalent in educat...
research
01/07/2022

Similarities and Differences between Machine Learning and Traditional Advanced Statistical Modeling in Healthcare Analytics

Data scientists and statisticians are often at odds when determining the...
research
04/21/2023

A Common Misassumption in Online Experiments with Machine Learning Models

Online experiments such as Randomised Controlled Trials (RCTs) or A/B-te...
research
10/21/2019

Causal bootstrapping

To draw scientifically meaningful conclusions and build reliable models ...
