An Empirical Characterization of Fair Machine Learning For Clinical Risk Prediction

07/20/2020
by   Stephen R. Pfohl, et al.
17

The use of machine learning to guide clinical decision making has the potential to worsen existing health disparities. Several recent works frame the problem as that of algorithmic fairness, a framework that has attracted considerable attention and criticism. However, the appropriateness of this framework is unclear due to both ethical as well as technical considerations, the latter of which include trade-offs between measures of fairness and model performance that are not well-understood for predictive models of clinical outcomes. To inform the ongoing debate, we conduct an empirical study to characterize the impact of penalizing group fairness violations on an array of measures of model performance and group fairness. We repeat the analyses across multiple observational healthcare databases, clinical outcomes, and sensitive attributes. We find that procedures that penalize differences between the distributions of predictions across groups induce nearly-universal degradation of multiple performance metrics within groups. On examining the secondary impact of these procedures, we observe heterogeneity of the effect of these procedures on measures of fairness in calibration and ranking across experimental conditions. Beyond the reported trade-offs, we emphasize that analyses of algorithmic fairness in healthcare lack the contextual grounding and causal awareness necessary to reason about the mechanisms that lead to health disparities, as well as about the potential of algorithmic fairness methods to counteract those mechanisms. In light of these limitations, we encourage researchers building predictive models for clinical use to step outside the algorithmic fairness frame and engage critically with the broader sociotechnical context surrounding the use of machine learning in healthcare.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/03/2022

Net benefit, calibration, threshold selection, and training objectives for algorithmic fairness in healthcare

A growing body of work uses the paradigm of algorithmic fairness to fram...
research
06/30/2020

Evaluation of Fairness Trade-offs in Predicting Student Success

Predictive models for identifying at-risk students early can help teachi...
research
07/14/2019

Counterfactual Reasoning for Fair Clinical Risk Prediction

The use of machine learning systems to support decision making in health...
research
10/28/2022

Mitigating Health Disparities in EHR via Deconfounder

Health disparities, or inequalities between different patient demographi...
research
08/10/2021

Retiring Adult: New Datasets for Fair Machine Learning

Although the fairness community has recognized the importance of data, r...
research
07/25/2022

Representational Ethical Model Calibration

Equity is widely held to be fundamental to the ethics of healthcare. In ...
research
06/04/2022

When Personalization Harms: Reconsidering the Use of Group Attributes in Prediction

The standard approach to personalization in machine learning consists of...

Please sign up or login with your details

Forgot password? Click here to reset