Write It Like You See It: Detectable Differences in Clinical Notes By Race Lead To Differential Model Recommendations

05/08/2022
by   Hammaad Adam, et al.
0

Clinical notes are becoming an increasingly important data source for machine learning (ML) applications in healthcare. Prior research has shown that deploying ML models can perpetuate existing biases against racial minorities, as bias can be implicitly embedded in data. In this study, we investigate the level of implicit race information available to ML models and human experts and the implications of model-detectable differences in clinical notes. Our work makes three key contributions. First, we find that models can identify patient self-reported race from clinical notes even when the notes are stripped of explicit indicators of race. Second, we determine that human experts are not able to accurately predict patient race from the same redacted clinical notes. Finally, we demonstrate the potential harm of this implicit information in a simulation study, and show that models trained on these race-redacted clinical notes can still perpetuate existing biases in clinical treatment decisions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/27/2019

Clinical XLNet: Modeling Sequential Clinical Notes and Predicting Prolonged Mechanical Ventilation

Clinical notes contain rich data, which is unexploited in predictive mod...
research
08/01/2022

Disparate Censorship Undertesting: A Source of Label Bias in Clinical Machine Learning

As machine learning (ML) models gain traction in clinical applications, ...
research
06/17/2023

Reevaluating the Role of Race and Ethnicity in Diabetes Screening

There is active debate over whether to consider patient race and ethnici...
research
06/14/2023

Towards trustworthy seizure onset detection using workflow notes

A major barrier to deploying healthcare AI models is their trustworthine...
research
04/01/2022

Human Evaluation and Correlation with Automatic Metrics in Consultation Note Generation

In recent years, machine learning models have rapidly become better at g...
research
11/14/2017

Unsupervised patient representations from clinical notes with interpretable classification decisions

We have two main contributions in this work: 1. We explore the usage of ...
research
07/06/2017

RIDDLE: Race and ethnicity Imputation from Disease history with Deep LEarning

Anonymized electronic medical records are an increasingly popular source...

Please sign up or login with your details

Forgot password? Click here to reset