Exploring Text Specific and Blackbox Fairness Algorithms in Multimodal Clinical NLP

11/19/2020
by   John Chen, et al.
27

Clinical machine learning is increasingly multimodal, collected in both structured tabular formats and unstructured forms such as freetext. We propose a novel task of exploring fairness on a multimodal clinical dataset, adopting equalized odds for the downstream medical prediction tasks. To this end, we investigate a modality-agnostic fairness algorithm - equalized odds post processing - and compare it to a text-specific fairness algorithm: debiased clinical word embeddings. Despite the fact that debiased word embeddings do not explicitly address equalized odds of protected groups, we show that a text-specific approach to fairness may simultaneously achieve a good balance of performance and classical notions of fairness. We hope that our paper inspires future contributions at the critical intersection of clinical NLP and fairness. The full source code is available here: https://github.com/johntiger1/multimodal_fairness

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/11/2019

ETNLP: A Toolkit for Extraction, Evaluation and Visualization of Pre-trained Word Embeddings

In this paper, we introduce a comprehensive toolkit, ETNLP, which can ev...
research
06/02/2023

Word Embeddings for Banking Industry

Applications of Natural Language Processing (NLP) are plentiful, from se...
research
03/11/2020

Hurtful Words: Quantifying Biases in Clinical Contextual Word Embeddings

In this work, we examine the extent to which embeddings may encode margi...
research
03/29/2021

Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models

Recent studies have revealed a security threat to natural language proce...
research
06/15/2023

FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods

This paper introduces the Fair Fairness Benchmark (), a benchmarking fra...
research
08/02/2022

Gender bias in (non)-contextual clinical word embeddings for stereotypical medical categories

Clinical word embeddings are extensively used in various Bio-NLP problem...
research
10/29/2021

Measuring a Texts Fairness Dimensions Using Machine Learning Based on Social Psychological Factors

Fairness is a principal social value that can be observed in civilisatio...

Please sign up or login with your details

Forgot password? Click here to reset