MEDFAIR: Benchmarking Fairness for Medical Imaging

10/04/2022
by Yongshuo Zong, et al.

A large body of work has shown that machine learning-based medical diagnosis systems can be biased against certain subgroups of people. This has motivated a growing number of bias mitigation algorithms that aim to address fairness issues in machine learning. However, it is difficult to compare their effectiveness in medical imaging for two reasons. First, there is little consensus on the criteria for assessing fairness. Second, existing bias mitigation algorithms are developed under different settings (e.g., datasets, model selection strategies, backbones, and fairness metrics), which makes a direct comparison based on existing results impossible. In this work, we introduce MEDFAIR, a framework to benchmark the fairness of machine learning models for medical imaging. MEDFAIR covers eleven algorithms from various categories, nine datasets from different imaging modalities, and three model selection criteria. Through extensive experiments, we find that the under-studied issue of model selection criterion can have a significant impact on fairness outcomes. In contrast, state-of-the-art bias mitigation algorithms do not significantly improve fairness outcomes over empirical risk minimization (ERM) in either in-distribution or out-of-distribution settings. We evaluate fairness from various perspectives and make recommendations for different medical application scenarios that require different ethical principles. Our framework provides a reproducible and easy-to-use entry point for the development and evaluation of future bias mitigation algorithms in deep learning. Code is available at https://github.com/ys-zong/MEDFAIR.
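To make the point about model selection concrete, the following minimal sketch shows how two selection rules can pick different checkpoints from the same validation results. It is an illustration only and does not use the MEDFAIR code or API. The helper names (`group_accuracy`, `summarize`), the toy labels, the two subgroups "A" and "B", and the two selection rules (best overall accuracy versus best worst-group accuracy) are all assumptions chosen for the example; they are not necessarily the three criteria or the metrics used in the benchmark.

```python
from collections import defaultdict


def group_accuracy(y_true, y_pred, groups):
    """Return accuracy per subgroup as a dict {group: accuracy}."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for t, p, g in zip(y_true, y_pred, groups):
        total[g] += 1
        correct[g] += int(t == p)
    return {g: correct[g] / total[g] for g in total}


def summarize(y_true, y_pred, groups):
    """Overall accuracy, worst-group accuracy, and the best-to-worst gap."""
    per_group = group_accuracy(y_true, y_pred, groups)
    overall = sum(int(t == p) for t, p in zip(y_true, y_pred)) / len(y_true)
    worst = min(per_group.values())
    gap = max(per_group.values()) - worst
    return {"overall": overall, "worst_group": worst, "gap": gap}


# Hypothetical validation set: six samples from group A, two from group B.
y_true = [1, 0, 1, 0, 1, 0, 1, 0]
groups = ["A", "A", "A", "A", "A", "A", "B", "B"]

# Hypothetical predictions from two checkpoints of the same model.
checkpoints = {
    "ckpt_1": [1, 0, 1, 0, 1, 0, 1, 1],  # strong on A, weaker on B
    "ckpt_2": [1, 0, 1, 0, 0, 1, 1, 0],  # weaker on A, strong on B
}

summaries = {
    name: summarize(y_true, preds, groups)
    for name, preds in checkpoints.items()
}

# Rule 1 (assumed): pick the checkpoint with the best overall accuracy.
best_overall = max(summaries, key=lambda n: summaries[n]["overall"])
# Rule 2 (assumed): pick the checkpoint with the best worst-group accuracy.
best_worst_group = max(summaries, key=lambda n: summaries[n]["worst_group"])

print("selected by overall accuracy:    ", best_overall)
print("selected by worst-group accuracy:", best_worst_group)
```

In this toy setting the two rules disagree: the checkpoint with the higher overall accuracy has the larger gap between subgroups, so the choice of selection rule alone changes the fairness outcome that would be reported.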

research
04/01/2021

Model Selection's Disparate Impact in Real-World Deep Learning Applications

Algorithmic fairness has emphasized the role of biased data in automated...
research
07/31/2023

No Fair Lunch: A Causal Perspective on Dataset Bias in Machine Learning for Medical Imaging

As machine learning methods gain prominence within clinical decision-mak...
research
03/18/2021

How I failed machine learning in medical imaging – shortcomings and recommendations

Medical imaging is an important research field with many opportunities f...
research
02/12/2021

Technical Challenges for Training Fair Neural Networks

As machine learning algorithms have been widely deployed across applicat...
research
02/11/2023

Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLP

Modern NLP systems exhibit a range of biases, which a growing literature...
research
05/31/2022

Inducing bias is simpler than you think

Machine learning may be oblivious to human bias but it is not immune to ...
research
07/12/2023

Ecosystem-level Analysis of Deployed Machine Learning Reveals Homogeneous Outcomes

Machine learning is traditionally studied at the model level: researcher...
