Discover and Mitigate Unknown Biases with Debiasing Alternate Networks

07/20/2022
by   Zhiheng Li, et al.
0

Deep image classifiers have been found to learn biases from datasets. To mitigate the biases, most previous methods require labels of protected attributes (e.g., age, skin tone) as full-supervision, which has two limitations: 1) it is infeasible when the labels are unavailable; 2) they are incapable of mitigating unknown biases – biases that humans do not preconceive. To resolve those problems, we propose Debiasing Alternate Networks (DebiAN), which comprises two networks – a Discoverer and a Classifier. By training in an alternate manner, the discoverer tries to find multiple unknown biases of the classifier without any annotations of biases, and the classifier aims at unlearning the biases identified by the discoverer. While previous works evaluate debiasing results in terms of a single bias, we create Multi-Color MNIST dataset to better benchmark mitigation of multiple biases in a multi-bias setting, which not only reveals the problems in previous methods but also demonstrates the advantage of DebiAN in identifying and mitigating multiple biases simultaneously. We further conduct extensive experiments on real-world datasets, showing that the discoverer in DebiAN can identify unknown biases that may be hard to be found by humans. Regarding debiasing, DebiAN achieves strong bias mitigation performance.

READ FULL TEXT

page 13

page 31

research
04/20/2022

Epistemic Uncertainty-Weighted Loss for Visual Bias Mitigation

Deep neural networks are highly susceptible to learning biases in visual...
research
04/29/2021

Discover the Unknown Biased Attribute of an Image Classifier

Recent works find that AI algorithms learn biases from data. Therefore, ...
research
08/19/2023

Partition-and-Debias: Agnostic Biases Mitigation via A Mixture of Biases-Specific Experts

Bias mitigation in image classification has been widely researched, and ...
research
10/27/2021

Feature and Label Embedding Spaces Matter in Addressing Image Classifier Bias

This paper strives to address image classifier bias, with a focus on bot...
research
11/05/2020

Investigating Societal Biases in a Poetry Composition System

There is a growing collection of work analyzing and mitigating societal ...
research
04/28/2022

Learning to Split for Automatic Bias Detection

Classifiers are biased when trained on biased datasets. As a remedy, we ...
research
06/14/2021

Mitigating Biases in Toxic Language Detection through Invariant Rationalization

Automatic detection of toxic language plays an essential role in protect...

Please sign up or login with your details

Forgot password? Click here to reset