Discover the Unknown Biased Attribute of an Image Classifier

04/29/2021
by   Zhiheng Li, et al.
Recent work has found that AI algorithms learn biases from data, so identifying those biases is urgent. However, existing bias identification pipelines rely heavily on human experts to conjecture potential biases (e.g., gender), and can therefore miss biases that humans have not thought of. To help human experts find biases in AI algorithms, we study a new problem in this work: for a classifier that predicts a target attribute of the input image, discover its unknown biased attribute. To solve this challenging problem, we use a hyperplane in a generative model's latent space to represent an image attribute; the original problem is thus transformed into optimizing the hyperplane's normal vector and offset. Within this framework, we propose a novel total-variation loss as the objective function and a new orthogonalization penalty as a constraint. The latter prevents trivial solutions in which the discovered biased attribute is identical to the target attribute or to one of the known-biased attributes. Extensive experiments on both disentanglement datasets and real-world datasets show that our method can discover biased attributes and achieve better disentanglement w.r.t. the target attributes. Furthermore, qualitative results show that our method can discover hard-to-notice biased attributes for various object and scene classifiers, demonstrating its generalizability for detecting biased attributes across diverse image domains. The code is available at https://git.io/J3kMh.
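
The hyperplane formulation in the abstract can be illustrated with a small toy sketch. It is not the authors' implementation: the function names, the way a latent code is moved across the hyperplane, the exact form of the total-variation term, and the squared-cosine form of the orthogonalization penalty are all assumptions made for illustration. A hand-written logistic function on the latent code stands in for the generator plus image classifier.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def norm(u):
    return math.sqrt(dot(u, u))

def traverse(z, w, b, steps):
    """Project z onto the hyperplane {x : w.x + b = 0}, then move the
    projection along the unit normal by each signed distance in steps."""
    length = norm(w)
    unit = [wi / length for wi in w]
    signed_dist = (dot(w, z) + b) / length
    z_proj = [zi - signed_dist * ui for zi, ui in zip(z, unit)]
    return [[pi + s * ui for pi, ui in zip(z_proj, unit)] for s in steps]

def total_variation(probs):
    """Sum of absolute changes between consecutive predictions (assumed form)."""
    return sum(abs(q - p) for p, q in zip(probs, probs[1:]))

def orthogonality_penalty(w_a, w_b):
    """Squared cosine similarity between two normal vectors (assumed form)."""
    cos = dot(w_a, w_b) / (norm(w_a) * norm(w_b))
    return cos ** 2

def toy_classifier(z, weights):
    """Hypothetical stand-in for classifier(generator(z)): a logistic score."""
    return 1.0 / (1.0 + math.exp(-dot(z, weights)))

# Toy setup (all values hypothetical): the classifier responds to latent
# dimension 0 (the target attribute) and also to dimension 1 (a hidden bias).
classifier_weights = [1.0, 0.8, 0.0]
target_normal = [1.0, 0.0, 0.0]
z = [0.5, -0.3, 1.2]
steps = [-2.0, -1.0, 0.0, 1.0, 2.0]

for candidate in ([1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]):
    path = traverse(z, candidate, 0.0, steps)
    probs = [toy_classifier(p, classifier_weights) for p in path]
    print(candidate,
          round(total_variation(probs), 3),
          round(orthogonality_penalty(candidate, target_normal), 3))
```

In this toy setting, a candidate normal along dimension 1 changes the prediction while staying orthogonal to the target normal, which is the kind of direction the abstract calls a biased attribute. The candidate along dimension 0 also changes the prediction but coincides with the target attribute, the trivial solution that the penalty is meant to rule out.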

research
07/20/2022

Discover and Mitigate Unknown Biases with Debiasing Alternate Networks

Deep image classifiers have been found to learn biases from datasets. To...
research
06/22/2022

Learning Debiased Classifier with Biased Committee

Neural networks are prone to be biased towards spurious correlations bet...
research
04/12/2022

VisCUIT: Visual Auditor for Bias in CNN Image Classifier

CNN image classifiers are widely used, thanks to their efficiency and ac...
research
12/02/2020

Fair Attribute Classification through Latent Space De-biasing

Fairness in visual recognition is becoming a prominent and critical topi...
research
08/11/2022

A Comprehensive Analysis of AI Biases in DeepFake Detection With Massively Annotated Databases

In recent years, image and video manipulations with DeepFake have become...
research
07/06/2020

Learning from Failure: Training Debiased Classifier from Biased Classifier

Neural networks often learn to make predictions that overly rely on spur...
research
09/02/2019

Analysis of Bias in Gathering Information Between User Attributes in News Application

In the process of information gathering on the web, confirmation bias is...
