Debiasing Convolutional Neural Networks via Meta Orthogonalization

11/15/2020
by   Kurtis Evan David, et al.
3

While deep learning models often achieve strong task performance, their successes are hampered by their inability to disentangle spurious correlations from causative factors, such as when they use protected attributes (e.g., race, gender, etc.) to make decisions. In this work, we tackle the problem of debiasing convolutional neural networks (CNNs) in such instances. Building off of existing work on debiasing word embeddings and model interpretability, our Meta Orthogonalization method encourages the CNN representations of different concepts (e.g., gender and class labels) to be orthogonal to one another in activation space while maintaining strong downstream task performance. Through a variety of experiments, we systematically test our method and demonstrate that it significantly mitigates model bias and is competitive against current adversarial debiasing methods.

READ FULL TEXT

page 5

page 7

research
03/09/2020

Joint Multiclass Debiasing of Word Embeddings

Bias in Word Embeddings has been a subject of recent interest, along wit...
research
05/13/2022

Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes

To tackle the rising phenomenon of hate speech, efforts have been made t...
research
12/15/2022

The effects of gender bias in word embeddings on depression prediction

Word embeddings are extensively used in various NLP problems as a state-...
research
02/02/2018

Interpretable Deep Convolutional Neural Networks via Meta-learning

Model interpretability is a requirement in many applications in which cr...
research
06/16/2018

Right for the Right Reason: Training Agnostic Networks

We consider the problem of a neural network being requested to classify ...
research
08/12/2020

Null-sampling for Interpretable and Fair Representations

We propose to learn invariant representations, in the data domain, to ac...
research
11/20/2018

Adversarial Removal of Gender from Deep Image Representations

In this work we analyze visual recognition tasks such as object and acti...

Please sign up or login with your details

Forgot password? Click here to reset