Learning Not to Learn: Training Deep Neural Networks with Biased Data

12/26/2018
by   Byungju Kim, et al.
0

We propose a novel regularization algorithm to train deep neural networks, in which data at training time is severely biased. Since a neural network efficiently learns data distribution, a network is likely to learn the bias information to categorize input data. It leads to poor performance at test time, if the bias is, in fact, irrelevant to the categorization. In this paper, we formulate a regularization loss based on mutual information between feature embedding and bias. Based on the idea of minimizing this mutual information, we propose an iterative algorithm to unlearn the bias information. We employ an additional network to predict the bias distribution and train the network adversarially against the feature embedding network. At the end of learning, the bias prediction network is not able to predict the bias not because it is poorly trained, but because the feature embedding network successfully unlearns the bias information. We also demonstrate quantitative and qualitative experimental results which show that our algorithm effectively removes the bias information from feature embedding.

READ FULL TEXT

page 5

page 8

research
05/18/2021

A note on the unbiased estimation of mutual information

Estimators for mutual information are typically biased. However, in the ...
research
05/13/2018

Doing the impossible: Why neural networks can be trained at all

As deep neural networks grow in size, from thousands to millions to bill...
research
08/11/2021

Learning Bias-Invariant Representation by Cross-Sample Mutual Information Minimization

Deep learning algorithms mine knowledge from the training data and thus ...
research
03/13/2020

Learning Unbiased Representations via Mutual Information Backpropagation

We are interested in learning data-driven representations that can gener...
research
12/14/2022

Unsupervised Detection of Contextualized Embedding Bias with Application to Ideology

We propose a fully unsupervised method to detect bias in contextualized ...
research
02/21/2022

Toward more generalized Malicious URL Detection Models

This paper reveals a data bias issue that can severely affect the perfor...
research
01/07/2023

ExcelFormer: A Neural Network Surpassing GBDTs on Tabular Data

Though neural networks have achieved enormous breakthroughs on various f...

Please sign up or login with your details

Forgot password? Click here to reset