Censoring Representations with an Adversary

11/18/2015
by Harrison Edwards, et al.

In practice, there are often explicit constraints on what representations or decisions are acceptable in an application of machine learning. For example, it may be a legal requirement that a decision must not favour a particular group. Alternatively, it may be that the representation of data must not contain identifying information. We address these two related issues by learning flexible representations that minimize the capability of an adversarial critic. This adversary tries to predict the relevant sensitive variable from the representation, so minimizing the performance of the adversary ensures that there is little or no information in the representation about the sensitive variable. We demonstrate this adversarial approach on two problems: making decisions free from discrimination and removing private information from images. We formulate the adversarial model as a minimax problem, and optimize that minimax objective using an alternating stochastic-gradient min-max optimizer. We demonstrate the ability to provide discrimination-free representations for standard test problems, and compare with previous state-of-the-art methods for fairness, showing statistically significant improvement across most cases. The flexibility of this method is shown via a novel problem: removing annotations from images, using unaligned training examples of annotated and unannotated images, and with no a priori knowledge of the form of the annotation provided to the model.
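
The alternating min-max idea described in the abstract can be illustrated with a small toy sketch. This is not the authors' implementation: the data generator, the scalar linear encoder, the logistic adversary and predictor, and every name (`make_data`, `train`, `lam`) are assumptions made for illustration only. An adversary takes one gradient step to predict the sensitive bit `s` from the representation `z`, then the encoder and predictor take one step to predict the label `y` while increasing the adversary's loss.

```python
import math
import random


def sigmoid(t):
    # Numerically stable logistic function.
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)


def log_loss(prob, target):
    # Binary cross-entropy for one example, clamped to avoid log(0).
    prob = min(max(prob, 1e-12), 1.0 - 1e-12)
    return -(target * math.log(prob) + (1 - target) * math.log(1.0 - prob))


def make_data(n, rng):
    # Hypothetical toy data: x1 leaks the sensitive bit s, x2 carries the label y.
    data = []
    for _ in range(n):
        s, y = rng.randint(0, 1), rng.randint(0, 1)
        x1 = (2 * s - 1) + rng.gauss(0.0, 0.5)
        x2 = (2 * y - 1) + rng.gauss(0.0, 0.5)
        data.append((x1, x2, s, y))
    return data


def train(data, steps=1000, lr=0.05, lam=1.0, seed=0):
    # lam (assumed name) weights the adversarial term in the encoder objective.
    rng = random.Random(seed)
    w1, w2 = 0.1, 0.1  # encoder: z = w1*x1 + w2*x2
    v, b = 0.1, 0.0    # predictor of y from z
    u, c = 0.1, 0.0    # adversary predicting s from z
    for _ in range(steps):
        x1, x2, s, y = rng.choice(data)
        z = w1 * x1 + w2 * x2

        # Max step: the adversary descends its own loss, encoder held fixed.
        err_s = sigmoid(u * z + c) - s
        u -= lr * err_s * z
        c -= lr * err_s

        # Min step: encoder and predictor descend L_y - lam * L_s,
        # adversary held fixed.
        err_s = sigmoid(u * z + c) - s
        err_y = sigmoid(v * z + b) - y
        grad_z = err_y * v - lam * err_s * u
        v -= lr * err_y * z
        b -= lr * err_y
        w1 -= lr * grad_z * x1
        w2 -= lr * grad_z * x2
    return {"w1": w1, "w2": w2, "v": v, "b": b, "u": u, "c": c}
```

Calling `train(make_data(200, random.Random(0)))` returns the learned weights. Comparing runs with `lam=0.0` and `lam=1.0`, for instance by measuring how well a freshly trained classifier recovers `s` from `z`, is one way to probe how much sensitive information the representation retains in this toy setting.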

research
09/06/2019

Approaching Machine Learning Fairness through Adversarial Network

Fairness is becoming a rising concern w.r.t. machine learning model perf...
research
05/24/2018

Evading the Adversary in Invariant Representation

Representations of data that are invariant to changes in specified nuisa...
research
09/12/2021

Adversarial Representation Learning With Closed-Form Solvers

Adversarial representation learning aims to learn data representations f...
research
09/02/2022

Exploiting Fairness to Enhance Sensitive Attributes Reconstruction

In recent years, a growing body of work has emerged on how to learn mach...
research
09/07/2020

Learning Unbiased Representations via Rényi Minimization

In recent years, significant work has been done to include fairness cons...
research
01/22/2018

Mitigating Unwanted Biases with Adversarial Learning

Machine learning is a tool for building models that accurately represent...
