Fair Regression under Sample Selection Bias

10/08/2021
by   Wei Du, et al.
0

Recent research on fair regression focused on developing new fairness notions and approximation methods as target variables and even the sensitive attribute are continuous in the regression setting. However, all previous fair regression research assumed the training data and testing data are drawn from the same distributions. This assumption is often violated in real world due to the sample selection bias between the training and testing data. In this paper, we develop a framework for fair regression under sample selection bias when dependent variable values of a set of samples from the training data are missing as a result of another hidden process. Our framework adopts the classic Heckman model for bias correction and the Lagrange duality to achieve fairness in regression based on a variety of fairness notions. Heckman model describes the sample selection process and uses a derived variable called the Inverse Mills Ratio (IMR) to correct sample selection bias. We use fairness inequality and equality constraints to describe a variety of fairness notions and apply the Lagrange duality theory to transform the primal problem into the dual convex optimization. For the two popular fairness notions, mean difference and mean squared error difference, we derive explicit formulas without iterative optimization, and for Pearson correlation, we derive its conditions of achieving strong duality. We conduct experiments on three real-world datasets and the experimental results demonstrate the approach's effectiveness in terms of both utility and fairness metrics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2021

Robust Fairness-aware Learning Under Sample Selection Bias

The underlying assumption of many machine learning algorithms is that th...
research
04/09/2021

Implementing Fair Regression In The Real World

Most fair regression algorithms mitigate bias towards sensitive sub popu...
research
07/07/2021

Bias-Tolerant Fair Classification

The label bias and selection bias are acknowledged as two reasons in dat...
research
05/25/2023

A Robust Classifier Under Missing-Not-At-Random Sample Selection Bias

The shift between the training and testing distributions is commonly due...
research
09/14/2023

On Prediction Feature Assignment in the Heckman Selection Model

Under missing-not-at-random (MNAR) sample selection bias, the performanc...
research
02/19/2021

Fair Sparse Regression with Clustering: An Invex Relaxation for a Combinatorial Problem

In this paper, we study the problem of fair sparse regression on a biase...
research
11/20/2021

Control Analysis of Packet Transmission Algorithms: Study on Fairness and Stability

This document is a study of fairness, feedback and stability notions of ...

Please sign up or login with your details

Forgot password? Click here to reset