Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making

by Yuancheng Xu, et al.

Decisions made by machine learning models may have lasting impacts over time, making long-term fairness a crucial consideration. It has been shown that, when the long-term effect is ignored, naively imposing fairness criteria in static settings can actually exacerbate bias over time. To explicitly address biases in sequential decision-making, recent works formulate long-term fairness notions in the Markov Decision Process (MDP) framework. They define the long-term bias as the sum of the static bias over each time step. However, we demonstrate that naively summing up the step-wise bias can create a false sense of fairness, because it ignores that different time steps differ in importance during transitions. In this work, we introduce a long-term fairness notion called Equal Long-term Benefit Rate (ELBERT), which explicitly considers varying temporal importance and adapts static fairness principles to the sequential setting. Moreover, we show that the policy gradient of the Long-term Benefit Rate can be analytically reduced to the standard policy gradient. This makes standard policy optimization methods applicable for reducing the bias, leading to our proposed bias mitigation method, ELBERT-PO. Experiments on three sequential decision-making environments show that ELBERT-PO significantly reduces bias and maintains high utility. Code is available at https://github.com/Yuancheng-Xu/ELBERT.
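The contrast between summing step-wise quantities and accounting for temporal importance can be illustrated with a small sketch. It is not taken from the ELBERT repository. It assumes, beyond what the abstract states, that each group receives a "supply" and has a "demand" at every time step, that the step-wise notion compares per-group sums of the step-wise rates supply/demand, and that the long-term benefit rate is the ratio of cumulative supply to cumulative demand. The function names and all numbers are hypothetical.

```python
from typing import Dict, List, Tuple

# group name -> list of (supply, demand) pairs, one per time step.
# All values are made up for illustration.
Trajectory = Dict[str, List[Tuple[float, float]]]


def stepwise_sum_gap(traj: Trajectory) -> float:
    """Gap between groups when step-wise rates are simply summed (assumed baseline)."""
    totals = [sum(s / d for s, d in steps) for steps in traj.values()]
    return max(totals) - min(totals)


def benefit_rate_gap(traj: Trajectory) -> float:
    """Gap between groups in cumulative supply divided by cumulative demand
    (assumed form of the long-term benefit rate)."""
    rates = [
        sum(s for s, _ in steps) / sum(d for _, d in steps)
        for steps in traj.values()
    ]
    return max(rates) - min(rates)


# Two steps: the second step involves far more demand than the first,
# so it should matter more for the long-term outcome.
toy: Trajectory = {
    "group_a": [(1.0, 2.0), (50.0, 100.0)],   # step-wise rates 0.5 and 0.5
    "group_b": [(0.0, 1.0), (100.0, 100.0)],  # step-wise rates 0.0 and 1.0
}

summed_gap = stepwise_sum_gap(toy)   # both groups sum to 1.0
ratio_gap = benefit_rate_gap(toy)    # 51/102 versus 100/101

print(f"gap under summed step-wise rates: {summed_gap:.3f}")
print(f"gap under cumulative ratio:       {ratio_gap:.3f}")
```

In this toy trajectory the summed step-wise rates are identical for both groups, while the cumulative ratios differ substantially, because the low-demand first step is weighted the same as the high-demand second step in the sum. Under the same ratio assumption, the quotient rule expresses the gradient of the rate through the gradients of two ordinary cumulative quantities, which is consistent with the abstract's claim that standard policy gradient methods apply.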



Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems

Long-term fairness is an important factor of consideration in designing ...

Achieving Long-Term Fairness in Sequential Decision Making

In this paper, we propose a framework for achieving long-term fair seque...

Equal Improvability: A New Fairness Notion Considering the Long-term Impact

Devising a fair classifier that does not discriminate against different ...

Deciding Not To Decide

Sometimes unexpected, novel, unconceivable events enter our lives. The c...

State-Visitation Fairness in Average-Reward MDPs

Fairness has emerged as an important concern in automated decision-makin...

Spatial epidemiology and adaptive targeted sampling to manage the Chagas disease vector Triatoma dimidiata

Widespread application of insecticide remains the primary form of contro...

Delayed Impact of Fair Machine Learning

Fairness in machine learning has predominantly been studied in static cl...
