State-Visitation Fairness in Average-Reward MDPs

02/14/2021
by   Ganesh Ghalme, et al.
0

Fairness has emerged as an important concern in automated decision-making in recent years, especially when these decisions affect human welfare. In this work, we study fairness in temporally extended decision-making settings, specifically those formulated as Markov Decision Processes (MDPs). Our proposed notion of fairness ensures that each state's long-term visitation frequency is more than a specified fraction. In an average-reward MDP (AMDP) setting, we formulate the problem as a bilinear saddle point program and, for a generative model, solve it using a Stochastic Mirror Descent (SMD) based algorithm. The proposed solution guarantees a simultaneous approximation on the expected average-reward and the long-term state-visitation frequency. We validate our theoretical results with experiments on synthetic data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/22/2022

Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems

Long-term fairness is an important factor of consideration in designing ...
research
01/24/2019

Fairness with Dynamics

It has recently been shown that if feedback effects of decisions are ign...
research
09/07/2023

Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making

Decisions made by machine learning models may have lasting impacts over ...
research
02/27/2021

Parallel Stochastic Mirror Descent for MDPs

We consider the problem of learning the optimal policy for infinite-hori...
research
06/25/2017

Specifying Non-Markovian Rewards in MDPs Using LDL on Finite Traces (Preliminary Version)

In Markov Decision Processes (MDPs), the reward obtained in a state depe...
research
11/19/2021

Towards Return Parity in Markov Decision Processes

Algorithmic decisions made by machine learning models in high-stakes dom...
research
12/21/2018

The Design and Implementation of XiaoIce, an Empathetic Social Chatbot

This paper describes the development of the Microsoft XiaoIce system, th...

Please sign up or login with your details

Forgot password? Click here to reset