A Survey on Techniques for Identifying and Resolving Representation Bias in Data

03/22/2022
by   Nima Shahbazi, et al.
0

The grand goal of data-driven decision-making is to help humans make decisions, not only easily and at scale but also wisely, accurately, and just. However, data-driven algorithms are only as good as the data they work with, while data sets, especially social data, often miss representing minorities. Representation Bias in data can happen due to various reasons ranging from historical discrimination to selection and sampling biases in the data acquisition and preparation methods. One cannot expect AI-based societal solutions to have equitable outcomes without addressing the representation bias. This paper surveys the existing literature on representation bias in the data. It presents a taxonomy to categorize the studied techniques based on multiple design dimensions and provide a side-by-side comparison of their properties. There is still a long way to fully address representation bias issues in data. The authors hope that this survey motivates researchers to approach these challenges in the future by observing existing work within their respective domains.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/23/2019

A Survey on Bias and Fairness in Machine Learning

With the widespread use of AI systems and applications in our everyday l...
research
01/14/2020

Bias in Data-driven AI Systems – An Introductory Survey

AI-based systems are widely employed nowadays to make decisions that hav...
research
10/07/2020

Bias and Debias in Recommender System: A Survey and Future Directions

While recent years have witnessed a rapid growth of research papers on r...
research
01/02/2021

Mining the online infosphere: A survey

The evolution of AI-based system and applications had pervaded everyday ...
research
10/14/2022

A Survey of Parameters Associated with the Quality of Benchmarks in NLP

Several benchmarks have been built with heavy investment in resources to...
research
04/01/2020

Bias in Machine Learning What is it Good (and Bad) for?

In public media as well as in scientific publications, the term bias is ...

Please sign up or login with your details

Forgot password? Click here to reset