Biases in Data Science Lifecycle

09/10/2020
by   Dinh-An Ho, et al.
0

In recent years, data science has become an indispensable part of our society. Over time, we have become reliant on this technology because of its opportunity to gain value and new insights from data in any field - business, socializing, research and society. At the same time, it raises questions about how justified we are in placing our trust in these technologies. There is a risk that such powers may lead to biased, inappropriate or unintended actions. Therefore, ethical considerations which might occur as the result of data science practices should be carefully considered and these potential problems should be identified during the data science lifecycle and mitigated if possible. However, a typical data scientist has not enough knowledge for identifying these challenges and it is not always possible to include an ethics expert during data science production. The aim of this study is to provide a practical guideline to data scientists and increase their awareness. In this work, we reviewed different sources of biases and grouped them under different stages of the data science lifecycle. The work is still under progress. The aim of early publishing is to collect community feedback and improve the curated knowledge base for bias types and solutions.

READ FULL TEXT
research
07/21/2017

Data, Science and Society

Reflections on the Concept of Data and its Implications for Science and ...
research
02/12/2020

The Big Three: A Methodology to Increase Data Science ROI by Answering the Questions Companies Care About

Companies may be achieving only a third of the value they could be getti...
research
07/21/2019

Conscientious Classification: A Data Scientist's Guide to Discrimination-Aware Classification

Recent research has helped to cultivate growing awareness that machine l...
research
11/06/2018

Data Science as Political Action: Grounding Data Science in a Politics of Justice

In response to recent controversies, the field of data science has rushe...
research
01/08/2019

Problem Formulation and Fairness

Formulating data science problems is an uncertain and difficult process....
research
08/19/2022

Atomist or Holist? A Diagnosis and Vision for More Productive Interdisciplinary AI Ethics Dialogue

In response to growing recognition of the social, legal, and ethical imp...
research
03/27/2023

Philosophical Foundations of GeoAI: Exploring Sustainability, Diversity, and Bias in GeoAI and Spatial Data Science

This chapter presents some of the fundamental assumptions and principles...

Please sign up or login with your details

Forgot password? Click here to reset