Gender Bias in Big Data Analysis

11/17/2022
by   Thomas J. Misa, et al.
0

This article combines humanistic "data critique" with informed inspection of big data analysis. It measures gender bias when gender prediction software tools (Gender API, Namsor, and Genderize.io) are used in historical big data research. Gender bias is measured by contrasting personally identified computer science authors in the well-regarded DBLP dataset (1950-1980) with exactly comparable results from the software tools. Implications for public understanding of gender bias in computing and the nature of the computing profession are outlined. Preliminary assessment of the Semantic Scholar dataset is presented. The conclusion combines humanistic approaches with selective use of big data methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/29/2022

Temporal Analysis and Gender Bias in Computing

Recent studies of gender bias in computing use large datasets involving ...
research
10/29/2022

Gender Bias in Computing

This paper examines the historical dimension of gender bias in the US co...
research
11/07/2022

Dynamics of Gender Bias in Computing

Gender bias in computing is a hard problem that has resisted decades of ...
research
06/13/2019

Advance gender prediction tool of first names and its use in analysing gender disparity in Computer Science in the UK, Malaysia and China

Global gender disparity in science is an unsolved problem. Predicting ge...
research
08/13/2020

On the Origin(s) of the Term "Big Data"

I investigate the origin(s) of the term "Big Data," in industry and acad...
research
02/18/2021

The Gender Pay Gap Revisited with Big Data: Do Methodological Choices Matter?

The vast majority of existing studies that estimate the average unexplai...
research
06/16/2017

Big Missing Data: are scientific memes inherited differently from gendered authorship?

This paper seeks to build upon the previous literature on gender aspects...

Please sign up or login with your details

Forgot password? Click here to reset