Disembodied Machine Learning: On the Illusion of Objectivity in NLP

01/28/2021
by   Zeerak Waseem, et al.
20

Machine Learning seeks to identify and encode bodies of knowledge within provided datasets. However, data encodes subjective content, which determines the possible outcomes of the models trained on it. Because such subjectivity enables marginalisation of parts of society, it is termed (social) `bias' and sought to be removed. In this paper, we contextualise this discourse of bias in the ML community against the subjective choices in the development process. Through a consideration of how choices in data and model development construct subjectivity, or biases that are represented in a model, we argue that addressing and mitigating biases is near-impossible. This is because both data and ML models are objects for which meaning is made in each step of the development pipeline, from data selection over annotation to model training and analysis. Accordingly, we find the prevalent discourse of bias limiting in its ability to address social marginalisation. We recommend to be conscientious of this, and to accept that de-biasing methods only correct for a fraction of biases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/02/2020

Social Biases in NLP Models as Barriers for Persons with Disabilities

Building equitable and inclusive NLP technologies demands consideration ...
research
06/13/2022

A Machine Learning Model for Predicting, Diagnosing, and Mitigating Health Disparities in Hospital Readmission

The management of hyperglycemia in hospitalized patients has a significa...
research
11/06/2019

Designing Evaluations of Machine Learning Models for Subjective Inference: The Case of Sentence Toxicity

Machine Learning (ML) is increasingly applied in real-life scenarios, ra...
research
04/08/2023

Connecting Fairness in Machine Learning with Public Health Equity

Machine learning (ML) has become a critical tool in public health, offer...
research
01/19/2022

On Heuristic Models, Assumptions, and Parameters

Study of the interaction between computation and society often focuses o...
research
05/28/2021

Changing the World by Changing the Data

NLP community is currently investing a lot more research and resources i...
research
06/29/2018

Bias in Semantic and Discourse Interpretation

In this paper, we show how game-theoretic work on conversation combined ...

Please sign up or login with your details

Forgot password? Click here to reset