Whose AI Dream? In search of the aspiration in data annotation

03/21/2022
by   Ding Wang, et al.
0

This paper present the practice of data annotation from the perspective of the annotators. Data is fundamental to ML models. This paper investigates the work practices concerning data annotation as performed in the industry, in India. Previous investigations have largely focused on annotator subjectivity, bias and efficiency. We present a wider perspective of the data annotation, following a grounded approach, we conducted three sets of interviews with 25 annotators, 10 industry experts and 12 ML practitioners. Our results show that the work of annotators is dictated by the interests, priorities and values of others above their station. More than technical, we contend that data annotation is a systematic exercise of power through organizational structure and practice. We propose a set of implications for how we can cultivate and encourage better practice to balance the tension between the need for high quality data at low cost and the annotator aspiration for well being, career perspective, and active participation in building the AI dream.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/29/2020

Between Subjectivity and Imposition: Power Dynamics in Data Annotation for Computer Vision

The interpretation of data is fundamental to machine learning. This pape...
research
03/18/2021

Towards Productizing AI/ML Models: An Industry Perspective from Data Scientists

The transition from AI/ML models to production-ready AI-based systems is...
research
12/07/2021

Towards a Shared Rubric for Dataset Annotation

When arranging for third-party data annotation, it can be hard to compar...
research
07/14/2023

`It is currently hodgepodge”: Examining AI/ML Practitioners' Challenges during Co-production of Responsible AI Values

Recently, the AI/ML research community has indicated an urgent need to e...
research
06/06/2022

Understanding Machine Learning Practitioners' Data Documentation Perceptions, Needs, Challenges, and Desiderata

Data is central to the development and evaluation of machine learning (M...
research
06/25/2021

Semantic annotation for computational pathology: Multidisciplinary experience and best practice recommendations

Recent advances in whole slide imaging (WSI) technology have led to the ...
research
03/10/2023

Automotive Perception Software Development: An Empirical Investigation into Data, Annotation, and Ecosystem Challenges

Software that contains machine learning algorithms is an integral part o...

Please sign up or login with your details

Forgot password? Click here to reset