Right the docs: Characterising voice dataset documentation practices used in machine learning

03/19/2023
by   Kathy Reid, et al.
0

Voice-enabled technology is quickly becoming ubiquitous, and is constituted from machine learning (ML)-enabled components such as speech recognition and voice activity detection. However, these systems don't yet work well for everyone. They exhibit bias - the systematic and unfair discrimination against individuals or cohorts of individuals in favour of others (Friedman Nissembaum, 1996) - across axes such as age, gender and accent. ML is reliant on large datasets for training. Dataset documentation is designed to give ML Practitioners (MLPs) a better understanding of a dataset's characteristics. However, there is a lack of empirical research on voice dataset documentation specifically. Additionally, while MLPs are frequent participants in fairness research, little work focuses on those who work with voice data. Our work makes an empirical contribution to this gap. Here, we combine two methods to form an exploratory study. First, we undertake 13 semi-structured interviews, exploring multiple perspectives of voice dataset documentation practice. Using open and axial coding methods, we explore MLPs' practices through the lenses of roles and tradeoffs. Drawing from this work, we then purposively sample voice dataset documents (VDDs) for 9 voice datasets. Our findings then triangulate these two methods, using the lenses of MLP roles and trade-offs. We find that current VDD practices are inchoate, inadequate and incommensurate. The characteristics of voice datasets are codified in fragmented, disjoint ways that often do not meet the needs of MLPs. Moreover, they cannot be readily compared, presenting a barrier to practitioners' bias reduction efforts. We then discuss the implications of these findings for bias practices in voice data and speech technologies. We conclude by setting out a program of future work to address these findings – that is, how we may "right the docs".

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/13/2022

Exploring How Machine Learning Practitioners (Try To) Use Fairness Toolkits

Recent years have seen the development of many open-source ML fairness t...
research
06/06/2022

Understanding Machine Learning Practitioners' Data Documentation Perceptions, Needs, Challenges, and Desiderata

Data is central to the development and evaluation of machine learning (M...
research
09/13/2023

Designing Voice Interfaces to Support Mindfulness-Based Pain Management

Objective: Chronic pain is a critical public health issue affecting appr...
research
04/07/2023

About Voice: A Longitudinal Study of Speaker Recognition Dataset Dynamics

Like face recognition, speaker recognition is widely used for voice-base...
research
04/14/2021

Look at Me When I Talk to You: A Video Dataset to Enable Voice Assistants to Recognize Errors

People interacting with voice assistants are often frustrated by voice a...
research
01/13/2021

Moderation Challenges in Voice-based Online Communities on Discord

Online community moderators are on the front lines of combating problems...
research
08/04/2021

With One Voice: Composing a Travel Voice Assistant from Re-purposed Models

Voice assistants provide users a new way of interacting with digital pro...

Please sign up or login with your details

Forgot password? Click here to reset