About Voice: A Longitudinal Study of Speaker Recognition Dataset Dynamics

04/07/2023
by Casandra Rusti, et al.

Like face recognition, speaker recognition is widely used for voice-based biometric identification across a broad range of industries, including banking, education, recruitment, immigration, law enforcement, healthcare, and well-being. However, while dataset evaluations and audits have improved data practices in computer vision and face recognition, data practices in speaker recognition have gone largely unquestioned. Our research aims to address this gap by exploring how dataset usage has evolved over time and what implications this has for bias and fairness in speaker recognition systems. Previous studies have demonstrated the presence of historical, representation, and measurement biases in popular speaker recognition benchmarks. In this paper, we present a longitudinal study of speaker recognition datasets used for training and evaluation from 2012 to 2021. We survey close to 700 papers to investigate community adoption of datasets and changes in usage over a crucial period in which speaker recognition approaches transitioned to deep neural networks. Our study identifies the most commonly used datasets in the field, examines their usage patterns, and assesses the dataset attributes that affect bias, fairness, and other ethical concerns. Our findings suggest areas for further research on the ethics and fairness of speaker recognition technology.


