Bias in Automated Speaker Recognition

01/24/2022
by Wiebke Toussaint, et al.

Automated speaker recognition uses data processing to identify speakers by their voice. Today, automated speaker recognition technologies are deployed on billions of smart devices and in services such as call centres. Despite their wide-scale deployment and known sources of bias in face recognition and natural language processing, bias in automated speaker recognition has not been studied systematically. We present an in-depth empirical and analytical study of bias in the machine learning development workflow of speaker verification, a voice biometric and core task in automated speaker recognition. Drawing on an established framework for understanding sources of harm in machine learning, we show that bias exists at every development stage in the well-known VoxCeleb Speaker Recognition Challenge, including model building, implementation, and data generation. Most affected are female speakers and non-US nationalities, who experience significant performance degradation. Leveraging the insights from our findings, we make practical recommendations for mitigating bias in automated speaker recognition, and outline future research directions.
