Quantifying Bias in Automatic Speech Recognition

03/28/2021
by   Siyuan Feng, et al.
0

Automatic speech recognition (ASR) systems promise to deliver objective interpretation of human speech. Practice and recent evidence suggests that the state-of-the-art (SotA) ASRs struggle with the large variation in speech due to e.g., gender, age, speech impairment, race, and accents. Many factors can cause the bias of an ASR system. Our overarching goal is to uncover bias in ASR systems to work towards proactive bias mitigation in ASR. This paper is a first step towards this goal and systematically quantifies the bias of a Dutch SotA ASR system against gender, age, regional accents and non-native accents. Word error rates are compared, and an in-depth phoneme-level error analysis is conducted to understand where bias is occurring. We primarily focus on bias due to articulation differences in the dataset. Based on our findings, we suggest bias mitigation strategies for ASR development.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/25/2022

Language technology practitioners as language managers: arbitrating data bias and predictive bias in ASR

Despite the fact that variation is a fundamental characteristic of natur...
research
11/18/2021

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions

It is well known that many machine learning systems demonstrate bias tow...
research
07/20/2023

A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC Videos

Automatic speech recognition (ASR) systems are designed to transcribe sp...
research
05/22/2023

Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test

Automatic speech recognition systems based on deep learning are mainly t...
research
08/23/2019

Gender Representation in French Broadcast Corpora and Its Impact on ASR Performance

This paper analyzes the gender representation in four major corpora of F...
research
08/16/2023

An Ambient Intelligence-based Approach For Longitudinal Monitoring of Verbal and Vocal Depression Symptoms

Automatic speech recognition (ASR) technology can aid in the detection, ...

Please sign up or login with your details

Forgot password? Click here to reset