Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test

05/22/2023
by Eungbeom Kim, et al.

Automatic speech recognition (ASR) systems based on deep learning are mainly trained under empirical risk minimization (ERM). Because ERM optimizes performance averaged over all data samples, regardless of group membership such as healthy or dysarthric speakers, ASR systems are unaware of performance disparities across groups. This results in biased ASR systems with severe performance differences among groups. In this study, we aim to improve the group robustness of ASR systems for dysarthric speakers. To this end, we present a novel approach, sample reweighting with sample affinity test (Re-SAT). Re-SAT systematically measures the debiasing helpfulness of each data sample and then mitigates the bias by reweighting samples according to that helpfulness. Experimental results demonstrate that Re-SAT improves ASR performance on dysarthric speech without degrading performance on healthy speech.
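To make the contrast between ERM and sample reweighting concrete, here is a minimal sketch in plain Python. It is not the Re-SAT implementation: the abstract does not say how the sample affinity test computes debiasing helpfulness, so the `helpfulness` scores below are hypothetical placeholders, and the per-sample losses, group labels, and function names are illustrative assumptions.

```python
from collections import defaultdict


def erm_loss(losses):
    """Plain ERM objective: the unweighted mean over all samples."""
    return sum(losses) / len(losses)


def per_group_loss(losses, groups):
    """Mean loss within each group, which exposes the disparity ERM hides."""
    buckets = defaultdict(list)
    for loss, group in zip(losses, groups):
        buckets[group].append(loss)
    return {g: sum(v) / len(v) for g, v in buckets.items()}


def reweighted_loss(losses, weights):
    """Weighted mean of per-sample losses (generic sample reweighting)."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(w * l for w, l in zip(weights, losses)) / total


# Illustrative per-sample losses: four healthy and two dysarthric utterances.
losses = [0.2, 0.3, 0.1, 0.2, 1.4, 1.6]
groups = ["healthy"] * 4 + ["dysarthric"] * 2

# Hypothetical helpfulness scores; in Re-SAT these would come from the
# sample affinity test, whose details are not given in the abstract.
helpfulness = [1.0, 1.0, 1.0, 1.0, 3.0, 3.0]

print(erm_loss(losses))                      # average over everyone
print(per_group_loss(losses, groups))        # disparity between groups
print(reweighted_loss(losses, helpfulness))  # up-weights helpful samples
```

With these made-up numbers the ERM average sits well below the dysarthric group's mean loss, while the reweighted objective moves toward it, so training under the reweighted objective would pay more attention to the samples scored as helpful.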

research
07/31/2019

Personalizing ASR for Dysarthric and Accented Speech with Limited Data

Automatic speech recognition (ASR) systems have dramatically improved ov...
research
03/28/2021

Quantifying Bias in Automatic Speech Recognition

Automatic speech recognition (ASR) systems promise to deliver objective ...
research
06/07/2023

Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based Augmentation

Despite major advancements in Automatic Speech Recognition (ASR), the st...
research
06/09/2023

A Theory of Unsupervised Speech Recognition

Unsupervised speech recognition (ASR-U) is the problem of learning autom...
research
02/25/2022

Language technology practitioners as language managers: arbitrating data bias and predictive bias in ASR

Despite the fact that variation is a fundamental characteristic of natur...
research
08/01/2022

Performance Disparities Between Accents in Automatic Speech Recognition

Automatic speech recognition (ASR) services are ubiquitous, transforming...
research
03/27/2022

Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition

Although deep learning-based end-to-end Automatic Speech Recognition (AS...
