Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities

07/22/2022
by   Pranav Dheram, et al.
4

As for other forms of AI, speech recognition has recently been examined with respect to performance disparities across different user cohorts. One approach to achieve fairness in speech recognition is to (1) identify speaker cohorts that suffer from subpar performance and (2) apply fairness mitigation measures targeting the cohorts discovered. In this paper, we report on initial findings with both discovery and mitigation of performance disparities using data from a product-scale AI assistant speech recognition system. We compare cohort discovery based on geographic and demographic information to a more scalable method that groups speakers without human labels, using speaker embedding technology. For fairness mitigation, we find that oversampling of underrepresented cohorts, as well as modeling speaker cohort membership by additional input variables, reduces the gap between top- and bottom-performing cohorts, without deteriorating overall recognition accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2023

Improving Fairness and Robustness in End-to-End Speech Recognition through unsupervised clustering

The challenge of fairness arises when Automatic Speech Recognition (ASR)...
research
09/25/2019

Speech Recognition with Augmented Synthesized Speech

Recent success of the Tacotron speech synthesis architecture and its var...
research
08/17/2013

Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition

This paper is concerned with the development of Back-propagation Neural ...
research
03/15/2022

Privacy-Preserving Speech Representation Learning using Vector Quantization

With the popularity of virtual assistants (e.g., Siri, Alexa), the use o...
research
10/05/2016

Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models

A Pascal challenge entitled monaural multi-talker speech recognition was...
research
06/22/2016

A segmental framework for fully-unsupervised large-vocabulary speech recognition

Zero-resource speech technology is a growing research area that aims to ...
research
07/17/2023

ivrit.ai: A Comprehensive Dataset of Hebrew Speech for AI Research and Development

We introduce "ivrit.ai", a comprehensive Hebrew speech dataset, addressi...

Please sign up or login with your details

Forgot password? Click here to reset