Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks

10/25/2019
by Alexandros Kastanos, et al.

The number of providers of speech transcription services has grown recently, enabling others to leverage technology they would not normally be able to use. As a result, speech-enabled solutions have become commonplace. Their success relies critically on the quality, accuracy, and reliability of the underlying speech transcription systems. These black-box systems, however, offer limited means for quality control, as typically only word sequences are available. This paper examines this limited-resource scenario for confidence estimation, a measure commonly used to assess transcription reliability. In particular, it explores what other sources of word and sub-word level information available in the transcription process could be used to improve confidence scores. To encode all such information, this paper extends lattice recurrent neural networks to handle sub-words. Experimental results using the IARPA OpenKWS 2016 evaluation system show that the use of additional information yields significant gains in confidence estimation accuracy.

research
10/30/2018

Bi-Directional Lattice Recurrent Neural Networks for Confidence Estimation

The standard approach to mitigate errors made by an automatic speech rec...
research
05/20/2018

Targeted Adversarial Examples for Black Box Audio Systems

The application of deep recurrent networks to audio transcription has le...
research
10/30/2018

Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks

The standard approach to assess reliability of automatic speech transcri...
research
12/16/2022

Fast Entropy-Based Methods of Word-Level Confidence Estimation for End-To-End Automatic Speech Recognition

This paper presents a class of new fast non-trainable entropy-based conf...
research
12/21/2020

Adjust-free adversarial example generation in speech recognition using evolutionary multi-objective optimization under black-box condition

This paper proposes a black-box adversarial attack method to automatic s...
research
07/22/2019

On Modeling ASR Word Confidence

We present a new method for computing ASR word confidences that effectiv...
research
10/23/2020

Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance

Automatic speech recognition (ASR) for under-represented named-entity (U...
