ASR Performance Prediction on Unseen Broadcast Programs using Convolutional Neural Networks

04/23/2018
by Zied Elloumi, et al.

In this paper, we address a relatively new task: predicting ASR performance on unseen broadcast programs. We first propose a heterogeneous French corpus dedicated to this task. Two prediction approaches are compared: a state-of-the-art performance prediction method based on regression (engineered features) and a new strategy based on convolutional neural networks (learnt features). We focus in particular on combining textual (ASR transcription) and signal inputs. While the joint use of textual and signal features did not work for the regression baseline, combining both inputs for CNNs leads to the best WER prediction performance. We also show that our CNN predicts the WER distribution on a collection of speech recordings remarkably well.
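
For readers unfamiliar with the quantity being predicted, the following is a minimal sketch of how word error rate (WER) is conventionally computed from a reference transcript and an ASR hypothesis: the word-level edit distance divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration; the abstract does not describe the authors' scoring tool.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Illustrative only: tokenization is a plain whitespace split, with no
    text normalization (an assumption, not the paper's scoring setup).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so
    # far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)
```

Because insertions are counted, the value can exceed 1.0 when the hypothesis is much longer than the reference. A performance predictor of the kind described above would estimate this value without access to the reference transcript.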


