DeepAI AI Chat
Log In Sign Up

Disentangling ASR and MT Errors in Speech Translation

by   Ngoc-Tien Le, et al.
Université Grenoble Alpes

The main aim of this paper is to investigate automatic quality assessment for spoken language translation (SLT). More precisely, we investigate SLT errors that can be due to transcription (ASR) or to translation (MT) modules. This paper investigates automatic detection of SLT errors using a single classifier based on joint ASR and MT features. We evaluate both 2-class (good/bad) and 3-class (good/badASR/badMT ) labeling tasks. The 3-class problem necessitates to disentangle ASR and MT errors in the speech translation output and we propose two label extraction methods for this non trivial step. This enables - as a by-product - qualitative analysis on the SLT errors and their origin (are they due to transcription or to translation step?) on our large in-house corpus for French-to-English speech translation.


page 1

page 2

page 3

page 4


The USFD Spoken Language Translation System for IWSLT 2014

The University of Sheffield (USFD) participated in the International Wor...

Enhancements in statistical spoken language translation by de-normalization of ASR results

Spoken language translation (SLT) has become very important in an increa...

A Technical Report: BUT Speech Translation Systems

The paper describes the BUT's speech translation systems. The systems ar...

Dynamic Masking for Improved Stability in Spoken Language Translation

For spoken language translation (SLT) in live scenarios such as conferen...

Segmenting Subtitles for Correcting ASR Segmentation Errors

Typical ASR systems segment the input audio into utterances using purely...

Spoken Language Translation for Polish

Spoken language translation (SLT) is becoming more important in the incr...

Linguistic Features of Genre and Method Variation in Translation: A Computational Perspective

In this paper we describe the use of text classification methods to inve...