A Technical Report: BUT Speech Translation Systems

10/22/2020
by Hari Krishna Vydana, et al.

This paper describes BUT's English⟶German offline speech translation systems, which build on our previous work <cit.>. Although End-to-End and cascade (ASR-MT) spoken language translation (SLT) systems are reaching comparable performance, a large degradation is observed when translating ASR hypotheses compared to the oracle input text. To reduce this degradation, we jointly train the ASR and MT modules with the ASR objective as an auxiliary loss. The two networks are connected through neural hidden representations. The resulting model has an End-to-End differentiable path with respect to the final objective function and also uses the ASR objective for better optimization. During inference, both modules (i.e., ASR and MT) are connected through the hidden representations corresponding to the n-best hypotheses. Ensembling with independently trained ASR and MT models further improves the performance of the system.
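
The joint training objective described above can be illustrated with a minimal sketch. It assumes the MT loss and the auxiliary ASR loss are both token-level cross-entropies combined as a weighted sum; the function names, the additive form, and the value of `asr_weight` are assumptions made for illustration and are not taken from the report.

```python
import math


def cross_entropy(probs, targets):
    """Mean negative log-likelihood of the target token ids.

    probs:   one probability distribution (list of floats) per output step
    targets: the reference token id for each step
    """
    return -sum(math.log(step[t]) for step, t in zip(probs, targets)) / len(targets)


def joint_loss(mt_probs, mt_targets, asr_probs, asr_targets, asr_weight=0.3):
    """Translation loss plus a weighted auxiliary ASR loss.

    asr_weight is a hypothetical hyperparameter; the report does not state
    how the two objectives are weighted.
    """
    mt = cross_entropy(mt_probs, mt_targets)      # final (translation) objective
    asr = cross_entropy(asr_probs, asr_targets)   # auxiliary (transcription) objective
    return mt + asr_weight * asr


# Toy example: one decoding step for each module over a two-token vocabulary.
loss = joint_loss(
    mt_probs=[[0.5, 0.5]], mt_targets=[0],
    asr_probs=[[0.25, 0.75]], asr_targets=[0],
    asr_weight=0.5,
)
```

In a real system both terms would be computed from the same forward pass, so gradients of the translation loss also flow back through the hidden representations into the ASR module.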
