Dialectal Speech Recognition and Translation of Swiss German Speech to Standard German Text: Microsoft's Submission to SwissText 2021

06/15/2021
by Yuriy Arabskyy, et al.

This paper describes the winning approach in Shared Task 3 at SwissText 2021 on Swiss German Speech to Standard German Text, a public competition on dialect recognition and translation. Swiss German refers to the multitude of Alemannic dialects spoken in the German-speaking parts of Switzerland. It differs significantly from Standard German in pronunciation, vocabulary and grammar, is mostly incomprehensible to native speakers of Standard German, and lacks a standardized written form. To solve this challenging task, we propose a hybrid automatic speech recognition system with a lexicon that incorporates translations, a first-pass language model that handles Swiss German particularities, a transfer-learned acoustic model, and a strong neural language model for second-pass rescoring. Our submission reaches 46.04 on a blind conversational test set and outperforms the second-best competitor by a 12
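To make the second-pass rescoring idea concrete, here is a minimal sketch of n-best rescoring in Python. It is not the authors' implementation. The `Hypothesis` class, the `rescore_nbest` function, the `lm_weight` parameter and the toy scoring function are all hypothetical names introduced for illustration, and the toy scorer stands in for a neural language model. The general pattern is that a first pass produces candidate transcripts with scores, and a stronger language model re-ranks them.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Hypothesis:
    text: str
    first_pass_score: float  # log-domain score from the first pass (assumed)


def rescore_nbest(
    nbest: List[Hypothesis],
    lm_log_prob: Callable[[str], float],
    lm_weight: float = 0.5,
) -> List[Hypothesis]:
    """Re-rank hypotheses by first-pass score plus weighted second-pass LM score."""
    def combined(h: Hypothesis) -> float:
        return h.first_pass_score + lm_weight * lm_log_prob(h.text)

    return sorted(nbest, key=combined, reverse=True)


def toy_lm_log_prob(text: str) -> float:
    # Hypothetical stand-in for a neural LM: words from a tiny Standard German
    # vocabulary are cheap, everything else is penalized.
    known = {"ich", "gehe", "nach", "hause"}
    return sum(-1.0 if w in known else -5.0 for w in text.lower().split())


# Made-up first-pass scores, for illustration only.
nbest = [
    Hypothesis("ich gang hei", -10.0),
    Hypothesis("ich gehe nach hause", -10.5),
]
best = rescore_nbest(nbest, toy_lm_log_prob)[0]
print(best.text)
```

In this toy setup the first pass slightly prefers the dialectal spelling, while the second-pass score shifts the ranking toward the Standard German transcript. Setting `lm_weight=0.0` restores the first-pass ordering.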
