Domain Robust Feature Extraction for Rapid Low Resource ASR Development

07/28/2018
by   Siddharth Dalmia, et al.
0

Developing a practical speech recognizer for a low resource language is challenging, not only because of the (potentially unknown) properties of the language, but also because test data may not be from the same domain as the available training data. In this paper, we focus on the latter challenge, i.e. domain mismatch, for systems trained using a sequence-based criterion. We demonstrate the effectiveness of using a pre-trained English recognizer which is robust to such mismatched conditions, and use it as a domain normalizing feature extractor on a low resource language, like Turkish. This enables rapid development of speech recognizers for new languages which can easily adapt to any domain. Testing in various cross-domain scenarios, we achieve relative improvements of around 25 around 50

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/04/2021

Voice Conversion Can Improve ASR in Very Low-Resource Settings

Voice conversion (VC) has been proposed to improve speech recognition sy...
research
03/16/2022

Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for Low-Resource Language Translation?

What can pre-trained multilingual sequence-to-sequence models like mBART...
research
10/09/2016

Enabling Medical Translation for Low-Resource Languages

We present research towards bridging the language gap between migrant wo...
research
12/22/2020

Applying wav2vec2.0 to Speech Recognition in various low-resource languages

Several domains own corresponding widely used feature extractors, such a...
research
10/31/2022

Mining Word Boundaries in Speech as Naturally Annotated Word Segmentation Data

Chinese word segmentation (CWS) models have achieved very high performan...
research
09/13/2021

Joint prediction of truecasing and punctuation for conversational speech in low-resource scenarios

Capitalization and punctuation are important cues for comprehending writ...
research
06/25/2018

Robust Feature Clustering for Unsupervised Speech Activity Detection

In certain applications such as zero-resource speech processing or very-...

Please sign up or login with your details

Forgot password? Click here to reset