Non-native children speech recognition through transfer learning

09/25/2018
by   Marco Matassoni, et al.
0

This work deals with non-native children's speech and investigates both multi-task and transfer learning approaches to adapt a multi-language Deep Neural Network (DNN) to speakers, specifically children, learning a foreign language. The application scenario is characterized by young students learning English and German and reading sentences in these second-languages, as well as in their mother language. The paper analyzes and discusses techniques for training effective DNN-based acoustic models starting from children native speech and performing adaptation with limited non-native audio material. A multi-lingual model is adopted as baseline, where a common phonetic lexicon, defined in terms of the units of the International Phonetic Alphabet (IPA), is shared across the three languages at hand (Italian, German and English); DNN adaptation methods based on transfer learning are evaluated on significant non-native evaluation sets. Results show that the resulting non-native models allow a significant improvement with respect to a mono-lingual system adapted to speakers of the target language.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/29/2023

Automatic Speech Recognition of Non-Native Child Speech for Language Learning Applications

Voicebots have provided a new avenue for supporting the development of l...
research
04/13/2021

Experiments of ASR-based mispronunciation detection for children and adult English learners

Pronunciation is one of the fundamentals of language learning, and it is...
research
02/27/2022

A Multimodal German Dataset for Automatic Lip Reading Systems and Transfer Learning

Large datasets as required for deep learning of lip reading do not exist...
research
03/11/2022

Improving the transferability of speech separation by meta-learning

Speech separation aims to separate multiple speech sources from a speech...
research
10/26/2020

Effect of Language Proficiency on Subjective Evaluation of Noise Suppression Algorithms

Speech communication systems based on Voice-over-IP technology are frequ...
research
10/06/2017

The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments

This paper introduces the contents and the possible usage of the DIRHA-E...

Please sign up or login with your details

Forgot password? Click here to reset