A Highly Adaptive Acoustic Model for Accurate Multi-Dialect Speech Recognition

05/06/2022
by   Sanghyun Yoo, et al.
0

Despite the success of deep learning in speech recognition, multi-dialect speech recognition remains a difficult problem. Although dialect-specific acoustic models are known to perform well in general, they are not easy to maintain when dialect-specific data is scarce and the number of dialects for each language is large. Therefore, a single unified acoustic model (AM) that generalizes well for many dialects has been in demand. In this paper, we propose a novel acoustic modeling technique for accurate multi-dialect speech recognition with a single AM. Our proposed AM is dynamically adapted based on both dialect information and its internal representation, which results in a highly adaptive AM for handling multiple dialects simultaneously. We also propose a simple but effective training method to deal with unseen dialects. The experimental results on large scale speech datasets show that the proposed AM outperforms all the previous ones, reducing word error rates (WERs) by 8.11 relative compared to a single all-dialects AM and by 7.31 dialect-specific AMs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/07/2021

Improving Speech Recognition Accuracy of Local POI Using Geographical Models

Nowadays voice search for points of interest (POI) is becoming increasin...
research
04/01/2021

Interactive spatial speech recognition maps based on simulated speech recognition experiments

In their everyday life, the speech recognition performance of human list...
research
11/08/2012

Multi-input Multi-output Beta Wavelet Network: Modeling of Acoustic Units for Speech Recognition

In this paper, we propose a novel architecture of wavelet network called...
research
07/19/2017

Dynamic Layer Normalization for Adaptive Neural Acoustic Modeling in Speech Recognition

Layer normalization is a recently introduced technique for normalizing t...
research
10/18/2021

Analysis of French Phonetic Idiosyncrasies for Accent Recognition

Speech recognition systems have made tremendous progress since the last ...
research
08/02/2018

Linguistic Search Optimization for Deep Learning Based LVCSR

Recent advances in deep learning based large vocabulary con- tinuous spe...
research
06/27/2017

Acoustic Modeling Using a Shallow CNN-HTSVM Architecture

High-accuracy speech recognition is especially challenging when large da...

Please sign up or login with your details

Forgot password? Click here to reset