A multilingual approach to joint Speech and Accent Recognition with DNN-HMM framework

10/22/2020
by Yizhou Peng, et al.

Humans can perform multi-task recognition from speech. For instance, a listener can recognize the words spoken and, at the same time, the speaker's particular accent. However, present state-of-the-art speech recognition systems can rarely do that. In this paper, we propose a multilingual approach to recognizing English speech, together with the accent the speaker conveys, using a DNN-HMM framework. Specifically, we treat different accents of English as different languages. We then merge them and train a multilingual speech recognition system. During decoding, we conduct two sets of experiments. One uses a monolingual Automatic Speech Recognition (ASR) system, with the accent information embedded only at the phone level, realizing word-based accent recognition. The other uses a multilingual ASR system, with the accent information embedded at both the word and phone levels, realizing an approximate utterance-based accent recognition.
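The idea of treating accents as languages can be illustrated with a small sketch. The abstract does not specify the tagging scheme, the accent labels, or how the utterance-level decision is approximated, so everything below is an assumption made for illustration: the `_US`/`_UK` suffixes, the helper names, and the majority vote over decoded units are all hypothetical and are not the authors' implementation.

```python
from collections import Counter


def tag_lexicon(lexicon, accent, tag_words=False):
    """Suffix every phone (and optionally every word) with an accent label.

    tag_words=False mimics the "phone level only" setup;
    tag_words=True mimics the "word and phone level" setup.
    The suffix convention is an assumption, not taken from the paper.
    """
    tagged = {}
    for word, phones in lexicon.items():
        key = f"{word}_{accent}" if tag_words else word
        tagged[key] = [f"{phone}_{accent}" for phone in phones]
    return tagged


def merge_lexicons(lexicons):
    """Merge per-accent lexicons; a word may end up with several pronunciations."""
    merged = {}
    for lexicon in lexicons:
        for word, phones in lexicon.items():
            merged.setdefault(word, []).append(phones)
    return merged


def majority_accent(tagged_units):
    """Return the most frequent accent suffix among tagged phones or words."""
    votes = Counter(unit.rsplit("_", 1)[1] for unit in tagged_units)
    return votes.most_common(1)[0][0]


# Hypothetical toy lexicon shared by both accents.
base = {"HELLO": ["HH", "AH", "L", "OW"]}

# Phone-level tagging: one word entry, two accent-specific pronunciations.
phone_level = merge_lexicons([tag_lexicon(base, "US"), tag_lexicon(base, "UK")])

# Word-level accent from the phones decoded for a single word.
word_accent = majority_accent(["HH_UK", "AH_UK", "L_US", "OW_UK"])

# Utterance-level accent from accent-tagged decoded words (majority vote assumed).
utt_accent = majority_accent(["HELLO_UK", "WORLD_UK", "AGAIN_US"])
```

With phone-level tags only, the word inventory stays shared and the accent is read off the pronunciation chosen for each word. With word-level tags as well, each decoded word carries its own accent label, which can then be pooled over the utterance.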

research
10/31/2020

Multilingual Bottleneck Features for Improving ASR Performance of Code-Switched Speech in Under-Resourced Languages

In this work, we explore the benefits of using multilingual bottleneck f...
research
09/27/2016

Multi-task Recurrent Model for True Multilingual Speech Recognition

Research on multilingual speech recognition remains attractive yet chall...
research
08/23/2019

Multilingual and Multimode Phone Recognition System for Indian Languages

The aim of this paper is to develop a flexible framework capable of auto...
research
07/11/2021

Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings

The use of phonological features (PFs) potentially allows language-speci...
research
11/13/2017

Phonemic and Graphemic Multilingual CTC Based Speech Recognition

Training automatic speech recognition (ASR) systems requires large amoun...
research
05/01/2022

Bilingual End-to-End ASR with Byte-Level Subwords

In this paper, we investigate how the output representation of an end-to...
research
09/16/2023

Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints

We propose a first step toward multilingual end-to-end automatic speech ...