VAIS ASR: Building a conversational speech recognition system using language model combination

10/12/2019
by   Quang Minh Nguyen, et al.
0

Automatic Speech Recognition (ASR) systems have been evolving quickly and reaching human parity in certain cases. The systems usually perform pretty well on reading style and clean speech, however, most of the available systems suffer from situation where the speaking style is conversation and in noisy environments. It is not straight-forward to tackle such problems due to difficulties in data collection for both speech and text. In this paper, we attempt to mitigate the problems using language models combination techniques that allows us to utilize both large amount of writing style text and small number of conversation text data. Evaluation on the VLSP 2019 ASR challenges showed that our system achieved 4.85 the VLSP 2019 data sets.

READ FULL TEXT

page 1

page 2

page 3

research
04/21/2021

Accented Speech Recognition: A Survey

Automatic Speech Recognition (ASR) systems generalize poorly on accented...
research
08/07/2019

Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging

In recent years, studies on automatic speech recognition (ASR) have show...
research
09/27/2021

Challenges and Opportunities of Speech Recognition for Bengali Language

Speech recognition is a fascinating process that offers the opportunity ...
research
08/23/2021

Automatic Speech Recognition using limited vocabulary: A survey

Automatic Speech Recognition (ASR) is an active field of research due to...
research
01/16/2023

Using Kaldi for Automatic Speech Recognition of Conversational Austrian German

As dialogue systems are becoming more and more interactional and social,...
research
05/16/2022

Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data

Building inclusive speech recognition systems is a crucial step towards ...
research
10/01/2021

Improving Punctuation Restoration for Speech Transcripts via External Data

Automatic Speech Recognition (ASR) systems generally do not produce punc...

Please sign up or login with your details

Forgot password? Click here to reset