Evolutionary optimization of contexts for phonetic correction in speech recognition systems

02/23/2021
by   Rafael Viana-Cámara, et al.
0

Automatic Speech Recognition (ASR) is an area of growing academic and commercial interest due to the high demand for applications that use it to provide a natural communication method. It is common for general purpose ASR systems to fail in applications that use a domain-specific language. Various strategies have been used to reduce the error, such as providing a context that modifies the language model and post-processing correction methods. This article explores the use of an evolutionary process to generate an optimized context for a specific application domain, as well as different correction techniques based on phonetic distance metrics. The results show the viability of a genetic algorithm as a tool for context optimization, which, added to a post-processing correction based on phonetic representations, can reduce the errors on the recognized speech.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/12/2021

Hybrid phonetic-neural model for correction in speech recognition systems

Automatic speech recognition (ASR) is a relevant area in multiple settin...
research
02/18/2021

Fixing Errors of the Google Voice Recognizer through Phonetic Distance Metrics

Speech recognition systems for the Spanish language, such as Google's, p...
research
03/26/2021

BART based semantic correction for Mandarin automatic speech recognition system

Although automatic speech recognition (ASR) systems achieved significant...
research
02/07/2018

Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling

Automatic speech recognition (ASR) systems lack joint optimization durin...
research
01/02/2018

A Novel Approach to Skew-Detection and Correction of English Alphabets for OCR

Optical Character Recognition has been a challenging field in the advent...
research
05/26/2023

DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction

Conversational speech often consists of deviations from the speech plan,...
research
05/26/2022

Clinical Dialogue Transcription Error Correction using Seq2Seq Models

Good communication is critical to good healthcare. Clinical dialogue is ...

Please sign up or login with your details

Forgot password? Click here to reset