DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction

05/26/2023
by   Vineet Bhat, et al.
0

Conversational speech often consists of deviations from the speech plan, producing disfluent utterances that affect downstream NLP tasks. Removing these disfluencies is necessary to create fluent and coherent speech. This paper presents DisfluencyFixer, a tool that performs speech-to-speech disfluency correction in English and Hindi using a pipeline of Automatic Speech Recognition (ASR), Disfluency Correction (DC) and Text-To-Speech (TTS) models. Our proposed system removes disfluencies from input speech and returns fluent speech as output along with its transcript, disfluency type and total disfluency count in source utterance, providing a one-stop destination for language learners to improve the fluency of their speech. We evaluate the performance of our tool subjectively and receive scores of 4.26, 4.29 and 4.42 out of 5 in ASR performance, DC performance and ease-of-use of the system. Our tool can be accessed openly at the following link.

READ FULL TEXT

page 1

page 2

research
06/10/2023

Adversarial Training For Low-Resource Disfluency Correction

Disfluencies commonly occur in conversational speech. Speech with disflu...
research
04/09/2020

Improving Readability for Automatic Speech Recognition Transcription

Modern Automatic Speech Recognition (ASR) systems can achieve high perfo...
research
09/08/2022

Goodness of Pronunciation Pipelines for OOV Problem

In the following report we propose pipelines for Goodness of Pronunciati...
research
02/23/2021

Evolutionary optimization of contexts for phonetic correction in speech recognition systems

Automatic Speech Recognition (ASR) is an area of growing academic and co...
research
07/05/2023

Flowchase: a Mobile Application for Pronunciation Training

In this paper, we present a solution for providing personalized and inst...
research
02/10/2021

NUVA: A Naming Utterance Verifier for Aphasia Treatment

Anomia (word-finding difficulties) is the hallmark of aphasia, an acquir...
research
05/04/2022

Design of a novel Korean learning application for efficient pronunciation correction

The Korean wave, which denotes the global popularity of South Korea's cu...

Please sign up or login with your details

Forgot password? Click here to reset