Can Generative Large Language Models Perform ASR Error Correction?

07/09/2023
by   Rao Ma, et al.
0

ASR error correction continues to serve as an important part of post-processing for speech recognition systems. Traditionally, these models are trained with supervised training using the decoding results of the underlying ASR system and the reference text. This approach is computationally intensive and the model needs to be re-trained when switching the underlying ASR model. Recent years have seen the development of large language models and their ability to perform natural language processing tasks in a zero-shot manner. In this paper, we take ChatGPT as an example to examine its ability to perform ASR error correction in the zero-shot or 1-shot settings. We use the ASR N-best list as model input and propose unconstrained error correction and N-best constrained error correction methods. Results on a Conformer-Transducer model and the pre-trained Whisper model show that we can largely improve the ASR system performance with error correction using the powerful ChatGPT model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/01/2023

N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space

Error correction models form an important part of Automatic Speech Recog...
research
03/25/2023

An Analysis of GPT-3's Performance in Grammatical Error Correction

GPT-3 models are very powerful, achieving high performance on a variety ...
research
09/18/2023

HTEC: Human Transcription Error Correction

High-quality human transcription is essential for training and improving...
research
05/29/2023

Exploring Effectiveness of GPT-3 in Grammatical Error Correction: A Study on Performance and Controllability in Prompt-Based Methods

Large-scale pre-trained language models such as GPT-3 have shown remarka...
research
07/19/2023

Enhancing conversational quality in language learning chatbots: An evaluation of GPT4 for ASR error correction

The integration of natural language processing (NLP) technologies into e...
research
03/13/2020

ASR Error Correction and Domain Adaptation Using Machine Translation

Off-the-shelf pre-trained Automatic Speech Recognition (ASR) systems are...
research
03/16/2023

Visual Information Matters for ASR Error Correction

Aiming to improve the Automatic Speech Recognition (ASR) outputs with a ...

Please sign up or login with your details

Forgot password? Click here to reset