ASR Error Correction and Domain Adaptation Using Machine Translation

03/13/2020
by   Anirudh Mani, et al.
7

Off-the-shelf pre-trained Automatic Speech Recognition (ASR) systems are an increasingly viable service for companies of any size building speech-based products. While these ASR systems are trained on large amounts of data, domain mismatch is still an issue for many such parties that want to use this service as-is leading to not so optimal results for their task. We propose a simple technique to perform domain adaptation for ASR error correction via machine translation. The machine translation model is a strong candidate to learn a mapping from out-of-domain ASR errors to in-domain terms in the corresponding reference files. We use two off-the-shelf ASR systems in this work: Google ASR (commercial) and the ASPIRE model (open-source). We observe 7 improvement in word error rate and 4 point absolute improvement in BLEU score in Google ASR output via our proposed method. We also evaluate ASR error correction via a downstream task of Speaker Diarization that captures speaker style, syntax, structure and semantic improvements we obtain via ASR correction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/24/2022

Unsupervised domain adaptation for speech recognition with unsupervised error correction

The transcription quality of automatic speech recognition (ASR) systems ...
research
07/09/2023

Can Generative Large Language Models Perform ASR Error Correction?

ASR error correction continues to serve as an important part of post-pro...
research
06/01/2023

Adapting an Unadaptable ASR System

As speech recognition model sizes and training data requirements grow, i...
research
06/24/2020

Black-box Adaptation of ASR for Accented Speech

We introduce the problem of adapting a black-box, cloud-based ASR system...
research
09/10/2021

Remember the context! ASR slot error correction through memorization

Accurate recognition of slot values such as domain specific words or nam...
research
10/31/2022

DiaCorrect: End-to-end error correction for speaker diarization

In recent years, speaker diarization has attracted widespread attention....
research
09/30/2022

A forensic analysis of the Google Home: repairing compressed data without error correction

This paper provides a detailed explanation of the steps taken to extract...

Please sign up or login with your details

Forgot password? Click here to reset