Disambiguation-BERT for N-best Rescoring in Low-Resource Conversational ASR

10/05/2021
by   Pablo Ortiz, et al.

We study how to incorporate past conversational context into a CTC-based Automatic Speech Recognition (ASR) system by rescoring its N-best hypotheses with BERT language models. We introduce a data-efficient strategy to fine-tune BERT on transcript disambiguation without external data. Our results show word error rate recoveries of up to 37.2% in low-resource data domains, across language (Norwegian), tone (spontaneous, conversational), and topic (parliament proceedings and customer service phone calls). We show that the nature of the data greatly affects the performance of context-augmented N-best rescoring.
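To make the idea of context-augmented N-best rescoring concrete, here is a minimal sketch in Python. It is not the authors' implementation: the linear interpolation, the `weight` parameter, and the names `rescore_nbest` and `toy_overlap_scorer` are all assumptions made for illustration. The toy word-overlap scorer merely stands in for a fine-tuned BERT model that would score how well a hypothesis follows the preceding conversational context.

```python
from typing import Callable, List, Sequence, Tuple

# A hypothesis is (transcript, ASR log-score). Hypothetical type for this sketch.
Hypothesis = Tuple[str, float]


def rescore_nbest(
    nbest: Sequence[Hypothesis],
    context: str,
    context_scorer: Callable[[str, str], float],
    weight: float = 0.5,
) -> List[Hypothesis]:
    """Re-rank N-best hypotheses by mixing the ASR score with a context score.

    Assumption: a simple linear interpolation controlled by `weight`;
    the paper may combine scores differently.
    """
    rescored = [
        (text, (1.0 - weight) * asr_score + weight * context_scorer(context, text))
        for text, asr_score in nbest
    ]
    # Highest combined score first.
    return sorted(rescored, key=lambda hyp: hyp[1], reverse=True)


def toy_overlap_scorer(context: str, hypothesis: str) -> float:
    """Stand-in for a BERT-based scorer: fraction of hypothesis words seen in the context."""
    context_words = set(context.lower().split())
    words = hypothesis.lower().split()
    if not words:
        return 0.0
    return sum(word in context_words for word in words) / len(words)


# Two competing hypotheses; the ASR system slightly prefers the wrong one.
nbest = [("i want to pay my bell", -1.0), ("i want to pay my bill", -1.2)]
context = "your bill is due on friday"

best_text, best_score = rescore_nbest(nbest, context, toy_overlap_scorer, weight=0.8)[0]
print(best_text)
```

In this toy case the previous utterance mentions "bill", so the context score lifts the second hypothesis above the one the ASR system ranked first. With `weight=0.0` the original ASR ranking is kept unchanged.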


