Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition

05/22/2023
by   Hong Liu, et al.
0

Energy-based language models (ELMs) parameterize an unnormalized distribution for natural sentences and are radically different from popular autoregressive language models (ALMs). As an important application, ELMs have been successfully used as a means for calculating sentence scores in speech recognition, but they all use less-modern CNN or LSTM networks. The recent progress in Transformer networks and large pretrained models such as BERT and GPT2 opens new possibility to further advancing ELMs. In this paper, we explore different architectures of energy functions and different training methods to investigate the capabilities of ELMs in rescoring for speech recognition, all using large pretrained models as backbones.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/11/2019

Long-span language modeling for speech recognition

We explore neural language modeling for speech recognition where the con...
research
09/19/2023

End-to-End Speech Recognition Contextualization with Large Language Models

In recent years, Large Language Models (LLMs) have garnered significant ...
research
07/07/2021

Improving Speech Recognition Accuracy of Local POI Using Geographical Models

Nowadays voice search for points of interest (POI) is becoming increasin...
research
10/31/2019

Pseudolikelihood Reranking with Masked Language Models

We rerank with scores from pretrained masked language models like BERT t...
research
10/11/2022

PatternRank: Leveraging Pretrained Language Models and Part of Speech for Unsupervised Keyphrase Extraction

Keyphrase extraction is the process of automatically selecting a small s...
research
09/04/2023

A Comparative Analysis of Pretrained Language Models for Text-to-Speech

State-of-the-art text-to-speech (TTS) systems have utilized pretrained l...
research
07/02/2023

Conformer LLMs – Convolution Augmented Large Language Models

This work builds together two popular blocks of neural architecture, nam...

Please sign up or login with your details

Forgot password? Click here to reset