Lookup-Table Recurrent Language Models for Long Tail Speech Recognition

04/09/2021
by   W. Ronny Huang, et al.
0

We introduce Lookup-Table Language Models (LookupLM), a method for scaling up the size of RNN language models with only a constant increase in the floating point operations, by increasing the expressivity of the embedding table. In particular, we instantiate an (additional) embedding table which embeds the previous n-gram token sequence, rather than a single token. This allows the embedding table to be scaled up arbitrarily – with a commensurate increase in performance – without changing the token vocabulary. Since embeddings are sparsely retrieved from the table via a lookup; increasing the size of the table adds neither extra operations to each forward pass nor extra parameters that need to be stored on limited GPU/TPU memory. We explore scaling n-gram embedding tables up to nearly a billion parameters. When trained on a 3-billion sentence corpus, we find that LookupLM improves long tail log perplexity by 2.44 and long tail WER by 23.4 standard RNN language model baseline, an improvement comparable to a scaling up the baseline by 6.2x the number of floating point operations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/10/2021

NetFC: enabling accurate floating-point arithmetic on programmable switches

In-network computation has been widely used to accelerate data-intensive...
research
01/31/2023

The Power of External Memory in Increasing Predictive Model Capacity

One way of introducing sparsity into deep networks is by attaching an ex...
research
05/24/2023

Lexinvariant Language Models

Token embeddings, a mapping from discrete lexical symbols to continuous ...
research
05/05/2023

Optimized Table Tokenization for Table Structure Recognition

Extracting tables from documents is a crucial task in any document conve...
research
05/28/2020

Subword RNNLM Approximations for Out-Of-Vocabulary Keyword Search

In spoken Keyword Search, the query may contain out-of-vocabulary (OOV) ...
research
12/06/2021

SIMD-Optimized Search Over Sorted Data

Applications often require a fast, single-threaded search algorithm over...
research
09/20/2023

The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute

The Languini Kitchen serves as both a research collective and codebase d...

Please sign up or login with your details

Forgot password? Click here to reset