Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation

04/08/2019
by   Yerbolat Khassanov, et al.
0

The neural language models (NLM) achieve strong generalization capability by learning the dense representation of words and using them to estimate probability distribution function. However, learning the representation of rare words is a challenging problem causing the NLM to produce unreliable probability estimates. To address this problem, we propose a method to enrich representations of rare words in pre-trained NLM and consequently improve its probability estimation performance. The proposed method augments the word embedding matrices of pre-trained NLM while keeping other parameters unchanged. Specifically, our method updates the embedding vectors of rare words using embedding vectors of other semantically and syntactically similar words. To evaluate the proposed method, we enrich the rare street names in the pre-trained NLM and use it to rescore 100-best hypotheses output from the Singapore English speech recognition system. The enriched NLM reduces the word error rate by 6 words by 16

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/12/2021

Learning to Remove: Towards Isotropic Pre-trained BERT Embedding

Pre-trained language models such as BERT have become a more common choic...
research
10/27/2022

Leveraging knowledge graphs to update scientific word embeddings using latent semantic imputation

The most interesting words in scientific texts will often be novel or ra...
research
08/20/2016

Using the Output Embedding to Improve Language Models

We study the topmost weight matrix of neural network language models. We...
research
12/13/2018

An Unbiased Approach to Quantification of Gender Inclination using Interpretable Word Representations

Recent advances in word embedding provide significant benefit to various...
research
06/10/2019

Embedding Imputation with Grounded Language Information

Due to the ubiquitous use of embeddings as input representations for a w...
research
05/23/2019

Detecting Malicious PowerShell Scripts Using Contextual Embeddings

PowerShell is a command line shell, that is widely used in organizations...
research
02/22/2018

Learning Topic Models by Neighborhood Aggregation

Topic models are one of the most frequently used models in machine learn...

Please sign up or login with your details

Forgot password? Click here to reset