Towards Continual Entity Learning in Language Models for Conversational Agents

07/30/2021
by   Ravi Teja Gadde, et al.

Neural language models (LMs) trained on diverse corpora are known to work well on previously seen entities. However, updating these models with dynamically changing entities such as place names, song titles, and shopping items requires re-training from scratch and collecting full sentences that contain these entities. We address this issue by introducing entity-aware language models (EALM), in which entity models trained on catalogues of entities are integrated into a pre-trained LM. The combined language model adaptively adds information from the entity models to the pre-trained LM depending on the sentence context. Because the entity models can be updated independently of the pre-trained LM, we can influence the distribution of entities output by the final LM without any further training of the pre-trained LM. We show significant perplexity improvements on task-oriented dialogue datasets, especially on long-tailed utterances, along with an ability to continually adapt to new entities (to an extent).
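
The abstract does not spell out how the entity models are combined with the pre-trained LM, so the following is only a toy sketch of the general idea, assuming a simple gated mixture of two next-word distributions. Every name here (`catalogue_to_probs`, `context_gate`, `combine`, `TRIGGERS`, the example words and counts) is hypothetical and chosen for illustration; in particular, the hand-written gate stands in for whatever context-dependent mechanism the paper actually learns.

```python
import math

# Hypothetical trigger words after which an entity is likely to follow.
TRIGGERS = {"play", "buy", "navigate"}


def catalogue_to_probs(catalogue_counts):
    """Turn an entity catalogue (entity -> count) into a probability table."""
    total = sum(catalogue_counts.values())
    return {entity: count / total for entity, count in catalogue_counts.items()}


def context_gate(context, trigger_words, sharpness=4.0):
    """Toy stand-in for a learned gate: a sigmoid that is close to 1
    when the last context token is a trigger word and close to 0 otherwise."""
    last = context[-1] if context else ""
    score = sharpness if last in trigger_words else -sharpness
    return 1.0 / (1.0 + math.exp(-score))


def combine(base_probs, entity_probs, gate):
    """Mix the base LM and entity distributions with weight `gate` in [0, 1]."""
    if not 0.0 <= gate <= 1.0:
        raise ValueError("gate must lie in [0, 1]")
    vocab = set(base_probs) | set(entity_probs)
    return {
        word: (1.0 - gate) * base_probs.get(word, 0.0)
        + gate * entity_probs.get(word, 0.0)
        for word in vocab
    }


def next_word_probs(context, base_probs, catalogue_counts):
    """Next-word distribution of the combined model for a given context."""
    gate = context_gate(context, TRIGGERS)
    return combine(base_probs, catalogue_to_probs(catalogue_counts), gate)


# Frozen "pre-trained LM" output (made-up numbers, never modified below).
base_probs = {"the": 0.5, "music": 0.3, "yesterday": 0.2}

# Two versions of a made-up song catalogue; v2 adds a new entity.
catalogue_v1 = {"yesterday": 3, "imagine": 1}
catalogue_v2 = {"yesterday": 3, "imagine": 1, "levitating": 4}

p_v1 = next_word_probs(["play"], base_probs, catalogue_v1)
p_v2 = next_word_probs(["play"], base_probs, catalogue_v2)
```

In this sketch, swapping `catalogue_v1` for `catalogue_v2` gives the new entity probability mass after the word "play" while `base_probs` stays untouched, which mirrors the claim that the entity models can be updated without further training of the pre-trained LM.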


research
02/27/2022

A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models

Pre-trained language models (PLMs) cannot well recall rich factual knowl...
research
05/13/2018

Building Language Models for Text with Named Entities

Text in many domains involves a significant amount of named entities. Pr...
research
02/03/2022

Towards Coherent and Consistent Use of Entities in Narrative Generation

Large pre-trained language models (LMs) have demonstrated impressive cap...
research
10/28/2021

A Sequence to Sequence Model for Extracting Multiple Product Name Entities from Dialog

E-commerce voice ordering systems need to recognize multiple product nam...
research
09/14/2023

Leveraging Contextual Information for Effective Entity Salience Detection

In text documents such as news articles, the content and key events usua...
research
10/05/2022

"No, they did not": Dialogue response dynamics in pre-trained language models

A critical component of competence in language is being able to identify...
research
04/22/2021

A Short Survey of Pre-trained Language Models for Conversational AI-A NewAge in NLP

Building a dialogue system that can communicate naturally with humans is...
