GNN-LM: Language Modeling based on Global Contexts via GNN

10/17/2021
by   Yuxian Meng, et al.
0

Inspired by the notion that “to copy is easier than to memorize“, in this work, we introduce GNN-LM, which extends the vanilla neural language model (LM) by allowing to reference similar contexts in the entire training corpus. We build a directed heterogeneous graph between an input context and its semantically related neighbors selected from the training corpus, where nodes are tokens in the input context and retrieved neighbor contexts, and edges represent connections between nodes. Graph neural networks (GNNs) are constructed upon the graph to aggregate information from similar contexts to decode the token. This learning paradigm provides direct access to the reference contexts and helps improve a model's generalization ability. We conduct comprehensive experiments to validate the effectiveness of the GNN-LM: GNN-LM achieves a new state-of-the-art perplexity of 14.8 on WikiText-103 (a 4.5 point improvement over its counterpart of the vanilla LM model) and shows substantial improvement on One Billion Word and Enwiki8 datasets against strong baselines. In-depth ablation studies are performed to understand the mechanics of GNN-LM.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/05/2022

GNN-SL: Sequence Labeling Based on Nearest Examples via GNN

To better handle long-tail cases in the sequence labeling (SL) task, in ...
research
06/24/2021

Visualizing Graph Neural Networks with CorGIE: Corresponding a Graph to Its Embedding

Graph neural networks (GNNs) are a class of powerful machine learning to...
research
08/19/2020

Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters

Graph Neural Networks (GNNs) have been widely applied to fraud detection...
research
06/09/2019

Redundancy-Free Computation Graphs for Graph Neural Networks

Graph Neural Networks (GNNs) are based on repeated aggregations of infor...
research
08/02/2021

Shift-Robust GNNs: Overcoming the Limitations of Localized Graph Training data

There has been a recent surge of interest in designing Graph Neural Netw...
research
09/19/2023

Graph Neural Networks for Dynamic Modeling of Roller Bearing

In the presented work, we propose to apply the framework of graph neural...
research
08/18/2023

Bridged-GNN: Knowledge Bridge Learning for Effective Knowledge Transfer

The data-hungry problem, characterized by insufficiency and low-quality ...

Please sign up or login with your details

Forgot password? Click here to reset