Enhancing Keyphrase Extraction from Long Scientific Documents using Graph Embeddings

05/16/2023
by Roberto Martínez-Cruz, et al.

In this study, we investigate using graph neural network (GNN) representations to enhance contextualized representations of pre-trained language models (PLMs) for keyphrase extraction from lengthy documents. We show that augmenting a PLM with graph embeddings provides a more comprehensive semantic understanding of words in a document, particularly for long documents. We construct a co-occurrence graph of the text and embed it using a graph convolutional network (GCN) trained on the task of edge prediction. We propose a graph-enhanced sequence tagging architecture that augments contextualized PLM embeddings with graph representations. Evaluating on benchmark datasets, we demonstrate that enhancing PLMs with graph embeddings outperforms state-of-the-art models on long documents, showing significant improvements in F1 scores across all the datasets. Our study highlights the potential of GNN representations as a complementary approach to improve PLM performance for keyphrase extraction from long documents.
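
The abstract describes two mechanical steps that a small example can make concrete: building a word co-occurrence graph from the text, and augmenting each token's contextualized PLM embedding with a graph representation. The sketch below is illustrative only and is not the authors' implementation. The sliding-window definition, the `window_size` parameter, the function names, and the use of concatenation to combine the two embeddings are all assumptions, since the abstract does not specify them. Training the GCN on edge prediction is omitted; `graph_vectors` stands in for whatever node embeddings such a model would produce.

```python
from collections import defaultdict


def build_cooccurrence_graph(tokens, window_size=3):
    """Count undirected co-occurrence edges between distinct words.

    Two words are linked when they appear within the same window of
    `window_size` consecutive tokens (an assumed definition).
    Returns a dict mapping a sorted word pair to its co-occurrence count.
    """
    edges = defaultdict(int)
    for i, word in enumerate(tokens):
        # Pair the current token with the tokens that follow it in the window.
        for neighbor in tokens[i + 1 : i + window_size]:
            if word != neighbor:  # skip self-loops
                edges[tuple(sorted((word, neighbor)))] += 1
    return dict(edges)


def augment_embeddings(plm_vectors, graph_vectors, tokens):
    """Concatenate each token's PLM vector with its word's graph vector.

    Concatenation is an assumption; the abstract only says the PLM
    embeddings are augmented with graph representations.
    """
    return [
        list(plm_vec) + list(graph_vectors[token])
        for plm_vec, token in zip(plm_vectors, tokens)
    ]


tokens = ["graph", "neural", "network", "graph", "embedding"]
edges = build_cooccurrence_graph(tokens, window_size=3)

# Placeholder vectors: in practice these would come from a PLM and a trained GCN.
plm_vectors = [[0.1, 0.2] for _ in tokens]
graph_vectors = {word: [float(len(word))] for word in set(tokens)}
augmented = augment_embeddings(plm_vectors, graph_vectors, tokens)
```

Because a graph node here represents a word type rather than a single occurrence, every occurrence of a word shares the same graph vector no matter where it appears in the document. This is one plausible reading of why such representations could complement a PLM whose context window covers only part of a long document.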
