Unsupervised Keyphrase Extraction by Jointly Modeling Local and Global Context

09/15/2021
by Xinnian Liang, et al.

Embedding-based methods are widely used for unsupervised keyphrase extraction (UKE) tasks. Generally, these methods simply calculate similarities between phrase embeddings and the document embedding, which is insufficient to capture different contexts for a more effective UKE model. In this paper, we propose a novel method for UKE in which local and global contexts are jointly modeled. From a global view, we calculate the similarity between a given phrase and the whole document in the vector space, as traditional embedding-based models do. For the local view, we first build a graph structure based on the document, where phrases are regarded as vertices and the edges are similarities between vertices. Then, we propose a new centrality computation method to capture local salient information based on the graph structure. Finally, we combine the modeling of global and local context for ranking. We evaluate our models on three public benchmarks (Inspec, DUC 2001, SemEval 2010) and compare them with existing state-of-the-art models. The results show that our model outperforms most models while generalizing better to input documents from different domains and of different lengths. An additional ablation study shows that both local and global information are crucial for unsupervised keyphrase extraction tasks.
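
The pipeline described in the abstract (global phrase-to-document similarity, a phrase similarity graph, a centrality score, and a combined ranking) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the new centrality computation or how the two scores are combined, so the sketch substitutes plain weighted degree centrality for the local score and a simple product for the combination. The function names `cosine` and `rank_phrases` and the toy two-dimensional vectors are hypothetical, chosen only for illustration.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_phrases(
    phrase_vecs: Dict[str, Sequence[float]],
    doc_vec: Sequence[float],
) -> List[Tuple[str, float]]:
    """Rank candidate phrases by a combined global and local score."""
    phrases = list(phrase_vecs)

    # Global view: similarity between each phrase and the whole document.
    global_score = {p: cosine(phrase_vecs[p], doc_vec) for p in phrases}

    # Local view: phrases are vertices, edge weights are pairwise similarities.
    # ASSUMPTION: weighted degree centrality (sum of edge weights) stands in
    # for the paper's centrality method, which the abstract does not define.
    local_score = {
        p: sum(cosine(phrase_vecs[p], phrase_vecs[q]) for q in phrases if q != p)
        for p in phrases
    }

    # ASSUMPTION: the two views are combined by multiplication.
    combined = {p: global_score[p] * local_score[p] for p in phrases}
    return sorted(combined.items(), key=lambda kv: kv[1], reverse=True)


# Toy example with made-up 2-D embeddings (hypothetical data).
toy_phrases = {
    "keyphrase extraction": [1.0, 0.0],
    "graph centrality": [0.8, 0.6],
    "coffee break": [0.0, 1.0],
}
toy_doc = [1.0, 0.2]
ranking = rank_phrases(toy_phrases, toy_doc)
for phrase, score in ranking:
    print(f"{phrase}: {score:.3f}")
```

In this toy setup, a phrase that is close to the document but weakly connected to other phrases can be ranked below a phrase that is slightly less similar to the document but central in the graph, which is the kind of interaction between the two views that the abstract describes.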

research
08/31/2023

Balancing between the Local and Global Structures (LGS) in Graph Embedding

We present a method for balancing between the Local and Global Structure...
research
03/23/2018

Unsupervised Keyphrase Extraction with Multipartite Graphs

We propose an unsupervised keyphrase extraction model that encodes topic...
research
01/17/2022

Topic Aware Contextualized Embeddings for High Quality Phrase Extraction

Keyphrase extraction from a given document is the task of automatically ...
research
08/01/2016

Keyphrase Extraction using Sequential Labeling

Keyphrases efficiently summarize a document's content and are used in va...
research
03/30/2022

Graph Refinement for Coreference Resolution

The state-of-the-art models for coreference resolution are based on inde...
research
11/22/2019

Topical Phrase Extraction from Clinical Reports by Incorporating both Local and Global Context

Making sense of words often requires to simultaneously examine the surro...
research
04/18/2021

Unsupervised Deep Keyphrase Generation

Keyphrase generation aims to summarize long documents with a collection ...
