Unsupervised Open-domain Keyphrase Generation

06/19/2023
by   Lam Thanh Do, et al.
0

In this work, we study the problem of unsupervised open-domain keyphrase generation, where the objective is a keyphrase generation model that can be built without using human-labeled data and can perform consistently across domains. To solve this problem, we propose a seq2seq model that consists of two modules, namely phraseness and informativeness module, both of which can be built in an unsupervised and open-domain fashion. The phraseness module generates phrases, while the informativeness module guides the generation towards those that represent the core concepts of the text. We thoroughly evaluate our proposed method using eight benchmark datasets from different domains. Results on in-domain datasets show that our approach achieves state-of-the-art results compared with existing unsupervised models, and overall narrows the gap between supervised and unsupervised methods down to about 16%. Furthermore, we demonstrate that our model performs consistently across domains, as it overall surpasses the baselines on out-of-domain datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/09/2019

Unsupervised Paraphrasing by Simulated Annealing

Unsupervised paraphrase generation is a promising and important research...
research
05/11/2021

Unsupervised domain adaptation via double classifiers based on high confidence pseudo label

Unsupervised domain adaptation (UDA) aims to solve the problem of knowle...
research
05/07/2018

Unpaired Multi-Domain Image Generation via Regularized Conditional GANs

In this paper, we study the problem of multi-domain image generation, th...
research
10/24/2020

Unsupervised Paraphrase Generation via Dynamic Blocking

We propose Dynamic Blocking, a decoding algorithm which enables large-sc...
research
05/26/2022

Unsupervised Reinforcement Adaptation for Class-Imbalanced Text Classification

Class imbalance naturally exists when train and test models in different...
research
05/19/2022

Self-augmented Data Selection for Few-shot Dialogue Generation

The natural language generation (NLG) module in task-oriented dialogue s...
research
07/05/2020

Unsupervised Paraphrasing via Deep Reinforcement Learning

Paraphrasing is expressing the meaning of an input sentence in different...

Please sign up or login with your details

Forgot password? Click here to reset