Topic-Aware Neural Keyphrase Generation for Social Media Language

06/10/2019
by   Yue Wang, et al.

A huge volume of user-generated content is produced daily on social media. To facilitate automatic language understanding, we study keyphrase prediction, which distills salient information from massive numbers of posts. While most existing methods extract words from source posts to form keyphrases, we propose a sequence-to-sequence (seq2seq) based neural keyphrase generation framework that can also produce absent keyphrases, i.e., keyphrases that do not appear in the source post. Moreover, our model is topic-aware: it jointly models corpus-level latent topic representations, which helps alleviate the data sparsity that is widely exhibited in social media language. Experiments on three datasets collected from English and Chinese social media platforms show that our model significantly outperforms both extraction and generation models that do not exploit latent topics. Further discussion shows that our model learns meaningful topics, which helps explain its superiority in social media keyphrase generation.
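
The distinction between keyphrases that an extraction method can recover and "absent" keyphrases that only a generation model can produce may be easier to see with a small sketch. The Python below is illustrative only and is not the authors' code: the helper names `is_present` and `split_keyphrases`, the example post, and the matching rule (lowercased, whitespace-tokenized, contiguous match) are all assumptions made for this example.

```python
from typing import List, Tuple


def is_present(keyphrase: str, post: str) -> bool:
    """Return True if the keyphrase tokens occur contiguously in the post.

    Assumption: matching is done on lowercased, whitespace-split tokens.
    Real systems may also apply stemming or other normalization.
    """
    phrase = keyphrase.lower().split()
    tokens = post.lower().split()
    n = len(phrase)
    if n == 0:
        return False
    # Slide a window of length n over the post tokens.
    return any(tokens[i:i + n] == phrase for i in range(len(tokens) - n + 1))


def split_keyphrases(keyphrases: List[str], post: str) -> Tuple[List[str], List[str]]:
    """Split gold keyphrases into (present, absent) with respect to the post."""
    present = [k for k in keyphrases if is_present(k, post)]
    absent = [k for k in keyphrases if not is_present(k, post)]
    return present, absent


# Hypothetical post and keyphrases, invented for illustration.
post = "so excited for the super bowl halftime show tonight"
present, absent = split_keyphrases(["super bowl", "nfl"], post)
print(present)  # keyphrases an extraction method could copy from the post
print(absent)   # keyphrases that must be generated
```

Under these assumptions, an extraction-only method can at best return the phrases in `present`, while the phrases in `absent` are reachable only by a model that generates words not found in the post.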

research
09/06/2021

Detecting Inspiring Content on Social Media

Inspiration moves a person to see new possibilities and transforms the w...
research
11/11/2016

UTCNN: a Deep Learning Model of Stance Classification on Social Media Text

Most neural network models for document classification on social media f...
research
10/15/2021

Modeling Proficiency with Implicit User Representations

We introduce the problem of proficiency modeling: Given a user's posts o...
research
08/19/2023

HICL: Hashtag-Driven In-Context Learning for Social Media Natural Language Understanding

Natural language understanding (NLU) is integral to various social media...
research
10/22/2018

Sparsemax and Relaxed Wasserstein for Topic Sparsity

Topic sparsity refers to the observation that individual documents usual...
research
05/24/2023

Topic-Guided Self-Introduction Generation for Social Media Users

Millions of users are active on social media. To allow users to better s...
research
10/11/2022

Time-aware topic identification in social media with pre-trained language models: A case study of electric vehicles

Recent extensively competitive business environment makes companies to k...