Microblog Hashtag Generation via Encoding Conversation Contexts

05/18/2019
by Yue Wang, et al.

Automatic hashtag annotation plays an important role in content understanding for microblog posts. To date, progress in this field has been restricted to phrase selection from limited candidates, or to word-level hashtag discovery using topic models. Unlike previous work, which treats hashtags as inseparable units, our work is the first to annotate hashtags with a sequence generation framework that views a hashtag as a short sequence of words. Moreover, to address the data sparsity issue in processing short microblog posts, we propose to jointly model the target posts and the conversation contexts they initiate using bidirectional attention. Extensive experimental results on two large-scale datasets, newly collected from English Twitter and Chinese Weibo, show that our model significantly outperforms state-of-the-art classification-based models. Further studies demonstrate that our model can effectively generate rare and even unseen hashtags, which most existing methods cannot do.
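As a rough illustration of what attending in both directions between a post and its conversation context can look like, the sketch below computes, for each post token, a weighted summary of the conversation tokens, and vice versa. It is a minimal sketch under stated assumptions, not the paper's formulation: the dot-product similarity, the toy two-dimensional vectors, and the names `attend` and `bidirectional_attention` are all hypothetical choices made for this example.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def attend(queries, keys):
    # For each query vector, return a softmax-weighted average of the key vectors.
    # Dot-product similarity is an assumption made for this sketch.
    dim = len(keys[0])
    out = []
    for q in queries:
        weights = softmax([dot(q, k) for k in keys])
        out.append([sum(w * k[j] for w, k in zip(weights, keys)) for j in range(dim)])
    return out

def bidirectional_attention(post, conv):
    # Hypothetical helper: attention in both directions between the two sequences.
    post_ctx = attend(post, conv)   # conversation summary for each post token
    conv_ctx = attend(conv, post)   # post summary for each conversation token
    return post_ctx, conv_ctx

# Toy token vectors (made up for illustration only).
post = [[1.0, 0.0], [0.0, 1.0]]
conv = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]
post_ctx, conv_ctx = bidirectional_attention(post, conv)
print(len(post_ctx), len(conv_ctx))
```

In a full model, summaries like these would typically be merged with the original encodings before being passed to a decoder that emits the hashtag word by word; that merging step is omitted here.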

research
06/06/2021

Attend and Select: A Segment Attention based Selection Mechanism for Microblog Hashtag Generation

Automatic microblog hashtag generation can help us better and faster und...
research
04/18/2021

News Meets Microblog: Hashtag Annotation via Retriever-Generator

Hashtag annotation for microblog posts has been recently formulated as a...
research
10/09/2021

Rumor Detection on Twitter with Claim-Guided Hierarchical Graph Attention Networks

Rumors are rampant in the era of social media. Conversation structures p...
research
06/18/2021

Continuity of Topic, Interaction, and Query: Learning to Quote in Online Conversations

Quotations are crucial for successful explanations and persuasions in in...
research
10/13/2022

Early Discovery of Disappearing Entities in Microblogs

We make decisions by reacting to changes in the real world, in particula...
research
08/24/2022

Diverse Title Generation for Stack Overflow Posts with Multiple Sampling Enhanced Transformer

Stack Overflow is one of the most popular programming communities where ...
research
09/06/2023

A Multimodal Analysis of Influencer Content on Twitter

Influencer marketing involves a wide range of strategies in which brands...
