Multi-task Pairwise Neural Ranking for Hashtag Segmentation

by   Mounica Maddela, et al.

Hashtags are often employed on social media and beyond to add metadata to a textual utterance with the goal of increasing discoverability, aiding search, or providing additional semantics. However, the semantic content of hashtags is not straightforward to infer as these represent ad-hoc conventions which frequently include multiple words joined together and can include abbreviations and unorthodox spellings. We build a dataset of 12,594 hashtags split into individual segments and propose a set of approaches for hashtag segmentation by framing it as a pairwise ranking problem between candidate segmentations. Our novel neural approaches demonstrate 24.6 segmentation accuracy compared to the current state-of-the-art method. Finally, we demonstrate that a deeper understanding of hashtag semantics obtained through segmentation is useful for downstream applications such as sentiment analysis, for which we achieved a 2.6 SemEval 2017 sentiment analysis dataset.



page 1

page 2

page 3

page 4


An AutoML-based Approach to Multimodal Image Sentiment Analysis

Sentiment analysis is a research topic focused on analysing data to extr...

SlangSD: Building and Using a Sentiment Dictionary of Slang Words for Short-Text Sentiment Classification

Sentiment in social media is increasingly considered as an important res...

Zero-shot hashtag segmentation for multilingual sentiment analysis

Hashtag segmentation, also known as hashtag decomposition, is a common s...

When Saliency Meets Sentiment: Understanding How Image Content Invokes Emotion and Sentiment

Sentiment analysis is crucial for extracting social signals from social ...

Misspelling Semantics In Thai

User-generated content is full of misspellings. Rather than being just r...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.