Twitter Topic Classification

09/20/2022
by   Dimosthenis Antypas, et al.
3

Social media platforms host discussions about a wide variety of topics that arise everyday. Making sense of all the content and organising it into categories is an arduous task. A common way to deal with this issue is relying on topic modeling, but topics discovered using this technique are difficult to interpret and can differ from corpus to corpus. In this paper, we present a new task based on tweet topic classification and release two associated datasets. Given a wide range of topics covering the most important discussion points in social media, we provide training and testing data from recent time periods that can be used to evaluate tweet classification models. Moreover, we perform a quantitative evaluation and analysis of current general- and domain-specific language models on the task, which provide more insights on the challenges and nature of the task.

READ FULL TEXT

page 8

page 15

research
05/23/2017

TwiInsight: Discovering Topics and Sentiments from Social Media Datasets

Social media platforms contain a great wealth of information which provi...
research
05/03/2022

CTM – A Model for Large-Scale Multi-View Tweet Topic Classification

Automatically associating social media posts with topics is an important...
research
06/03/2018

Transfer Topic Labeling with Domain-Specific Knowledge Base: An Analysis of UK House of Commons Speeches 1935-2014

Topic models are among the most widely used methods in natural language ...
research
04/08/2019

Issue Framing in Online Discussion Fora

In online discussion fora, speakers often make arguments for or against ...
research
02/06/2023

It's about Time: Rethinking Evaluation on Rumor Detection Benchmarks using Chronological Splits

New events emerge over time influencing the topics of rumors in social m...
research
03/29/2023

Using Semantic Similarity and Text Embedding to Measure the Social Media Echo of Strategic Communications

Online discourse covers a wide range of topics and many actors tailor th...
research
10/22/2018

Sparsemax and Relaxed Wasserstein for Topic Sparsity

Topic sparsity refers to the observation that individual documents usual...

Please sign up or login with your details

Forgot password? Click here to reset