Identifying Topics from Micropost Collections using Linked Open Data

04/06/2018
by   Ahmet Yildirim, et al.
0

The extensive use of social media for sharing and obtaining information has resulted in the development of topic detection models to facilitate the comprehension of the overwhelming amount of short and distributed posts. Probabilistic topic models, such as Latent Dirichlet allocation, represent topics as sets of terms that are useful for many automated processes. However, the determination of what a topic is about is left as a further task. Alternatively, techniques that produce summaries are human comprehensible, but less suitable for automated processing. This work proposes an approach that utilizes Linked Open Data (LOD) resources to extract semantically represented topics from collections of microposts. The proposed approach utilizes entity linking to identify the elements of topics from microposts. The elements are related through co-occurrence graphs, which are processed to yield topics. The topics are represented using an ontology that is introduced for this purpose. A prototype of the approach is used to identify topics from 11 datasets consisting of more than one million posts collected from Twitter during various events, such as the 2016 US election debates and the death of Carrie Fisher. The characteristics of the approach and more than 5 thousand generated topics are described in detail. A human evaluation of topics from 30 randomly selected intervals resulted in a precision of 81.0 they are compared with topics generated from the same datasets with two different kinds of topic models. The potentials of semantic topics in revealing information, that is not otherwise easily observable, is demonstrated with semantic queries of various complexities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/03/2022

CTM – A Model for Large-Scale Multi-View Tweet Topic Classification

Automatically associating social media posts with topics is an important...
research
05/26/2020

Examining Racial Bias in an Online Abuse Corpus with Structural Topic Modeling

We use structural topic modeling to examine racial bias in data collecte...
research
10/21/2019

Using machine learning and information visualisation for discovering latent topics in Twitter news

We propose a method to discover latent topics and visualise large collec...
research
05/16/2020

Integrating Semantic and Structural Information with Graph Convolutional Network for Controversy Detection

Identifying controversial posts on social media is a fundamental task fo...
research
10/11/2020

ComStreamClust: A communicative text clustering approach to topic detection in streaming data

Topic detection is the task of determining and tracking hot topics in so...
research
05/19/2022

A Weakly-Supervised Iterative Graph-Based Approach to Retrieve COVID-19 Misinformation Topics

The COVID-19 pandemic has been accompanied by an `infodemic' – of accura...
research
03/25/2021

Term-community-based topic detection with variable resolution

Network-based procedures for topic detection in huge text collections of...

Please sign up or login with your details

Forgot password? Click here to reset