Persian topic detection based on Human Word association and graph embedding

02/20/2023
by   Mehrdad Ranjbar-Khadivi, et al.
0

In this paper, we propose a framework to detect topics in social media based on Human Word Association. Identifying topics discussed in these media has become a critical and significant challenge. Most of the work done in this area is in English, but much has been done in the Persian language, especially microblogs written in Persian. Also, the existing works focused more on exploring frequent patterns or semantic relationships and ignored the structural methods of language. In this paper, a topic detection framework using HWA, a method for Human Word Association, is proposed. This method uses the concept of imitation of mental ability for word association. This method also calculates the Associative Gravity Force that shows how words are related. Using this parameter, a graph can be generated. The topics can be extracted by embedding this graph and using clustering methods. This approach has been applied to a Persian language dataset collected from Telegram. Several experimental studies have been performed to evaluate the proposed framework's performance. Experimental results show that this approach works better than other topic detection methods.

READ FULL TEXT

page 8

page 12

research
01/30/2023

A Human Word Association based model for topic detection in social networks

With the widespread use of social networks, detecting the topics discuss...
research
04/19/2018

Invitación al estudio estadístico del lenguaje

Invitation to the statistical study of language: The topic of this prese...
research
05/26/2020

Examining Racial Bias in an Online Abuse Corpus with Structural Topic Modeling

We use structural topic modeling to examine racial bias in data collecte...
research
06/10/2020

A novel sentence embedding based topic detection method for micro-blog

Topic detection is a challenging task, especially without knowing the ex...
research
05/04/2021

Unsupervised Graph-based Topic Modeling from Video Transcriptions

To unfold the tremendous amount of audiovisual data uploaded daily to so...
research
06/07/2021

Network-based Trajectory Topic Interaction Map for Text Mining of COVID-19 Biomedical Literature

Since the emergence of the worldwide pandemic of COVID-19, relevant rese...
research
12/16/2019

Optimized Tracking of Topic Evolution

Topic evolution modeling has been researched for a long time and has gai...

Please sign up or login with your details

Forgot password? Click here to reset