Exploring the Power of Topic Modeling Techniques in Analyzing Customer Reviews: A Comparative Analysis

08/19/2023
by   Anusuya Krishnan, et al.
0

The exponential growth of online social network platforms and applications has led to a staggering volume of user-generated textual content, including comments and reviews. Consequently, users often face difficulties in extracting valuable insights or relevant information from such content. To address this challenge, machine learning and natural language processing algorithms have been deployed to analyze the vast amount of textual data available online. In recent years, topic modeling techniques have gained significant popularity in this domain. In this study, we comprehensively examine and compare five frequently used topic modeling methods specifically applied to customer reviews. The methods under investigation are latent semantic analysis (LSA), latent Dirichlet allocation (LDA), non-negative matrix factorization (NMF), pachinko allocation model (PAM), Top2Vec, and BERTopic. By practically demonstrating their benefits in detecting important topics, we aim to highlight their efficacy in real-world scenarios. To evaluate the performance of these topic modeling methods, we carefully select two textual datasets. The evaluation is based on standard statistical evaluation metrics such as topic coherence score. Our findings reveal that BERTopic consistently yield more meaningful extracted topics and achieve favorable results.

READ FULL TEXT

page 5

page 10

research
05/26/2022

Federated Non-negative Matrix Factorization for Short Texts Topic Modeling with Mutual Information

Non-negative matrix factorization (NMF) based topic modeling is widely u...
research
06/13/2023

A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews

The efficiency of natural language processing has improved dramatically ...
research
06/26/2018

Computational Analysis of Insurance Complaints: GEICO Case Study

The online environment has provided a great opportunity for insurance po...
research
09/27/2021

Evaluation of Non-Negative Matrix Factorization and n-stage Latent Dirichlet Allocation for Emotion Analysis in Turkish Tweets

With the development of technology, the use of social media has become q...
research
02/08/2022

Police Text Analysis: Topic Modeling and Spatial Relative Density Estimation

We analyze a large corpus of police incident narrative documents in unde...
research
08/30/2023

Conti Inc.: Understanding the Internal Discussions of a large Ransomware-as-a-Service Operator with Machine Learning

Ransomware-as-a-service (RaaS) is increasing the scale and complexity of...
research
02/12/2021

Modeling Dynamic User Interests: A Neural Matrix Factorization Approach

In recent years, there has been significant interest in understanding us...

Please sign up or login with your details

Forgot password? Click here to reset