Multi-scale Hybridized Topic Modeling: A Pipeline for Analyzing Unstructured Text Datasets via Topic Modeling

11/24/2022
by   Keyi Cheng, et al.
0

We propose a multi-scale hybridized topic modeling method to find hidden topics from transcribed interviews more accurately and efficiently than traditional topic modeling methods. Our multi-scale hybridized topic modeling method (MSHTM) approaches data at different scales and performs topic modeling in a hierarchical way utilizing first a classical method, Nonnegative Matrix Factorization, and then a transformer-based method, BERTopic. It harnesses the strengths of both NMF and BERTopic. Our method can help researchers and the public better extract and interpret the interview information. Additionally, it provides insights for new indexing systems based on the topic level. We then deploy our method on real-world interview transcripts and find promising results.

READ FULL TEXT

page 6

page 10

page 15

research
09/30/2021

A Generalized Hierarchical Nonnegative Tensor Decomposition

Nonnegative matrix factorization (NMF) has found many applications inclu...
research
01/02/2020

On Large-Scale Dynamic Topic Modeling with Nonnegative CP Tensor Decomposition

There is currently an unprecedented demand for large-scale temporal data...
research
09/14/2018

Identification of multi-scale hierarchical brain functional networks using deep matrix factorization

We present a deep semi-nonnegative matrix factorization method for ident...
research
02/24/2021

Deep NMF Topic Modeling

Nonnegative matrix factorization (NMF) based topic modeling methods do n...
research
04/13/2022

Neural Topic Modeling of Psychotherapy Sessions

In this work, we compare different neural topic modeling methods in lear...
research
07/13/2021

Semiparametric Latent Topic Modeling on Consumer-Generated Corpora

Legacy procedures for topic modelling have generally suffered problems o...

Please sign up or login with your details

Forgot password? Click here to reset