On Large-Scale Dynamic Topic Modeling with Nonnegative CP Tensor Decomposition

01/02/2020
by   Miju Ahn, et al.
16

There is currently an unprecedented demand for large-scale temporal data analysis due to the explosive growth of data. Dynamic topic modeling has been widely used in social and data sciences with the goal of learning latent topics that emerge, evolve, and fade over time. Previous work on dynamic topic modeling primarily employ the method of nonnegative matrix factorization (NMF), where slices of the data tensor are each factorized into the product of lower-dimensional nonnegative matrices. With this approach, however, information contained in the temporal dimension of the data is often neglected or underutilized. To overcome this issue, we propose instead adopting the method of nonnegative CANDECOMP/PARAPAC (CP) tensor decomposition (NNCPD), where the data tensor is directly decomposed into a minimal sum of outer products of nonnegative vectors, thereby preserving the temporal information. The viability of NNCPD is demonstrated through application to both synthetic and real data, where significantly improved results are obtained compared to those of typical NMF-based methods. The advantages of NNCPD over such approaches are studied and discussed. To the best of our knowledge, this is the first time that NNCPD has been utilized for the purpose of dynamic topic modeling, and our findings will be transformative for both applications and further developments.

READ FULL TEXT

page 8

page 9

page 10

page 11

page 12

page 13

page 15

page 17

research
10/04/2020

On Nonnegative Matrix and Tensor Decompositions for COVID-19 Twitter Dynamics

We analyze Twitter data relating to the COVID-19 pandemic using dynamic ...
research
09/30/2021

A Generalized Hierarchical Nonnegative Tensor Decomposition

Nonnegative matrix factorization (NMF) has found many applications inclu...
research
11/24/2022

Multi-scale Hybridized Topic Modeling: A Pipeline for Analyzing Unstructured Text Datasets via Topic Modeling

We propose a multi-scale hybridized topic modeling method to find hidden...
research
09/16/2020

Online nonnegative tensor factorization and CP-dictionary learning for Markovian data

Nonnegative Matrix Factorization (NMF) algorithms are fundamental tools ...
research
12/01/2019

Topic-aware chatbot using Recurrent Neural Networks and Nonnegative Matrix Factorization

We propose a novel model for a topic-aware chatbot by combining the trad...
research
10/02/2019

Near-Convex Archetypal Analysis

Nonnegative matrix factorization (NMF) is a widely used linear dimension...
research
07/19/2017

Unmixing dynamic PET images with variable specific binding kinetics

To analyze dynamic positron emission tomography (PET) images, various ge...

Please sign up or login with your details

Forgot password? Click here to reset