Towards the TopMost: A Topic Modeling System Toolkit

09/13/2023
by Xiaobao Wu, et al.

Topic models have been studied for decades, have found various applications, and have recently been revitalized by neural variational inference. However, existing topic models use entirely different datasets, implementations, and evaluation settings, which hinders their quick adoption and fair comparison and, in turn, slows research progress. To address these issues, we propose a Topic Modeling System Toolkit (TopMost). Compared to existing toolkits, TopMost covers a wider range of topic modeling scenarios and supports the complete lifecycle: dataset pre-processing, model training, testing, and evaluation. Its highly cohesive, decoupled modular design enables quick use, fair comparison, and flexible extension of different topic models, which can facilitate both research on and applications of topic models. Our code, tutorials, and documentation are available at https://github.com/bobxwu/topmost.


