vONTSS: vMF based semi-supervised neural topic modeling with optimal transport

07/03/2023
by   Weijie Xu, et al.
0

Recently, Neural Topic Models (NTM), inspired by variational autoencoders, have attracted a lot of research interest; however, these methods have limited applications in the real world due to the challenge of incorporating human knowledge. This work presents a semi-supervised neural topic modeling method, vONTSS, which uses von Mises-Fisher (vMF) based variational autoencoders and optimal transport. When a few keywords per topic are provided, vONTSS in the semi-supervised setting generates potential topics and optimizes topic-keyword quality and topic classification. Experiments show that vONTSS outperforms existing semi-supervised topic modeling methods in classification accuracy and diversity. vONTSS also supports unsupervised topic modeling. Quantitative and qualitative experiments show that vONTSS in the unsupervised setting outperforms recent NTMs on multiple aspects: vONTSS discovers highly clustered and coherent topics on benchmark datasets. It is also much faster than the state-of-the-art weakly supervised text classification method while achieving similar classification performance. We further prove the equivalence of optimal transport loss and cross-entropy loss at the global minimum.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/06/2023

S2vNTM: Semi-supervised vMF Neural Topic Modeling

Language model based methods are powerful techniques for text classifica...
research
07/04/2023

KDSTM: Neural Semi-supervised Topic Modeling with Knowledge Distillation

In text classification tasks, fine tuning pretrained language models lik...
research
09/08/2020

Revisiting LSTM Networks for Semi-Supervised Text Classification via Mixed Objective Function

In this paper, we study bidirectional LSTM network for the task of text ...
research
10/14/2020

Semi-Supervised Bilingual Lexicon Induction with Two-way Interaction

Semi-supervision is a promising paradigm for Bilingual Lexicon Induction...
research
04/07/2022

A Joint Learning Approach for Semi-supervised Neural Topic Modeling

Topic models are some of the most popular ways to represent textual data...
research
02/13/2022

Metric Learning-enhanced Optimal Transport for Biochemical Regression Domain Adaptation

Generalizing knowledge beyond source domains is a crucial prerequisite f...
research
07/18/2020

Semi-Supervised Learning Approach to Discover Enterprise User Insights from Feedback and Support

With the evolution of the cloud and customer centric culture, we inheren...

Please sign up or login with your details

Forgot password? Click here to reset