S2vNTM: Semi-supervised vMF Neural Topic Modeling

07/06/2023
by Weijie Xu, et al.

Language model based methods are powerful techniques for text classification. However, these models have several shortcomings: (1) it is difficult to integrate human knowledge such as keywords; (2) they need a lot of resources to train; and (3) they rely on large text data for pretraining. In this paper, we propose Semi-Supervised vMF Neural Topic Modeling (S2vNTM) to overcome these difficulties. S2vNTM takes a few seed keywords as input for its topics. It leverages the patterns of these keywords to identify potential topics and to optimize the quality of each topic's keyword set. Across a variety of datasets, S2vNTM outperforms existing semi-supervised topic modeling methods in classification accuracy when only limited keywords are provided. S2vNTM is also at least twice as fast as the baselines.

research
07/03/2023

vONTSS: vMF based semi-supervised neural topic modeling with optimal transport

Recently, Neural Topic Models (NTM), inspired by variational autoencoder...
research
08/09/2023

MetRoBERTa: Leveraging Traditional Customer Relationship Management Data to Develop a Transit-Topic-Aware Language Model

Transit riders' feedback provided in ridership surveys, customer relatio...
research
07/04/2023

KDSTM: Neural Semi-supervised Topic Modeling with Knowledge Distillation

In text classification tasks, fine tuning pretrained language models lik...
research
02/06/2021

Exclusive Topic Modeling

We propose an Exclusive Topic Modeling (ETM) for unsupervised text class...
research
11/30/2016

Anchored Correlation Explanation: Topic Modeling with Minimal Domain Knowledge

While generative models such as Latent Dirichlet Allocation (LDA) have p...
research
10/10/2019

Learning Only from Relevant Keywords and Unlabeled Documents

We consider a document classification problem where document labels are ...
research
10/15/2020

Semi-supervised NMF Models for Topic Modeling in Learning Tasks

We propose several new models for semi-supervised nonnegative matrix fac...
