Is Neural Topic Modelling Better than Clustering? An Empirical Study on Clustering with Contextual Embeddings for Topics

04/21/2022
by Zihan Zhang, et al.

Recent work incorporates pre-trained word embeddings, such as BERT embeddings, into Neural Topic Models (NTMs), generating highly coherent topics. However, with high-quality contextualized document representations, do we really need sophisticated neural models to obtain coherent and interpretable topics? In this paper, we conduct thorough experiments showing that directly clustering high-quality sentence embeddings with an appropriate word selection method can generate more coherent and diverse topics than NTMs, while also being simpler and more efficient.
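The pipeline described in the abstract (cluster document embeddings, then pick representative words per cluster) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the 2-D vectors are hand-made stand-ins for real sentence embeddings, the k-means routine is a toy with deterministic initialization, and the cluster-level TF-IDF-style score is only one plausible word selection method.

```python
import math
from collections import Counter

def kmeans(vectors, k, iters=20):
    """Toy k-means. Uses the first k vectors as initial centroids
    (an assumption chosen only to keep the example deterministic)."""
    centroids = [list(v) for v in vectors[:k]]
    labels = [0] * len(vectors)
    for _ in range(iters):
        # Assign each vector to its nearest centroid (squared Euclidean distance).
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(v, centroids[c])))
            for v in vectors
        ]
        # Recompute each centroid as the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def top_words(docs, labels, k, n=2):
    """Treat each cluster as one big document and rank its words by
    term frequency times an IDF computed over clusters."""
    cluster_tf = [Counter() for _ in range(k)]
    for doc, lab in zip(docs, labels):
        cluster_tf[lab].update(doc.lower().split())
    # Number of clusters in which each word appears.
    df = Counter(w for tf in cluster_tf for w in tf)
    topics = []
    for tf in cluster_tf:
        scores = {w: cnt * math.log(1 + k / df[w]) for w, cnt in tf.items()}
        # Sort by descending score, breaking ties alphabetically.
        topics.append(sorted(scores, key=lambda w: (-scores[w], w))[:n])
    return topics

# Hypothetical toy corpus; in practice the vectors would come from a
# pre-trained sentence encoder rather than being written by hand.
docs = [
    "goal match team",
    "team coach goal",
    "stock market bank",
    "bank market rates",
]
embeddings = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]

labels = kmeans(embeddings, k=2)
topics = top_words(docs, labels, k=2, n=2)
for cluster_id, words in enumerate(topics):
    print(cluster_id, words)
```

In this sketch no topic model is trained: the embeddings do the semantic work, and the word scoring only has to surface terms that are frequent within a cluster and rare across clusters.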


