Deep Autoencoder-based Fuzzy C-Means for Topic Detection

02/02/2021
by   Hendri Murfi, et al.
1

Topic detection is a process for determining topics from a collection of textual data. One of the topic detection methods is a clustering-based method, which assumes that the centroids are topics. The clustering method has the advantage that it can process data with negative representations. Therefore, the clustering method allows a combination with a broader representation learning method. In this paper, we adopt deep learning for topic detection by using a deep autoencoder and fuzzy c-means called deep autoencoder-based fuzzy c-means (DFCM). The encoder of the autoencoder performs a lower-dimensional representation learning. Fuzzy c-means groups the lower-dimensional representation to identify the centroids. The autoencoder's decoder transforms back the centroids into the original representation to be interpreted as the topics. Our simulation shows that DFCM improves the coherence score of eigenspace-based fuzzy c-means (EFCM) and is comparable to the leading standard methods, i.e., nonnegative matrix factorization (NMF) or latent Dirichlet allocation (LDA).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/18/2023

A Novel Method of Fuzzy Topic Modeling based on Transformer Processing

Topic modeling is admittedly a convenient way to monitor markets trend. ...
research
09/30/2021

Deep Embedded K-Means Clustering

Recently, deep clustering methods have gained momentum because of the hi...
research
12/18/2019

Topic subject creation using unsupervised learning for topic modeling

We describe the use of Non-Negative Matrix Factorization (NMF) and Laten...
research
05/11/2020

SCAT: Second Chance Autoencoder for Textual Data

We present a k-competitive learning approach for textual autoencoders na...
research
01/12/2016

Deep Learning of Part-based Representation of Data Using Sparse Autoencoders with Nonnegativity Constraints

We demonstrate a new deep learning autoencoder network, trained by a non...
research
08/05/2015

Progressive EM for Latent Tree Models and Hierarchical Topic Detection

Hierarchical latent tree analysis (HLTA) is recently proposed as a new m...
research
05/19/2016

Inter-Battery Topic Representation Learning

In this paper, we present the Inter-Battery Topic Model (IBTM). Our appr...

Please sign up or login with your details

Forgot password? Click here to reset