Probabilistic Latent Semantic Analysis (PLSA) untuk Klasifikasi Dokumen Teks Berbahasa Indonesia

12/02/2015
by   Derwin Suhartono, et al.
0

One task that is included in managing documents is how to find substantial information inside. Topic modeling is a technique that has been developed to produce document representation in form of keywords. The keywords will be used in the indexing process and document retrieval as needed by users. In this research, we will discuss specifically about Probabilistic Latent Semantic Analysis (PLSA). It will cover PLSA mechanism which involves Expectation Maximization (EM) as the training algorithm, how to conduct testing, and obtain the accuracy result.

READ FULL TEXT
research
12/17/2012

A Tutorial on Probabilistic Latent Semantic Analysis

In this tutorial, I will discuss the details about how Probabilistic Lat...
research
10/12/2021

Topic Model Supervised by Understanding Map

Inspired by the notion of Center of Mass in physics, an extension called...
research
07/29/2018

Discovering Latent Information By Spreading Activation Algorithm For Document Retrieval

Syntactic search relies on keywords contained in a query to find suitabl...
research
09/02/2023

MPTopic: Improving topic modeling via Masked Permuted pre-training

Topic modeling is pivotal in discerning hidden semantic structures withi...
research
01/02/2021

A Multilayer Correlated Topic Model

We proposed a novel multilayer correlated topic model (MCTM) to analyze ...
research
03/07/2019

Quantum Latent Semantic Analysis

The main goal of this paper is to explore latent topic analysis (LTA), i...
research
01/23/2013

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis is a novel statistical technique ...

Please sign up or login with your details

Forgot password? Click here to reset