Topic2Vec: Learning Distributed Representations of Topics

06/28/2015
by Li-Qiang Niu, et al.

Latent Dirichlet Allocation (LDA), which mines the thematic structure of documents, plays an important role in natural language processing and machine learning. However, the probability distributions produced by LDA describe only the statistical relationships of occurrences in the corpus, and in practice probabilities are usually not the best choice for feature representations. Recently, embedding methods such as Word2Vec and Doc2Vec have been proposed to represent words and documents by learning essential concepts and representations. These embedded representations have proven more effective than LDA-style representations in many tasks. In this paper, we propose the Topic2Vec approach, which learns topic representations in the same semantic vector space as words, as an alternative to probability distributions. The experimental results show that Topic2Vec achieves interesting and meaningful results.


research
09/10/2019

Neural Embedding Allocation: Distributed Representations of Topic Models

Word embedding models such as the skip-gram learn vector representations...
research
12/19/2014

N-gram-Based Low-Dimensional Representation for Document Classification

The bag-of-words (BOW) model is the common approach for classifying docu...
research
09/24/2019

Diachronic Topics in New High German Poetry

Statistical topic models are increasingly and popularly used by Digital ...
research
04/23/2018

Discovering Style Trends through Deep Visually Aware Latent Item Embeddings

In this paper, we explore Latent Dirichlet Allocation (LDA) and Polyling...
research
04/23/2019

Exploring the Daschle Collection using Text Mining

A U.S. Senator from South Dakota donated documents that were accumulated...
research
07/28/2023

Resume Evaluation through Latent Dirichlet Allocation and Natural Language Processing for Effective Candidate Selection

In this paper, we propose a method for resume rating using Latent Dirich...
