Dirichlet-vMF Mixture Model

02/24/2017
by   Shaohua Li, et al.
0

This document is about the multi-document Von-Mises-Fisher mixture model with a Dirichlet prior, referred to as VMFMix. VMFMix is analogous to Latent Dirichlet Allocation (LDA) in that they can capture the co-occurrence patterns acorss multiple documents. The difference is that in VMFMix, the topic-word distribution is defined on a continuous n-dimensional hypersphere. Hence VMFMix is used to derive topic embeddings, i.e., representative vectors, from multiple sets of embedding vectors. An efficient Variational Expectation-Maximization inference algorithm is derived. The performance of VMFMix on two document classification tasks is reported, with some preliminary analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/11/2018

jLDADMM: A Java package for the LDA and DMM topic models

In this technical report, we present jLDADMM---an easy-to-use Java toolk...
research
01/02/2021

A Multilayer Correlated Topic Model

We proposed a novel multilayer correlated topic model (MCTM) to analyze ...
research
09/02/2019

Clustering of count data through a mixture of multinomial PCA

Count data is becoming more and more ubiquitous in a wide range of appli...
research
11/26/2014

Fisher Vectors Derived from Hybrid Gaussian-Laplacian Mixture Models for Image Annotation

In the traditional object recognition pipeline, descriptors are densely ...
research
02/22/2018

Learning Topic Models by Neighborhood Aggregation

Topic models are one of the most frequently used models in machine learn...
research
01/12/2015

Autodetection and Classification of Hidden Cultural City Districts from Yelp Reviews

Topic models are a way to discover underlying themes in an otherwise uns...
research
05/06/2016

Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec

Distributed dense word vectors have been shown to be effective at captur...

Please sign up or login with your details

Forgot password? Click here to reset