Construction and Quality Evaluation of Heterogeneous Hierarchical Topic Models

11/07/2018
by   Anton Belyy, et al.
0

In our work, we propose to represent HTM as a set of flat models, or layers, and a set of topical hierarchies, or edges. We suggest several quality measures for edges of hierarchical models, resembling those proposed for flat models. We conduct an assessment experimentation and show strong correlation between the proposed measures and human judgement on topical edge quality. We also introduce heterogeneous algorithm to build hierarchical topic models for heterogeneous data sources. We show how making certain adjustments to learning process helps to retain original structure of customized models while allowing for slight coherent modifications for new documents. We evaluate this approach using the proposed measures and show that the proposed heterogeneous algorithm significantly outperforms the baseline concat approach. Finally, we implement our own ESE called Rysearch, which demonstrates the potential of ARTM approach for visualizing large heterogeneous document collections.

READ FULL TEXT

page 15

page 26

page 27

page 29

page 30

page 31

page 34

page 37

research
06/15/2023

Hierarchical confusion matrix for classification performance evaluation

In this work we propose a novel concept of a hierarchical confusion matr...
research
01/16/2013

A Nested HDP for Hierarchical Topic Models

We develop a nested hierarchical Dirichlet process (nHDP) for hierarchic...
research
10/25/2012

Nested Hierarchical Dirichlet Processes

We develop a nested hierarchical Dirichlet process (nHDP) for hierarchic...
research
08/29/2017

Unsupervised Terminological Ontology Learning based on Hierarchical Topic Modeling

In this paper, we present hierarchical relationbased latent Dirichlet al...
research
07/18/2019

Recommender Systems with Heterogeneous Side Information

In modern recommender systems, both users and items are associated with ...
research
01/19/2021

Analysis and tuning of hierarchical topic models based on Renyi entropy approach

Hierarchical topic modeling is a potentially powerful instrument for det...
research
04/29/2019

Semantic Matching of Documents from Heterogeneous Collections: A Simple and Transparent Method for Practical Applications

We present a very simple, unsupervised method for the pairwise matching ...

Please sign up or login with your details

Forgot password? Click here to reset