Random-walk Based Generative Model for Classifying Document Networks

01/21/2020
by Takafumi J. Suzuki, et al.

Document networks are found in various collections of real-world data, such as citation networks, hyperlinked web pages, and online social networks. A large number of generative models have been proposed because they offer intuitive and useful pictures for analyzing document networks. Prominent examples are relational topic models, where documents are linked according to their topic similarities. However, existing generative models do not make full use of network structure because they depend largely on topic modeling of documents. In particular, the centrality of graph nodes is missing from the generative processes of previous models. In this paper, we propose a novel generative model for document networks that introduces random walkers on networks to integrate node centrality into the link generation process. The developed method is evaluated on semi-supervised classification tasks with real-world citation networks. We show that the proposed model outperforms existing probabilistic approaches, especially in detecting communities in connected networks.
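To make the link between random walkers and node centrality concrete, here is a minimal sketch of one common random-walk centrality: the stationary distribution of a PageRank-style walk with restart, computed by power iteration. It is not the generative model proposed in the paper, whose details the abstract does not give. The function name `random_walk_centrality`, the `damping` parameter, and the toy `citations` graph are all assumptions made for illustration.

```python
def random_walk_centrality(adjacency, damping=0.85, iterations=100):
    """Approximate the stationary distribution of a random walk with restart.

    `adjacency` maps each node to the list of nodes it links to.
    This is a generic PageRank-style sketch, not the paper's model.
    """
    nodes = sorted(adjacency)
    n = len(nodes)
    # Start the walker from a uniform distribution over nodes.
    score = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        # With probability (1 - damping) the walker restarts at a random node.
        updated = {v: (1.0 - damping) / n for v in nodes}
        for v in nodes:
            neighbors = adjacency[v]
            if neighbors:
                # Otherwise it follows one outgoing link chosen uniformly.
                share = damping * score[v] / len(neighbors)
                for u in neighbors:
                    updated[u] += share
            else:
                # A node without outgoing links spreads its mass uniformly.
                share = damping * score[v] / n
                for u in nodes:
                    updated[u] += share
        score = updated
    return score


# Hypothetical toy citation network: each key cites the documents in its list.
citations = {
    "a": ["b", "c"],
    "b": ["c"],
    "c": ["a"],
    "d": ["c"],
}
scores = random_walk_centrality(citations)
for node in sorted(scores, key=scores.get, reverse=True):
    print(node, round(scores[node], 3))
```

In this toy graph the walker visits document "c" most often, because three other documents cite it, and "d" least often, because nothing cites it. A model that uses such visit frequencies when generating links can favor central documents in a way that topic similarity alone cannot.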


