Embedded Topics in the Stochastic Block Model

09/19/2022
by   Rémi Boutin, et al.
0

Communication networks such as emails or social networks are now ubiquitous and their analysis has become a strategic field. In many applications, the goal is to automatically extract relevant information by looking at the nodes and their connections. Unfortunately, most of the existing methods focus on analysing the presence or absence of edges and textual data is often discarded. However, all communication networks actually come with textual data on the edges. In order to take into account this specificity, we consider in this paper networks for which two nodes are linked if and only if they share textual data. We introduce a deep latent variable model allowing embedded topics to be handled called ETSBM to simultaneously perform clustering on the nodes while modelling the topics used between the different clusters. ETSBM extends both the stochastic block model (SBM) and the embedded topic model (ETM) which are core models for studying networks and corpora, respectively. The inference is done using a variational-Bayes expectation-maximisation algorithm combined with a stochastic gradient descent. The methodology is evaluated on synthetic data and on a real world dataset.

READ FULL TEXT

page 18

page 21

research
04/14/2023

The Deep Latent Position Topic Model for Clustering and Representation of Networks with Textual Edges

Numerical interactions leading to users sharing textual content publishe...
research
04/03/2019

Stochastic Blockmodels with Edge Information

Stochastic blockmodels allow us to represent networks in terms of a late...
research
06/22/2023

Efficient preconditioned stochastic gradient descent for estimation in latent variable models

Latent variable models are powerful tools for modeling complex phenomena...
research
12/20/2021

An iterative clustering algorithm for the Contextual Stochastic Block Model with optimality guarantees

Real-world networks often come with side information that can help to im...
research
06/28/2019

missSBM: An R Package for Handling Missing Values in the Stochastic Block Model

The Stochastic Block Model (SBM) is a popular probabilistic model for ra...
research
04/09/2020

Interactions in information spread: quantification and interpretation using stochastic block models

In most real-world applications, it is seldom the case that a given obse...
research
09/08/2015

Modelling time evolving interactions in networks through a non stationary extension of stochastic block models

In this paper, we focus on the stochastic block model (SBM),a probabilis...

Please sign up or login with your details

Forgot password? Click here to reset