Multivariate Powered Dirichlet Hawkes Process

12/12/2022
by   Gaël Poux-Médard, et al.
0

The publication time of a document carries a relevant information about its semantic content. The Dirichlet-Hawkes process has been proposed to jointly model textual information and publication dynamics. This approach has been used with success in several recent works, and extended to tackle specific challenging problems –typically for short texts or entangled publication dynamics. However, the prior in its current form does not allow for complex publication dynamics. In particular, inferred topics are independent from each other –a publication about finance is assumed to have no influence on publications about politics, for instance. In this work, we develop the Multivariate Powered Dirichlet-Hawkes Process (MPDHP), that alleviates this assumption. Publications about various topics can now influence each other. We detail and overcome the technical challenges that arise from considering interacting topics. We conduct a systematic evaluation of MPDHP on a range of synthetic datasets to define its application domain and limitations. Finally, we develop a use case of the MPDHP on Reddit data. At the end of this article, the interested reader will know how and when to use MPDHP, and when not to.

READ FULL TEXT

page 9

page 11

research
09/15/2021

Powered Hawkes-Dirichlet Process: Challenging Textual Clustering using a Flexible Temporal Prior

The textual content of a document and its publication date are intertwin...
research
01/29/2022

Le Processus Powered Dirichlet-Hawkes comme A Priori Flexible pour Clustering Temporel de Textes

The textual content of a document and its publication date are intertwin...
research
12/12/2022

Dirichlet-Survival Process: Scalable Inference of Topic-Dependent Diffusion Networks

Information spread on networks can be efficiently modeled by considering...
research
02/16/2023

Topic Modeling in Density Functional Theory on Citations of Condensed Matter Electronic Structure Packages

With an increasing number of new scientific papers being released, it be...
research
10/06/2019

Predicting publication productivity for researchers: A latent variable model

This study provided a model for the publication dynamics of researchers,...
research
04/07/2021

Evaluating the state-of-the-art in mapping research spaces: a Brazilian case study

Scientific knowledge cannot be seen as a set of isolated fields, but as ...
research
03/27/2023

Retrievability in an Integrated Retrieval System: An Extended Study

Retrievability measures the influence a retrieval system has on the acce...

Please sign up or login with your details

Forgot password? Click here to reset