Scalable Modeling of Conversational-role based Self-presentation Characteristics in Large Online Forums

12/10/2015
by Abhimanu Kumar, et al.

Online discussion forums are complex webs of overlapping subcommunities (macrolevel structure, across threads) in which users enact different roles depending on which subcommunity they are participating in at a particular point in time (microlevel structure, within threads). This sub-network structure is implicit in massive collections of threads. To uncover it, we develop a scalable algorithm based on stochastic variational inference that combines topic models (LDA) with mixed membership stochastic blockmodels (MMSB). We evaluate our model on three large-scale datasets: Cancer-ThreadStarter (22K users and 14.4K threads), Cancer-NameMention (15.1K users and 12.4K threads), and StackOverFlow (1.19 million users and 4.55 million threads). Qualitatively, we demonstrate that our model can provide useful explanations of microlevel and macrolevel user presentation characteristics in different communities using the topics discovered from posts. Quantitatively, we show that our model outperforms both MMSB and LDA in predicting user reply structure within threads. In addition, we demonstrate via synthetic data experiments that the proposed active sub-network discovery model is stable and recovers the original parameters of the experimental setup with high probability.
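For readers unfamiliar with the MMSB component mentioned above, the following is a minimal sketch of the standard MMSB generative process only. It is not the combined LDA and MMSB model of the paper, and it does not include the stochastic variational inference step. The function name `sample_mmsb` and the parameters `alpha`, `within`, and `between` are hypothetical, chosen for illustration: each user draws a mixed-membership vector over roles, and each directed reply link is drawn from a role-to-role interaction matrix.

```python
import numpy as np

def sample_mmsb(num_users, num_roles, alpha=0.5, within=0.8, between=0.05, seed=0):
    """Draw a directed link matrix from the standard MMSB generative process.

    All parameter names and default values are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    # Mixed-membership vector over roles for every user (rows sum to 1).
    pi = rng.dirichlet(np.full(num_roles, alpha), size=num_users)
    # Role-to-role link probabilities: dense within a role, sparse across roles.
    B = np.full((num_roles, num_roles), between)
    np.fill_diagonal(B, within)
    Y = np.zeros((num_users, num_users), dtype=int)
    for p in range(num_users):
        for q in range(num_users):
            if p == q:
                continue  # no self-links
            z_pq = rng.choice(num_roles, p=pi[p])  # role p takes toward q
            z_qp = rng.choice(num_roles, p=pi[q])  # role q takes toward p
            Y[p, q] = rng.binomial(1, B[z_pq, z_qp])
    return pi, B, Y

pi, B, Y = sample_mmsb(num_users=20, num_roles=3)
```

Because a user samples a fresh role for every interaction, the same user can act in different roles toward different partners, which is the property the abstract relies on when it describes roles that vary by subcommunity.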

research
07/19/2011

Using Variational Inference and MapReduce to Scale Topic Modeling

Latent Dirichlet Allocation (LDA) is a popular topic modeling technique ...
research
10/16/2015

Scalable MCMC for Mixed Membership Stochastic Blockmodels

We propose a stochastic gradient Markov chain Monte Carlo (SG-MCMC) algo...
research
05/01/2017

Stochastic Divergence Minimization for Biterm Topic Model

As the emergence and the thriving development of social networks, a huge...
research
07/11/2017

Unsupervised robust nonparametric learning of hidden community properties

We consider learning of fundamental properties of communities in large n...
research
05/31/2016

Extreme Stochastic Variational Inference: Distributed and Asynchronous

We propose extreme stochastic variational inference (ESVI), an asynchron...
research
02/27/2017

Semi-parametric Network Structure Discovery Models

We propose a network structure discovery model for continuous observatio...
research
02/07/2019

Towards Autoencoding Variational Inference for Aspect-based Opinion Summary

Aspect-based Opinion Summary (AOS), consisting of aspect discovery and s...
