Clustering via Content-Augmented Stochastic Blockmodels

05/25/2015
by   J. Massey Cashore, et al.
0

Much of the data being created on the web contains interactions between users and items. Stochastic blockmodels, and other methods for community detection and clustering of bipartite graphs, can infer latent user communities and latent item clusters from this interaction data. These methods, however, typically ignore the items' contents and the information they provide about item clusters, despite the tendency of items in the same latent cluster to share commonalities in content. We introduce content-augmented stochastic blockmodels (CASB), which use item content together with user-item interaction data to enhance the user communities and item clusters learned. Comparisons to several state-of-the-art benchmark methods, on datasets arising from scientists interacting with scientific articles, show that content-augmented stochastic blockmodels provide highly accurate clusters with respect to metrics representative of the underlying community structure.

READ FULL TEXT

page 6

page 8

research
04/19/2021

Mining Latent Structures for Multimedia Recommendation

Multimedia content is of predominance in the modern Web era. Investigati...
research
04/03/2019

Stochastic Blockmodels with Edge Information

Stochastic blockmodels allow us to represent networks in terms of a late...
research
07/28/2020

Finding Scientific Communities In Citation Graphs: Convergent Clustering

Understanding the nature and organization of scientific communities is o...
research
09/01/2023

Unidimensionality in Rasch Models: Efficient Item Selection and Hierarchical Clustering Methods Based on Marginal Estimates

A strong tool for the selection of items that share a common trait from ...
research
10/20/2015

Optimal Cluster Recovery in the Labeled Stochastic Block Model

We consider the problem of community detection or clustering in the labe...
research
05/18/2021

Enabling self-verifiable mutable content items in IPFS using Decentralized Identifiers

In IPFS content identifiers are constructed based on the item's data the...
research
04/12/2019

N2VSCDNNR: A Local Recommender System Based on Node2vec and Rich Information Network

Recommender systems are becoming more and more important in our daily li...

Please sign up or login with your details

Forgot password? Click here to reset