Online Prediction of Dyadic Data with Heterogeneous Matrix Factorization

01/13/2016
by   Guangyong Chen, et al.
0

Dyadic Data Prediction (DDP) is an important problem in many research areas. This paper develops a novel fully Bayesian nonparametric framework which integrates two popular and complementary approaches, discrete mixed membership modeling and continuous latent factor modeling into a unified Heterogeneous Matrix Factorization (HeMF) model, which can predict the unobserved dyadics accurately. The HeMF can determine the number of communities automatically and exploit the latent linear structure for each bicluster efficiently. We propose a Variational Bayesian method to estimate the parameters and missing data. We further develop a novel online learning approach for Variational inference and use it for the online learning of HeMF, which can efficiently cope with the important large-scale DDP problem. We evaluate the performance of our method on the EachMoive, MovieLens and Netflix Prize collaborative filtering datasets. The experiment shows that, our model outperforms state-of-the-art methods on all benchmarks. Compared with Stochastic Gradient Method (SGD), our online learning approach achieves significant improvement on the estimation accuracy and robustness.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/11/2019

Variational inference for neural network matrix factorization and its application to stochastic blockmodeling

We consider the probabilistic analogue to neural network matrix factoriz...
research
11/13/2012

Boosting Simple Collaborative Filtering Models Using Ensemble Methods

In this paper we examine the effect of applying ensemble learning to the...
research
07/18/2018

Efficient Training on Very Large Corpora via Gramian Estimation

We study the problem of learning similarity functions over very large co...
research
12/07/2020

Probabilistic Latent Factor Model for Collaborative Filtering with Bayesian Inference

Latent Factor Model (LFM) is one of the most successful methods for Coll...
research
03/18/2014

Communication Communities in MOOCs

Massive Open Online Courses (MOOCs) bring together thousands of people f...
research
11/28/2014

Predicting clicks in online display advertising with latent features and side-information

We review a method for click-through rate prediction based on the work o...
research
12/15/2020

Variational Beam Search for Online Learning with Distribution Shifts

We consider the problem of online learning in the presence of sudden dis...

Please sign up or login with your details

Forgot password? Click here to reset