A High-Performance Implementation of Bayesian Matrix Factorization with Limited Communication

04/06/2020
by   Tom Vander Aa, et al.
42

Matrix factorization is a very common machine learning technique in recommender systems. Bayesian Matrix Factorization (BMF) algorithms would be attractive because of their ability to quantify uncertainty in their predictions and avoid over-fitting, combined with high prediction accuracy. However, they have not been widely used on large-scale data because of their prohibitive computational cost. In recent work, efforts have been made to reduce the cost, both by improving the scalability of the BMF algorithm as well as its implementation, but so far mainly separately. In this paper we show that the state-of-the-art of both approaches to scalability can be combined. We combine the recent highly-scalable Posterior Propagation algorithm for BMF, which parallelizes computation of blocks of the matrix, with a distributed BMF implementation that users asynchronous communication within each block. We show that the combination of the two methods gives substantial improvements in the scalability of BMF on web-scale datasets, when the goal is to reduce the wall-clock time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/04/2019

SMURFF: a High-Performance Framework for Matrix Factorization

Bayesian Matrix Factorization (BMF) is a powerful technique for recommen...
research
03/05/2015

Large-Scale Distributed Bayesian Matrix Factorization using Stochastic Gradient MCMC

Despite having various attractive qualities such as high prediction accu...
research
12/03/2021

ALX: Large Scale Matrix Factorization on TPUs

We present ALX, an open-source library for distributed matrix factorizat...
research
03/02/2017

Distributed Bayesian Matrix Factorization with Minimal Communication

Bayesian matrix factorization (BMF) is a powerful tool for producing low...
research
09/11/2011

Feature-Based Matrix Factorization

Recommender system has been more and more popular and widely used in man...
research
08/25/2019

Scalable Probabilistic Matrix Factorization with Graph-Based Priors

In matrix factorization, available graph side-information may not be wel...
research
03/15/2012

A Bayesian Matrix Factorization Model for Relational Data

Relational learning can be used to augment one data source with other co...

Please sign up or login with your details

Forgot password? Click here to reset