Coupled Compound Poisson Factorization

01/09/2017
by Mehmet E. Basbug, et al.

We present a general framework, the coupled compound Poisson factorization (CCPF), to capture the missing-data mechanism in extremely sparse data sets by coupling a hierarchical Poisson factorization with an arbitrary data-generating model. We derive a stochastic variational inference algorithm for the resulting model and, as examples of our framework, implement three different data-generating models (a mixture model, linear regression, and factor analysis) to robustly model non-random missing data in the context of clustering, prediction, and matrix factorization. In all three cases, we test our framework against models that ignore the missing-data mechanism on large-scale studies with non-random missing data, and we show that explicitly modeling the missing-data mechanism substantially improves the quality of the results, as measured by data log likelihood on a held-out test set.
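To make the idea of pairing a Poisson factorization with a separate data-generating model concrete, the following is a minimal simulation sketch, not the paper's model or inference algorithm. It assumes a plain gamma-Poisson factorization (without the hierarchical priors) decides which entries are observed, and a two-component Gaussian mixture over rows supplies the values. The function name `simulate_coupled_data`, every hyperparameter, and the rule "observed when the latent count is positive" are illustrative assumptions; the abstract does not specify how the two parts are coupled.

```python
import numpy as np


def simulate_coupled_data(n_rows=200, n_cols=50, n_factors=5, seed=0):
    """Simulate a sparse matrix whose missingness pattern comes from a
    gamma-Poisson factorization and whose values come from a mixture model.

    All names and hyperparameters are hypothetical and chosen only for
    illustration.
    """
    rng = np.random.default_rng(seed)

    # Missing-data side: nonnegative row and column factors.
    theta = rng.gamma(shape=0.3, scale=1.0, size=(n_rows, n_factors))
    beta = rng.gamma(shape=0.3, scale=1.0, size=(n_cols, n_factors))

    # Latent count per entry; small rates make most counts zero (sparse).
    counts = rng.poisson(theta @ beta.T)

    # Assumption: an entry is observed only if its latent count is positive.
    observed = counts > 0

    # Data-generating side (assumed): each row belongs to one of two
    # Gaussian clusters with means -2 and +2 and unit variance.
    cluster = rng.integers(0, 2, size=n_rows)
    means = np.array([-2.0, 2.0])
    values = rng.normal(loc=means[cluster][:, None], scale=1.0,
                        size=(n_rows, n_cols))

    # Unobserved entries are masked out as NaN.
    data = np.where(observed, values, np.nan)
    return data, observed, cluster


data, observed, cluster = simulate_coupled_data()
print(f"fraction of entries observed: {observed.mean():.3f}")
```

Because the factors `theta` and `beta` vary across rows and columns, the chance that an entry is observed is not uniform, which is the kind of structured missingness that a model ignoring the missing-data mechanism cannot account for.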

research
10/18/2017

Diagnosing missing always at random in multivariate data

Models for analyzing multivariate data sets with missing values require ...
research
11/22/2017

Variational Bayesian Inference For A Scale Mixture Of Normal Distributions Handling Missing Data

In this paper, a scale mixture of Normal distributions model is develope...
research
10/14/2019

Bayesian Temporal Factorization for Multidimensional Time Series Prediction

Large-scale and multidimensional spatiotemporal data sets are becoming u...
research
05/26/2022

RIGID: Robust Linear Regression with Missing Data

We present a robust framework to perform linear regression with missing ...
research
09/06/2022

Understanding and Reducing Crater Counting Errors in Citizen Science Data and the Need for Standardisation

Citizen science has become a popular tool for preliminary data processin...
research
07/05/2022

Variational Inference of Dynamic Factor Models with Arbitrary Missing Data

Dynamic factor models are often estimated by point-estimation methods, d...
research
03/28/2015

Sparse Linear Regression With Missing Data

This paper proposes a fast and accurate method for sparse regression in ...
