Nonparametric Bayesian Negative Binomial Factor Analysis

04/25/2016
by   Mingyuan Zhou, et al.
0

A common approach to analyze a covariate-sample count matrix, an element of which represents how many times a covariate appears in a sample, is to factorize it under the Poisson likelihood. We show its limitation in capturing the tendency for a covariate present in a sample to both repeat itself and excite related ones. To address this limitation, we construct negative binomial factor analysis (NBFA) to factorize the matrix under the negative binomial likelihood, and relate it to a Dirichlet-multinomial distribution based mixed-membership model. To support countably infinite factors, we propose the hierarchical gamma-negative binomial process. By exploiting newly proved connections between discrete distributions, we construct two blocked and a collapsed Gibbs sampler that all adaptively truncate their number of factors, and demonstrate that the blocked Gibbs sampler developed under a compound Poisson representation converges fast and has low computational complexity. Example results show that NBFA has a distinct mechanism in adjusting its number of inferred factors according to the sample lengths, and provides clear advantages in parsimonious representation, predictive power, and computational complexity over previously proposed discrete latent variable models, which either completely ignore burstiness, or model only the burstiness of the covariates but not that of the factors.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/29/2014

Bayesian nonparametric comorbidity analysis of psychiatric disorders

The analysis of comorbidity is an open and complex research field in the...
research
09/15/2012

Negative Binomial Process Count and Mixture Modeling

The seemingly disjoint problems of count and mixture modeling are united...
research
10/28/2014

Beta-Negative Binomial Process and Exchangeable Random Partitions for Mixed-Membership Modeling

The beta-negative binomial process (BNBP), an integer-valued stochastic ...
research
11/06/2015

The Poisson Gamma Belief Network

To infer a multilayer representation of high-dimensional count vectors, ...
research
08/23/2016

Softplus Regressions and Convex Polytopes

To construct flexible nonlinear predictive distributions, the paper intr...
research
05/14/2019

Convolutional Poisson Gamma Belief Network

For text analysis, one often resorts to a lossy representation that eith...

Please sign up or login with your details

Forgot password? Click here to reset