Easy Variational Inference for Categorical Models via an Independent Binary Approximation

05/31/2022
by   Michael T. Wojnowicz, et al.
0

We pursue tractable Bayesian analysis of generalized linear models (GLMs) for categorical data. Thus far, GLMs are difficult to scale to more than a few dozen categories due to non-conjugacy or strong posterior dependencies when using conjugate auxiliary variable methods. We define a new class of GLMs for categorical data called categorical-from-binary (CB) models. Each CB model has a likelihood that is bounded by the product of binary likelihoods, suggesting a natural posterior approximation. This approximation makes inference straightforward and fast; using well-known auxiliary variables for probit or logistic regression, the product of binary models admits conjugate closed-form variational inference that is embarrassingly parallel across categories and invariant to category ordering. Moreover, an independent binary model simultaneously approximates multiple CB models. Bayesian model averaging over these can improve the quality of the approximation for any given dataset. We show that our approach scales to thousands of categories, outperforming posterior estimation competitors like Automatic Differentiation Variational Inference (ADVI) and No U-Turn Sampling (NUTS) in the time required to achieve fixed prediction quality.

READ FULL TEXT
research
06/10/2015

Automatic Variational Inference in Stan

Variational inference is a scalable technique for approximate Bayesian i...
research
09/19/2012

Variational Inference in Nonconjugate Models

Mean-field variational methods are widely used for approximate posterior...
research
03/02/2016

Automatic Differentiation Variational Inference

Probabilistic modeling is iterative. A scientist posits a simple model, ...
research
05/18/2018

Model reparametrization for improving variational inference

In this article, we propose a strategy to improve variational Bayes infe...
research
06/17/2020

Categorical Normalizing Flows via Continuous Transformations

Despite their popularity, to date, the application of normalizing flows ...
research
02/01/2020

Deep segmental phonetic posterior-grams based discovery of non-categories in L2 English speech

Second language (L2) speech is often labeled with the native, phone cate...
research
08/30/2022

Bayesian Multinomial Logistic Regression for Numerous Categories

While multinomial logistic regression is a useful tool for classificatio...

Please sign up or login with your details

Forgot password? Click here to reset