Flexible Mixture Modeling on Constrained Spaces

09/24/2018
by   Putu Ayu Sudyanti, et al.
2

This paper addresses challenges in flexibly modeling multimodal data that lie on constrained spaces. Applications include climate or crime measurements in a geographical area, or flow-cytometry experiments, where unsuitable recordings are discarded. A simple approach to modeling such data is through the use of mixture models, with each component following an appropriate truncated distribution. Problems arise when the truncation involves complicated constraints, leading to difficulties in specifying the component distributions, and in evaluating their normalization constants. Bayesian inference over the parameters of these models results in posterior distributions that are doubly-intractable. We address this problem via an algorithm based on rejection sampling and data augmentation. We view samples from a truncated distribution as outcomes of a rejection sampling scheme, where proposals are made from a simple mixture model, and are rejected if they violate the constraints. Our scheme proceeds by imputing the rejected samples given mixture parameters, and then resampling parameters given all samples. We study two modeling approaches: mixtures of truncated components and truncated mixtures of components. In both situations, we describe exact Markov chain Monte Carlo sampling algorithms, as well as approximations that bound the number of rejected samples, achieving computational efficiency and lower variance at the cost of asymptotic bias. Overall, our methodology only requires practitioners to provide an indicator function for the set of interest. We present results on simulated data and apply our algorithm to two problems, one involving flow-cytometry data, and the other, crime recorded in the city of Chicago.

READ FULL TEXT

page 12

page 14

research
12/01/2021

An adaptive mixture-population Monte Carlo method for likelihood-free inference

This paper focuses on variational inference with intractable likelihood ...
research
09/26/2022

Sampling Constrained Continuous Probability Distributions: A Review

The problem of sampling constrained continuous distributions has frequen...
research
02/08/2019

Scalable Nonparametric Sampling from Multimodal Posteriors with the Posterior Bootstrap

Increasingly complex datasets pose a number of challenges for Bayesian i...
research
08/16/2022

Semiparametric imputation using latent sparse conditional Gaussian mixtures for multivariate mixed outcomes

This paper proposes a flexible Bayesian approach to multiple imputation ...
research
04/17/2017

Mixture modeling on related samples by ψ-stick breaking and kernel perturbation

There has been great interest recently in applying nonparametric kernel ...
research
06/19/2015

Sampling constrained probability distributions using Spherical Augmentation

Statistical models with constrained probability distributions are abunda...
research
05/21/2018

Anchored Bayesian Gaussian Mixture Models

Finite Gaussian mixtures are a flexible modeling tool for irregularly sh...

Please sign up or login with your details

Forgot password? Click here to reset