Evaluating Sensitivity to the Stick Breaking Prior in Bayesian Nonparametrics

by Runjing Liu, et al.

A central question in many probabilistic clustering problems is how many distinct clusters are present in a particular dataset. A Bayesian nonparametric (BNP) model addresses this question by placing a generative process on cluster assignment. However, like all Bayesian approaches, BNP requires the specification of a prior. In practice, it is important to quantitatively establish that the prior is not too informative, particularly when the particular form of the prior is chosen for mathematical convenience rather than because of a considered subjective belief. We derive local sensitivity measures for a truncated variational Bayes (VB) approximation and approximate nonlinear dependence of a VB optimum on prior parameters using a local Taylor series approximation. Using a stick-breaking representation of a Dirichlet process, we consider perturbations both to the scalar concentration parameter and to the functional form of the stick-breaking distribution. Unlike previous work on local Bayesian sensitivity for BNP, we pay special attention to the ability of our sensitivity measures to extrapolate to different priors, rather than treating the sensitivity as a measure of robustness per se. Extrapolation motivates the use of multiplicative perturbations to the functional form of the prior for VB. Additionally, we linearly approximate only the computationally intensive part of inference -- the optimization of the global parameters -- and retain the nonlinearity of easily computed quantities as functions of the global parameters. We apply our methods to estimate sensitivity of the expected number of distinct clusters present in the Iris dataset to the BNP prior specification. We evaluate the accuracy of our approximations by comparing to the much more expensive process of re-fitting the model.
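As a rough illustration of the ideas in the abstract, the sketch below works with the Dirichlet process prior alone, not with the paper's VB posterior. It draws truncated stick-breaking weights with Beta(1, alpha) sticks, computes the standard prior expected number of distinct clusters among n draws, sum over i = 0..n-1 of alpha / (alpha + i), and compares a first-order Taylor extrapolation in the concentration parameter alpha against exact recomputation. All function names are hypothetical and chosen for this example; this is an assumption-laden toy, not the authors' implementation.

```python
import numpy as np


def stick_breaking_weights(alpha, truncation, rng):
    """Draw truncated stick-breaking weights with Beta(1, alpha) sticks.

    Hypothetical helper for illustration; the last stick is set to 1 so
    that the truncated weights sum to one.
    """
    sticks = rng.beta(1.0, alpha, size=truncation)
    sticks[-1] = 1.0
    # Mass remaining before each break: 1, (1 - v_1), (1 - v_1)(1 - v_2), ...
    remaining = np.concatenate(([1.0], np.cumprod(1.0 - sticks[:-1])))
    return sticks * remaining


def prior_expected_clusters(alpha, n_obs):
    """Prior expected number of distinct clusters among n_obs DP draws."""
    i = np.arange(n_obs)
    return float(np.sum(alpha / (alpha + i)))


def prior_expected_clusters_grad(alpha, n_obs):
    """Derivative of prior_expected_clusters with respect to alpha."""
    i = np.arange(n_obs)
    return float(np.sum(i / (alpha + i) ** 2))


def linear_extrapolation(alpha0, alpha1, n_obs):
    """First-order Taylor approximation around alpha0, evaluated at alpha1."""
    base = prior_expected_clusters(alpha0, n_obs)
    slope = prior_expected_clusters_grad(alpha0, n_obs)
    return base + slope * (alpha1 - alpha0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    weights = stick_breaking_weights(alpha=2.0, truncation=20, rng=rng)
    print("weights sum:", weights.sum())

    n_obs = 150  # the Iris dataset has 150 observations
    for alpha1 in (2.5, 4.0, 8.0):
        exact = prior_expected_clusters(alpha1, n_obs)
        approx = linear_extrapolation(2.0, alpha1, n_obs)
        print(f"alpha={alpha1}: exact={exact:.3f}, linear={approx:.3f}")
```

In this toy setting the exact value is cheap, so the comparison only shows how the quality of a linear extrapolation degrades as alpha moves away from the expansion point. In the setting the abstract describes, the exact value would require re-fitting the model, which is what makes the local approximation attractive.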




On the Locally Lipschitz Robustness of Bayesian Inverse Problems

In this note we consider the robustness of posterior measures occurring i...

Bayesian sensitivity analysis for a missing data model

In causal inference, sensitivity analysis is important to assess the rob...

Powered Dirichlet Process for Controlling the Importance of "Rich-Get-Richer" Prior Assumptions in Bayesian Clustering

One of the most used priors in Bayesian clustering is the Dirichlet prio...

BNPdensity: Bayesian nonparametric mixture modeling in R

Robust statistical data modelling under potential model mis-specificatio...

Duality between Approximate Bayesian Methods and Prior Robustness

In this paper we show that there is a link between approximate Bayesian ...

Asymptotic behavior of the number of distinct values in a sample from the geometric stick-breaking process

Discrete random probability measures are a key ingredient of Bayesian no...

A Bayesian computer model analysis of Robust Bayesian analyses

We harness the power of Bayesian emulation techniques, designed to aid t...
