Consensus Monte Carlo for Random Subsets using Shared Anchors

06/28/2019
by   Yang Ni, et al.
3

We present a consensus Monte Carlo algorithm that scales existing Bayesian nonparametric models for clustering and feature allocation to big data. The algorithm is valid for any prior on random subsets such as partitions and latent feature allocation, under essentially any sampling model. Motivated by three case studies, we focus on clustering induced by a Dirichlet process mixture sampling model, inference under an Indian buffet process prior with a binomial sampling model, and with a categorical sampling model. We assess the proposed algorithm with simulation studies and show results for inference with three datasets: an MNIST image dataset, a dataset of pancreatic cancer mutations, and a large set of electronic health records (EHR). Supplementary materials for this article are available online.

READ FULL TEXT

page 15

page 21

page 24

research
06/07/2018

Scalable Bayesian Nonparametric Clustering and Classification

We develop a scalable multi-step Monte Carlo algorithm for inference und...
research
07/04/2019

Bayesian Heterogeneity Pursuit Regression Models for Spatially Dependent Data

Most existing spatial clustering literatures discussed the cluster algor...
research
09/04/2018

Bayesian Double Feature Allocation for Phenotyping with Electronic Health Records

We propose a categorical matrix factorization method to infer latent dis...
research
08/12/2017

Bayesian Non-Exhaustive Classification for Active Online Name Disambiguation

The name disambiguation task partitions a collection of records pertaini...
research
02/08/2021

Large-data determinantal clustering

Determinantal consensus clustering is a promising and attractive alterna...
research
02/07/2021

Determinantal consensus clustering

Random restart of a given algorithm produces many partitions to yield a ...
research
10/01/2013

Summary Statistics for Partitionings and Feature Allocations

Infinite mixture models are commonly used for clustering. One can sample...

Please sign up or login with your details

Forgot password? Click here to reset