Search Algorithms and Loss Functions for Bayesian Clustering

05/10/2021
by   David B. Dahl, et al.
0

We propose a randomized greedy search algorithm to find a point estimate for a random partition based on a loss function and posterior Monte Carlo samples. Given the large size and awkward discrete nature of the search space, the minimization of the posterior expected loss is challenging. Our approach is a stochastic search based on a series of greedy optimizations performed in a random order and is embarrassingly parallel. We consider several loss functions, including Binder loss and variation of information. We note that criticisms of Binder loss are the result of using equal penalties of misclassification and we show an efficient means to compute Binder loss with potentially unequal penalties. Furthermore, we extend the original variation of information to allow for unequal penalties and show no increased computational costs. We provide a reference implementation of our algorithm. Using a variety of examples, we show that our method produces clustering estimates that better minimize the expected loss and are obtained faster than existing methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/27/2022

Comparison and Bayesian Estimation of Feature Allocations

Feature allocation models postulate a sampling distribution whose parame...
research
08/19/2021

Evaluating Multiple Guesses by an Adversary via a Tunable Loss Function

We consider a problem of guessing, wherein an adversary is interested in...
research
05/17/2019

AM-LFS: AutoML for Loss Function Search

Designing an effective loss function plays an important role in visual a...
research
01/30/2013

Decision Theoretic Foundations of Graphical Model Selection

This paper describes a decision theoretic formulation of learning the gr...
research
04/10/2022

Real order total variation with applications to the loss functions in learning schemes

Loss function are an essential part in modern data-driven approach, such...
research
07/18/2019

Minimizing the expected value of the asymmetric loss and an inequality of the variance of the loss

For some estimations and predictions, we solve minimization problems wit...
research
07/12/2021

Nonlinear Least Squares for Large-Scale Machine Learning using Stochastic Jacobian Estimates

For large nonlinear least squares loss functions in machine learning we ...

Please sign up or login with your details

Forgot password? Click here to reset