Identifiability of Nonparametric Mixture Models and Bayes Optimal Clustering

02/12/2018
by   Bryon Aragam, et al.
0

Motivated by problems in data clustering, we establish general conditions under which families of nonparametric mixture models are identifiable by introducing a novel framework for clustering overfitted parametric (i.e. misspecified) mixture models. These conditions generalize existing conditions in the literature, and are flexible enough to include for example mixtures of Gaussian mixtures. In contrast to the recent literature on estimating nonparametric mixtures, we allow for general nonparametric mixture components, and instead impose regularity assumptions on the underlying mixing measure. As our primary application, we apply these results to partition-based clustering, generalizing the well-known notion of a Bayes optimal partition from classical model-based clustering to nonparametric settings. Furthermore, this framework is constructive in that it yields a practical algorithm for learning identified mixtures, which is illustrated through several examples. The key conceptual device in the analysis is the convex, metric geometry of probability distributions on metric spaces and its connection to optimal transport and the Wasserstein convergence of mixing measures. The result is a flexible framework for nonparametric clustering with formal consistency guarantees.

READ FULL TEXT
research
01/14/2015

Dirichlet Process Parsimonious Mixtures for clustering

The parsimonious Gaussian mixture models, which exploit an eigenvalue de...
research
02/27/2019

Quasi-Bayes properties of a recursive procedure for mixtures

Bayesian methods are attractive and often optimal, yet nowadays pressure...
research
12/04/2021

Nonparametric mixture MLEs under Gaussian-smoothed optimal transport distance

The Gaussian-smoothed optimal transport (GOT) framework, pioneered in Go...
research
08/31/2021

Uniform Consistency in Nonparametric Mixture Models

We study uniform consistency in nonparametric mixture models as well as ...
research
10/18/2021

Recovery Guarantees for Kernel-based Clustering under Non-parametric Mixture Models

Despite the ubiquity of kernel-based clustering, surprisingly few statis...
research
09/15/2023

Augmented quantization: a general approach to mixture models

The investigation of mixture models is a key to understand and visualize...
research
05/25/2019

Combining mixture models with linear mixing updates: multilayer image segmentation and synthesis

Finite mixture models for clustering can often be improved by adding a r...

Please sign up or login with your details

Forgot password? Click here to reset