Robust Bregman Clustering

12/11/2018
by   Claire Brécheteau, et al.
0

Using a trimming approach, we investigate a k-means type method based on Bregman divergences for clustering data possibly corrupted with clutter noise. The main interest of Bregman divergences is that the standard Lloyd algorithm adapts to these distortion measures, and they are well-suited for clustering data sampled according to mixture models from exponential families. We prove that there exists an optimal codebook, and that an empirically optimal codebook converges a.s. to an optimal codebook in the distortion sense. Moreover, we obtain the sub-Gaussian rate of convergence for k-means 1 √() n under mild tail assumptions. Also, we derive a Lloyd-type algorithm with a trimming parameter that can be selected from data according to some heuristic, and present some experimental results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/30/2017

The Informativeness of k-Means and Dimensionality Reduction for Learning Mixture Models

The learning of mixture models can be viewed as a clustering problem. In...
research
02/06/2020

Robust k-means Clustering for Distributions with Two Moments

We consider the robust algorithms for the k-means clustering problem whe...
research
02/22/2016

Clustering subgaussian mixtures by semidefinite programming

We introduce a model-free relax-and-round algorithm for k-means clusteri...
research
08/21/2015

Strong Coresets for Hard and Soft Bregman Clustering with Applications to Exponential Family Mixtures

Coresets are efficient representations of data sets such that models tra...
research
09/16/2020

Clustering Data with Nonignorable Missingness using Semi-Parametric Mixture Models

We are concerned in clustering continuous data sets subject to nonignora...
research
11/08/2018

Performance of Johnson-Lindenstrauss Transform for k-Means and k-Medians Clustering

Consider an instance of Euclidean k-means or k-medians clustering. We sh...
research
10/27/2021

Uniform Concentration Bounds toward a Unified Framework for Robust Clustering

Recent advances in center-based clustering continue to improve upon the ...

Please sign up or login with your details

Forgot password? Click here to reset