Robust k-means Clustering for Distributions with Two Moments

02/06/2020
by   Yegor Klochkov, et al.
0

We consider the robust algorithms for the k-means clustering problem where a quantizer is constructed based on N independent observations. Our main results are median of means based non-asymptotic excess distortion bounds that hold under the two bounded moments assumption in a general separable Hilbert space. In particular, our results extend the renowned asymptotic result of Pollard, 1981 who showed that the existence of two moments is sufficient for strong consistency of an empirically optimal quantizer in R^d. In a special case of clustering in R^d, under two bounded moments, we prove matching (up to constant factors) non-asymptotic upper and lower bounds on the excess distortion, which depend on the probability mass of the lightest cluster of an optimal quantizer. Our bounds have the sub-Gaussian form, and the proofs are based on the versions of uniform bounds for robust mean estimators.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/09/2018

Uniform bounds for robust mean estimators

Median-of-means technique is an elegant and general method for estimatin...
research
01/04/2018

Improved Bounds on Lossless Source Coding and Guessing Moments via Rényi Measures

This paper provides upper and lower bounds on the optimal guessing momen...
research
12/11/2018

Robust Bregman Clustering

Using a trimming approach, we investigate a k-means type method based on...
research
10/27/2021

Uniform Concentration Bounds toward a Unified Framework for Robust Clustering

Recent advances in center-based clustering continue to improve upon the ...
research
08/09/2023

Bounded Distributions place Limits on Skewness and Larger Moments

Distributions of strictly positive numbers are common and can be charact...
research
08/16/2019

Algorithms and Complexity for Functions on General Domains

Error bounds and complexity bounds in numerical analysis and information...
research
08/29/2023

Moments of the number of points in a bounded set for number field lattices

We examine the moments of the number of lattice points in a fixed ball o...

Please sign up or login with your details

Forgot password? Click here to reset