Replicable Clustering

02/20/2023
by   Hossein Esfandiari, et al.
0

In this paper, we design replicable algorithms in the context of statistical clustering under the recently introduced notion of replicability. A clustering algorithm is replicable if, with high probability, it outputs the exact same clusters after two executions with datasets drawn from the same distribution when its internal randomness is shared across the executions. We propose such algorithms for the statistical k-medians, statistical k-means, and statistical k-centers problems by utilizing approximation routines for their combinatorial counterparts in a black-box manner. In particular, we demonstrate a replicable O(1)-approximation algorithm for statistical Euclidean k-medians (k-means) with poly(d) sample complexity. We also describe a O(1)-approximation algorithm with an additional O(1)-additive error for statistical Euclidean k-centers, albeit with exp(d) sample complexity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/18/2021

No-Substitution k-means Clustering with Low Center Complexity and Memory

Clustering is a fundamental task in machine learning. Given a dataset X ...
research
06/19/2015

Representation Learning for Clustering: A Statistical Framework

We address the problem of communicating domain knowledge from a user to ...
research
08/27/2020

Differentially Private Clustering via Maximum Coverage

This paper studies the problem of clustering in metric spaces while pres...
research
12/13/2021

Optimal Fully Dynamic k-Centers Clustering

We present the first algorithm for fully dynamic k-centers clustering in...
research
12/28/2020

No-substitution k-means Clustering with Adversarial Order

We investigate k-means clustering in the online no-substitution setting ...
research
03/15/2021

Promise Problems Meet Pseudodeterminism

The Acceptance Probability Estimation Problem (APEP) is to additively ap...
research
02/15/2022

On the Role of Channel Capacity in Learning Gaussian Mixture Models

This paper studies the sample complexity of learning the k unknown cente...

Please sign up or login with your details

Forgot password? Click here to reset