Differentially-Private Hierarchical Clustering with Provable Approximation Guarantees

01/31/2023
by   Jacob Imola, et al.
0

Hierarchical Clustering is a popular unsupervised machine learning method with decades of history and numerous applications. We initiate the study of differentially private approximation algorithms for hierarchical clustering under the rigorous framework introduced by (Dasgupta, 2016). We show strong lower bounds for the problem: that any ϵ-DP algorithm must exhibit O(|V|^2/ ϵ)-additive error for an input dataset V. Then, we exhibit a polynomial-time approximation algorithm with O(|V|^2.5/ ϵ)-additive error, and an exponential-time algorithm that meets the lower bound. To overcome the lower bound, we focus on the stochastic block model, a popular model of graphs, and, with a separation assumption on the blocks, propose a private 1+o(1) approximation algorithm which also recovers the blocks exactly. Finally, we perform an empirical study of our algorithms and validate their performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/17/2021

Differentially Private Correlation Clustering

Correlation clustering is a widely used technique in unsupervised machin...
research
09/28/2020

A note on differentially private clustering with large additive error

In this note, we describe a simple approach to obtain a differentially p...
research
07/07/2023

Differential Privacy for Clustering Under Continual Observation

We consider the problem of clustering privately a dataset in ℝ^d that un...
research
08/18/2020

Differentially Private Clustering: Tight Approximation Ratios

We study the task of differentially private clustering. For several basi...
research
06/01/2021

Differentially Private Densest Subgraph

Given a graph, the densest subgraph problem asks for a set of vertices s...
research
07/04/2019

Locally Private k-Means Clustering

We design a new algorithm for the Euclidean k-means problem that operate...
research
05/22/2019

An Optimal Private Stochastic-MAB Algorithm Based on an Optimal Private Stopping Rule

We present a provably optimal differentially private algorithm for the s...

Please sign up or login with your details

Forgot password? Click here to reset