Differentially Private Vertical Federated Clustering

08/02/2022
by   Zitao Li, et al.
3

In many applications, multiple parties have private data regarding the same set of users but on disjoint sets of attributes, and a server wants to leverage the data to train a model. To enable model learning while protecting the privacy of the data subjects, we need vertical federated learning (VFL) techniques, where the data parties share only information for training the model, instead of the private data. However, it is challenging to ensure that the shared information maintains privacy while learning accurate models. To the best of our knowledge, the algorithm proposed in this paper is the first practical solution for differentially private vertical federated k-means clustering, where the server can obtain a set of global centers with a provable differential privacy guarantee. Our algorithm assumes an untrusted central server that aggregates differentially private local centers and membership encodings from local data parties. It builds a weighted grid as the synopsis of the global dataset based on the received information. Final centers are generated by running any k-means algorithm on the weighted grid. Our approach for grid weight estimation uses a novel, light-weight, and differentially private set intersection cardinality estimation algorithm based on the Flajolet-Martin sketch. To improve the estimation accuracy in the setting with more than two data parties, we further propose a refined version of the weights estimation algorithm and a parameter tuning strategy to reduce the final k-means utility to be close to that in the central private setting. We provide theoretical utility analysis and experimental evaluation results for the cluster centers computed by our algorithm and show that our approach performs better both theoretically and empirically than the two baselines based on existing techniques.

READ FULL TEXT

page 4

page 10

page 11

research
06/11/2021

Differentially Private Algorithms for Clustering with Stability Assumptions

We study the problem of differentially private clustering under input-st...
research
05/24/2022

Differentially Private AUC Computation in Vertical Federated Learning

Federated learning has gained great attention recently as a privacy-enha...
research
09/23/2022

Differentially private partitioned variational inference

Learning a privacy-preserving model from distributed sensitive data is a...
research
08/02/2023

Dynamic Privacy Allocation for Locally Differentially Private Federated Learning with Composite Objectives

This paper proposes a locally differentially private federated learning ...
research
09/06/2020

Hybrid Differentially Private Federated Learning on Vertically Partitioned Data

We present HDP-VFL, the first hybrid differentially private (DP) framewo...
research
09/12/2019

Differentially Private Meta-Learning

Parameter-transfer is a well-known and versatile approach for meta-learn...
research
06/25/2020

Towards Differentially Private Text Representations

Most deep learning frameworks require users to pool their local data or ...

Please sign up or login with your details

Forgot password? Click here to reset