Federated Classification in Hyperbolic Spaces via Secure Aggregation of Convex Hulls

08/14/2023
by Saurav Prakash, et al.

Hierarchical and tree-like data sets arise in many applications, including language processing, graph data mining, phylogeny and genomics. It is known that tree-like data cannot be embedded into Euclidean spaces of finite dimension with small distortion. This problem can be mitigated through the use of hyperbolic spaces. When such data also has to be processed in a distributed and privatized setting, it becomes necessary to work with new federated learning methods tailored to hyperbolic spaces. As an initial step towards the development of the field of federated learning in hyperbolic spaces, we propose the first known approach to federated classification in hyperbolic spaces. Our contributions are as follows. First, we develop distributed versions of convex SVM classifiers for Poincaré discs. In this setting, the information conveyed from clients to the global classifier consists of the convex hulls of clusters present in individual client data. Second, to avoid label switching issues, we introduce a number-theoretic approach for label recovery based on the so-called integer B_h sequences. Third, we compute the complexity of the convex hulls in hyperbolic spaces to assess the extent of data leakage; at the same time, in order to limit the communication cost for the hulls, we propose a new quantization method for the Poincaré disc coupled with Reed-Solomon-like encoding. Fourth, at the server level, we introduce a new approach for aggregating the convex hulls of the clients based on balanced graph partitioning. We test our method on a collection of diverse data sets, including hierarchical single-cell RNA-seq data from different patients distributed across different repositories that have stringent privacy constraints. The classification accuracy of our method is up to approximately 11% better than that of its Euclidean counterpart, demonstrating the importance of privacy-preserving learning in hyperbolic spaces.
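To make the role of B_h sequences more concrete, the following is a minimal sketch of the defining property only: a set of integers is a B_h sequence if all sums of h of its elements (repetition allowed) are distinct, so the summands can be recovered from their sum alone. The abstract does not describe the actual label recovery protocol, so everything here is an illustrative assumption: the function names `is_bh_sequence`, `power_bh_sequence` and `recover_summands` are hypothetical, the powers-of-(h+1) construction is just one simple (and not dense) B_h sequence, and the brute-force decoder is a stand-in for whatever decoding the authors use.

```python
from itertools import combinations_with_replacement


def is_bh_sequence(seq, h):
    """Check the B_h property: all sums of h elements of seq
    (repetition allowed, order ignored) are pairwise distinct."""
    seen = set()
    for combo in combinations_with_replacement(seq, h):
        total = sum(combo)
        if total in seen:
            return False
        seen.add(total)
    return True


def power_bh_sequence(n, h):
    """A simple B_h sequence of length n: powers of (h + 1).
    A sum of h such powers has base-(h + 1) digits that count how often
    each power was used, so no carries occur and the sum is unique.
    (Illustrative choice; not necessarily the construction in the paper.)"""
    return [(h + 1) ** i for i in range(n)]


def recover_summands(total, seq, h):
    """Brute-force recovery of the unique multiset of h elements of seq
    summing to total; returns None if no such multiset exists."""
    for combo in combinations_with_replacement(seq, h):
        if sum(combo) == total:
            return list(combo)
    return None


if __name__ == "__main__":
    h = 3  # hypothetical number of values that get summed
    labels = power_bh_sequence(4, h)
    print(labels, is_bh_sequence(labels, h))
    # Three hypothetical clients each report one element; only the sum is seen.
    aggregated = labels[0] + labels[2] + labels[2]
    print(recover_summands(aggregated, labels, h))
```

In a secure-aggregation setting, where a server observes only sums of client contributions, this uniqueness is what would allow the individual contributions to be identified up to ordering; how the paper maps labels onto sequence elements is not stated in the abstract.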

