Federated K-Means Clustering via Dual Decomposition-based Distributed Optimization

07/25/2023
by   Vassilios Yfantis, et al.
0

The use of distributed optimization in machine learning can be motivated either by the resulting preservation of privacy or the increase in computational efficiency. On the one hand, training data might be stored across multiple devices. Training a global model within a network where each node only has access to its confidential data requires the use of distributed algorithms. Even if the data is not confidential, sharing it might be prohibitive due to bandwidth limitations. On the other hand, the ever-increasing amount of available data leads to large-scale machine learning problems. By splitting the training process across multiple nodes its efficiency can be significantly increased. This paper aims to demonstrate how dual decomposition can be applied for distributed training of K-means clustering problems. After an overview of distributed and federated machine learning, the mixed-integer quadratically constrained programming-based formulation of the K-means clustering training problem is presented. The training can be performed in a distributed manner by splitting the data across different nodes and linking these nodes through consensus constraints. Finally, the performance of the subgradient method, the bundle trust method, and the quasi-Newton dual ascent algorithm are evaluated on a set of benchmark problems. While the mixed-integer programming-based formulation of the clustering problems suffers from weak integer relaxations, the presented approach can potentially be used to enable an efficient solution in the future, both in a central and distributed setting.

READ FULL TEXT
research
10/21/2021

Efficient and Robust Mixed-Integer Optimization Methods for Training Binarized Deep Neural Networks

Compared to classical deep neural networks its binarized versions can be...
research
11/04/2021

Mixed-Integer Optimization with Constraint Learning

We establish a broad methodological foundation for mixed-integer optimiz...
research
02/20/2023

A novel dual-decomposition method based on p-Lagrangian relaxation

In this paper, we propose the novel p-branch-and-bound method for solvin...
research
08/17/2020

WAFFLE: Watermarking in Federated Learning

Creators of machine learning models can use watermarking as a technique ...
research
01/20/2015

Distributed Data Association in Smart Camera Networks via Dual Decomposition

One of the fundamental requirements for visual surveillance using smart ...
research
05/22/2023

When Computing Power Network Meets Distributed Machine Learning: An Efficient Federated Split Learning Framework

In this paper, we advocate CPN-FedSL, a novel and flexible Federated Spl...
research
06/14/2023

Integrating machine learning paradigms and mixed-integer model predictive control for irrigation scheduling

The agricultural sector currently faces significant challenges in water ...

Please sign up or login with your details

Forgot password? Click here to reset