Energy-efficient Training of Distributed DNNs in the Mobile-edge-cloud Continuum

02/23/2022
by   Francesco Malandrino, et al.
0

We address distributed machine learning in multi-tier (e.g., mobile-edge-cloud) networks where a heterogeneous set of nodes cooperate to perform a learning task. Due to the presence of multiple data sources and computation-capable nodes, a learning controller (e.g., located in the edge) has to make decisions about (i) which distributed ML model structure to select, (ii) which data should be used for the ML model training, and (iii) which resources should be allocated to it. Since these decisions deeply influence one another, they should be made jointly. In this paper, we envision a new approach to distributed learning in multi-tier networks, which aims at maximizing ML efficiency. To this end, we propose a solution concept, called RightTrain, that achieves energy-efficient ML model training, while fulfilling learning time and quality requirements. RightTrain makes high-quality decisions in polynomial time. Further, our performance evaluation shows that RightTrain closely matches the optimum and outperforms the state of the art by over 50

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/02/2022

Matching DNN Compression and Cooperative Training with Resources and Data Availability

To make machine learning (ML) sustainable and apt to run on the diverse ...
research
02/05/2021

Network Support for High-performance Distributed Machine Learning

The traditional approach to distributed machine learning is to adapt lea...
research
07/23/2019

An Optimization-enhanced MANO for Energy-efficient 5G Networks

5G network nodes, fronthaul and backhaul alike, will have both forwardin...
research
01/25/2021

Cloud, Fog or Edge: Where to Compute?

The computing continuum extends the high-performance cloud data centers ...
research
04/20/2022

fairDMS: Rapid Model Training by Data and Model Reuse

Extracting actionable information from data sources such as the Linac Co...
research
03/12/2023

Scavenger: A Cloud Service for Optimizing Cost and Performance of ML Training

While the pay-as-you-go nature of cloud virtual machines (VMs) makes it ...
research
01/12/2020

Private and Communication-Efficient Edge Learning: A Sparse Differential Gaussian-Masking Distributed SGD Approach

With rise of machine learning (ML) and the proliferation of smart mobile...

Please sign up or login with your details

Forgot password? Click here to reset