Distributed intelligence on the Edge-to-Cloud Continuum: A systematic literature review

04/29/2022
by   Daniel Rosendo, et al.
6

The explosion of data volumes generated by an increasing number of applications is strongly impacting the evolution of distributed digital infrastructures for data analytics and machine learning (ML). While data analytics used to be mainly performed on cloud infrastructures, the rapid development of IoT infrastructures and the requirements for low-latency, secure processing has motivated the development of edge analytics. Today, to balance various trade-offs, ML-based analytics tends to increasingly leverage an interconnected ecosystem that allows complex applications to be executed on hybrid infrastructures where IoT Edge devices are interconnected to Cloud/HPC systems in what is called the Computing Continuum, the Digital Continuum, or the Transcontinuum.Enabling learning-based analytics on such complex infrastructures is challenging. The large scale and optimized deployment of learning-based workflows across the Edge-to-Cloud Continuum requires extensive and reproducible experimental analysis of the application execution on representative testbeds. This is necessary to help understand the performance trade-offs that result from combining a variety of learning paradigms and supportive frameworks. A thorough experimental analysis requires the assessment of the impact of multiple factors, such as: model accuracy, training time, network overhead, energy consumption, processing latency, among others.This review aims at providing a comprehensive vision of the main state-of-the-art libraries and frameworks for machine learning and data analytics available today. It describes the main learning paradigms enabling learning-based analytics on the Edge-to-Cloud Continuum. The main simulation, emulation, deployment systems, and testbeds for experimental research on the Edge-to-Cloud Continuum available today are also surveyed. Furthermore, we analyze how the selected systems provide support for experiment reproducibility. We conclude our review with a detailed discussion of relevant open research challenges and of future directions in this domain such as: holistic understanding of performance; performance optimization of applications;efficient deployment of Artificial Intelligence (AI) workflows on highly heterogeneous infrastructures; and reproducible analysis of experiments on the Computing Continuum.

READ FULL TEXT

page 4

page 9

page 10

page 16

page 17

page 20

page 21

page 22

research
07/26/2019

ServerMix: Tradeoffs and Challenges of Serverless Data Analytics

Serverless computing has become very popular today since it largely simp...
research
04/03/2019

Stratum: A Serverless Framework for Lifecycle Management of Machine Learning based Data Analytics Tasks

With the proliferation of machine learning (ML) libraries and frameworks...
research
08/04/2022

Edge-centric Optimization of Multi-modal ML-driven eHealth Applications

Smart eHealth applications deliver personalized and preventive digital h...
research
09/03/2021

Enabling Reproducible Analysis of Complex Workflows on the Edge-to-Cloud Continuum

Distributed digital infrastructures for computation and analytics are no...
research
04/14/2018

Data Analytics Service Composition and Deployment on Edge Devices

Data analytics on edge devices has gained rapid growth in research, indu...
research
08/04/2021

Reproducible Performance Optimization of Complex Applications on the Edge-to-Cloud Continuum

In more and more application areas, we are witnessing the emergence of c...
research
07/24/2023

KheOps: Cost-effective Repeatability, Reproducibility, and Replicability of Edge-to-Cloud Experiments

Distributed infrastructures for computation and analytics are now evolvi...

Please sign up or login with your details

Forgot password? Click here to reset