To Migrate or not to Migrate: An Analysis of Operator Migration in Distributed Stream Processing

by   Espen Volnes, et al.

One of the most important issues in data stream processing systems is to use operator migration to handle highly variable workloads in a cost-efficient manner and adapt to the needs at any given time on demand. Operator migration is a complex process that involves changes in the state and stream management of a running query, typically without any loss of data, and with as little disruption to the execution as possible. This survey provides an overview of solutions for operator migration from a historical perspective as well as the perspective of the goal of migration. It introduces a conceptual model of operator migration to establish a unified terminology and classify existing solutions. Existing work in the area is analyzed to separate the mechanism of migration from the decision to migrate the data. In case of the latter, a cost-benefit analysis is emphasized that is important for operator migration but is often only implicitly addressed, or is neglected altogether. A description of the available solutions provides the reader with a good understanding of the design alternatives from an algorithmic viewpoint. We complement this with an empirical study to provide quantitative insights on the impact of different design alternatives on the mechanisms of migration.


page 1

page 2

page 3

page 4


Challenges in migrating legacy software systems to the cloud an empirical study

Moving existing legacy systems to cloud platforms is a difficult and hig...

Megaphone: Live state migration for distributed streaming dataflows

We design and implement Megaphone, a data migration mechanism for statef...

Cloud Migration Process A Survey Evaluation Framework and Open Challenges

Moving mission-oriented enterprise applications to cloud environments is...

On PQC Migration and Crypto-Agility

Besides the development of PQC algorithms, the actual migration of IT sy...

Rational Agent-Based Decision Algorithm for Strategic Converged Network Migration Planning

To keep up with constantly growing user demands for services with higher...

PASCAL: A Learning-aided Cooperative Bandwidth Control Policy for Hierarchical Storage Systems

Nowadays, the Hierarchical Storage System (HSS) is considered as an idea...

Seismic Imaging: An Overview and Parallel Implementation of Poststack Depth Migration

Seismic migration is the core step of seismic data processing which is i...

Please sign up or login with your details

Forgot password? Click here to reset