Toward Reliable and Rapid Elasticity for Streaming Dataflows on Clouds

12/02/2017
by   Anshu Shukla, et al.
0

The pervasive availability of streaming data is driving interest in distributed Fast Data platforms for streaming applications. Such latency-sensitive applications need to respond to dynamism in the input rates and task behavior using scale-in and -out on elastic Cloud resources. Platforms like Apache Storm do not provide robust capabilities for responding to such dynamism and for rapid task migration across VMs. We propose several dataflow checkpoint and migration approaches that allow a running streaming dataflow to migrate, without any loss of in-flight messages or their internal tasks states, while reducing the time to recover and stabilize. We implement and evaluate these migration strategies on Apache Storm using micro and application dataflows for scaling in and out on up to 2-21 Azure VMs. Our results show that we can migrate dataflows of large sizes within 50 sec, in comparison to Storm's default approach that takes over 100 sec. We also find that our approaches stabilize the application much earlier and there is no failure and re-processing of messages.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/24/2018

Managing Large-Scale Transient Data in IoT Systems

The pervasive availability of streaming data is driving interest in dist...
research
12/04/2018

Megaphone: Live state migration for distributed streaming dataflows

We design and implement Megaphone, a data migration mechanism for statef...
research
01/14/2021

Checkpoint, Restore, and Live Migration for Science Platforms

We demonstrate a fully functional implementation of (per-user) checkpoin...
research
09/13/2019

Performance Characterization and Modeling of Serverless and HPC Streaming Applications

Experiment-in-the-Loop Computing (EILC) requires support for numerous ty...
research
06/01/2019

Patterns for Blockchain Migration

With the rapid evolution of technological, economic, and regulatory land...
research
06/04/2020

Online Versus Offline Rate in Streaming Codes for Variable-Size Messages

Providing high quality-of-service for live communication is a pervasive ...
research
07/10/2022

Efficient RDF Streaming for the Edge-Cloud Continuum

With the ongoing, gradual shift of large-scale distributed systems towar...

Please sign up or login with your details

Forgot password? Click here to reset