StreamBed: capacity planning for stream processing

09/06/2023
by   Guillaume Rosinosky, et al.
0

StreamBed is a capacity planning system for stream processing. It predicts, ahead of any production deployment, the resources that a query will require to process an incoming data rate sustainably, and the appropriate configuration of these resources. StreamBed builds a capacity planning model by piloting a series of runs of the target query in a small-scale, controlled testbed. We implement StreamBed for the popular Flink DSP engine. Our evaluation with large-scale queries of the Nexmark benchmark demonstrates that StreamBed can effectively and accurately predict capacity requirements for jobs spanning more than 1,000 cores using a testbed of only 48 cores.

READ FULL TEXT
research
12/22/2018

Trevor: Automatic configuration and scaling of stream processing pipelines

Operating a distributed data stream processing workload efficiently at s...
research
08/07/2021

Building Analytics Pipelines for Querying Big Streams and Data Histories with H-STREAM

This paper introduces H-STREAM, a big stream/data processing pipelines e...
research
05/14/2021

3.5 GHz Coverage Assessment with a 5G Testbed

Today, cellular networks have saturated frequencies below 3 GHz. Because...
research
06/15/2019

Query and Resource Optimizations: A Case for Breaking the Wall in Big Data Systems

Modern big data systems run on cloud environments where resources are sh...
research
01/16/2020

Hardware-Conscious Stream Processing: A Survey

Data stream processing systems (DSPSs) enable users to express and run s...
research
04/12/2022

How to design a network architecture using capacity planning

Building a network architecture must answer to organization needs, but a...
research
05/31/2022

What Can Database Query Processing Do for Instance-Spanning Constraints?

In the last decade, the term instance-spanning constraint has been intro...

Please sign up or login with your details

Forgot password? Click here to reset