Performance considerations on execution of large scale workflow applications on cloud functions

09/08/2019
by   Maciej Pawlik, et al.
0

Function-as-a-Service is a novel type of cloud service used for creating distributed applications and utilizing computing resources. Application developer supplies source code of cloud functions, which are small applications or application components, while the service provider is responsible for provisioning the infrastructure, scaling and exposing a REST style API. This environment seems to be adequate for running scientific workflows, which in recent years, have become an established paradigm for implementing and preserving complex scientific processes. In this paper, we present work done on evaluating three major FaaS providers (Amazon, Google, IBM) as a platform for running scientific workflows. The experiments were performed with a dedicated benchmarking framework, which consisted of instrumented workflow execution engine. The testing load was implemented as a large scale bag-of-tasks style workflow, where task count reached 5120 running in parallel. The studied parameters include raw performance, efficiency of infrastructure provisioning, overhead introduced by the API and network layers, as well as aspects related to run time accounting. Conclusions include insights into available performance, expressed as raw GFlops values and charts depicting relation of performance to function size. The infrastructure provisioning proved to be governed by parallelism and rate limits, which can be deducted from included charts. The overhead imposed by using a REST API proved to be a significant contribution to overall run time of individual tasks, and possibly the whole workflow. The paper ends with pointing out possible future work, which includes building performance models and designing a dedicated scheduling algorithms for running scientific workflows on FaaS.

READ FULL TEXT
research
10/21/2020

Serverless Containers – rising viable approach to Scientific Workflows

Increasing popularity of the serverless computing approach has led to th...
research
07/30/2018

Comparison of Production Serverless Function Orchestration Systems

Since the appearance of Amazon Lambda in 2014, all major cloud providers...
research
06/15/2020

Triggerflow: Trigger-based Orchestration of Serverless Workflows

As more applications are being moved to the Cloud thanks to serverless c...
research
06/02/2020

Workflow-as-a-Service Cloud Platform and Deployment of Bioinformatics Workflow Applications

Workflow management systems (WMS) support the composition and deployment...
research
01/01/2020

Ripple: A Practical Declarative Programming Framework for Serverless Compute

Serverless computing has emerged as a promising alternative to infrastru...
research
02/19/2022

Combining Node-RED and Openwhisk for Pattern-based Development and Execution of Complex FaaS Workflows

Modern cloud computing advances have been pressing application moderniza...

Please sign up or login with your details

Forgot password? Click here to reset