ServerMix: Tradeoffs and Challenges of Serverless Data Analytics

by   Pedro García López, et al.

Serverless computing has become very popular today since it largely simplifies cloud programming. Developers do not need to longer worry about provisioning or operating servers, and they pay only for the compute resources used when their code is run. This new cloud paradigm suits well for many applications, and researchers have already begun investigating the feasibility of serverless computing for data analytics. Unfortunately, today's serverless computing presents important limitations that make it really difficult to support all sorts of analytics workloads. This paper first starts by analyzing three fundamental trade-offs of today's serverless computing model and their relationship with data analytics. It studies how by relaxing disaggregation, isolation, and simple scheduling, it is possible to increase the overall computing performance, but at the expense of essential aspects of the model such as elasticity, security, or sub-second activations, respectively. The consequence of these trade-offs is that analytics applications may well end up embracing hybrid systems composed of serverless and serverful components, which we call Servermix in this paper. We will review the existing related work to show that most applications can be actually categorized as Servermix. Finally, this paper will introduce the major challenges of the CloudButton research project to manage these trade-offs.


page 1

page 2

page 3

page 4


Distributed intelligence on the Edge-to-Cloud Continuum: A systematic literature review

The explosion of data volumes generated by an increasing number of appli...

Cold Storage Data Archives: More Than Just a Bunch of Tapes

The abundance of available sensor and derived data from large scientific...

Cost-Intelligent Data Analytics in the Cloud

For decades, database research has focused on optimizing performance und...

SToN: A New Fundamental Trade-off for Distributed Data Storage Systems

Locating data efficiently is a key process in every distributed data sto...

Lambada: Interactive Data Analytics on Cold Data using Serverless Cloud Infrastructure

The promise of ultimate elasticity and operational simplicity of serverl...

DBOS: A Proposal for a Data-Centric Operating System

Current operating systems are complex systems that were designed before ...

High Bandwidth Memory on FPGAs: A Data Analytics Perspective

FPGA-based data processing in datacenters is increasing in popularity du...

Please sign up or login with your details

Forgot password? Click here to reset