Serving deep learning models in a serverless platform

10/23/2017
by   Vatche Ishakian, et al.
0

Serverless computing has emerged as a compelling paradigm for the development and deployment of a wide range of event based cloud applications. At the same time, cloud providers and enterprise companies are heavily adopting machine learning and Artificial Intelligence to either differentiate themselves, or provide their customers with value added services. In this work we evaluate the suitability of a serverless computing environment for the inferencing of large neural network models. Our experimental evaluations are executed on the AWS Lambda environment using the MxNet deep learning framework. Our experimental results show that while the inferencing latency can be within an acceptable range, longer delays due to cold starts can skew the latency distribution and hence risk violating more stringent SLAs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/24/2021

Efficient Low-Latency Dynamic Licensing for Deep Neural Network Deployment on Edge Devices

Along with the rapid development in the field of artificial intelligence...
research
05/02/2021

Deployment Archetypes for Cloud Applications

This is a survey paper that explores six Cloud-based deployment archetyp...
research
06/09/2021

Cocktail: Leveraging Ensemble Learning for Optimized Model Serving in Public Cloud

With a growing demand for adopting ML models for a varietyof application...
research
04/29/2021

The Hidden cost of the Edge: A Performance Comparison of Edge and Cloud Latencies

Edge computing has emerged as a popular paradigm for running latency-sen...
research
06/14/2023

AiXpand AI OS – Decentralized ubiquitous computing MLOps execution engine

Over the past few years, ubiquitous, or pervasive computing has gained p...
research
06/01/2023

Characterizing the Cloud's Outbound Network Latency: An Experimental and Modeling Study

Cloud latency has critical influences on the success of cloud applicatio...
research
01/14/2022

Wide Area Network Intelligence with Application to Multimedia Service

Network intelligence is a discipline that builds on the capabilities of ...

Please sign up or login with your details

Forgot password? Click here to reset