No DNN Left Behind: Improving Inference in the Cloud with Multi-Tenancy

01/21/2019
by   Amit Samanta, et al.
0

With the rise of machine learning, inference on deep neural networks (DNNs) has become a core building block on the critical path for many cloud applications. Applications today rely on isolated ad-hoc deployments that force users to compromise on consistent latency, elasticity, or cost-efficiency, depending on workload characteristics. We propose to elevate DNN inference to be a first class cloud primitive provided by a shared multi-tenant system, akin to cloud storage, and cloud databases. A shared system enables cost-efficient operation with consistent performance across the full spectrum of workloads. We argue that DNN inference is an ideal candidate for a multi-tenant system because of its narrow and well-defined interface and predictable resource requirements.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/03/2022

iGniter: Interference-Aware GPU Resource Provisioning for Predictable DNN Inference in the Cloud

GPUs are essential to accelerating the latency-sensitive deep neural net...
research
05/19/2022

Multi-DNN Accelerators for Next-Generation AI Systems

As the use of AI-powered applications widens across multiple domains, so...
research
11/24/2018

TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep LearningInference in Function as a Service Environments

Deep neural networks (DNNs) have become core computation components with...
research
09/03/2019

Guardians of the Deep Fog: Failure-Resilient DNN Inference from Edge to Cloud

Partitioning and distributing deep neural networks (DNNs) over physical ...
research
12/11/2020

Analyzing the Performance of Smart Industry 4.0 Applications on Cloud Computing Systems

Cloud-based Deep Neural Network (DNN) applications that make latency-sen...
research
05/08/2021

Optimising Resource Management for Embedded Machine Learning

Machine learning inference is increasingly being executed locally on mob...
research
08/18/2023

Cost-Intelligent Data Analytics in the Cloud

For decades, database research has focused on optimizing performance und...

Please sign up or login with your details

Forgot password? Click here to reset