Collage Inference: Tolerating Stragglers in Distributed Neural Network Inference using Coding

04/27/2019
by   Krishna Giri Narra, et al.
0

MLaaS (ML-as-a-Service) offerings by cloud computing platforms are becoming increasingly popular these days. Pre-trained machine learning models are deployed on the cloud to support prediction based applications and services. For achieving higher throughput, incoming requests are served by running multiple replicas of the model on different machines concurrently. Incidence of straggler nodes in distributed inference is a significant concern since it can increase inference latency, violate SLOs of the service. In this paper, we propose a novel coded inference model to deal with stragglers in distributed image classification. We propose modified single shot object detection models, Collage-CNN models, to provide necessary resilience efficiently. A Collage-CNN model takes collage images formed by combining multiple images as its input and performs multi-image classification in one shot. We generate custom training collages using images from standard image classification datasets and train the model to achieve high classification accuracy. Deploying the Collage-CNN models in the cloud, we demonstrate that the 99th percentile latency can be reduced by 1.45X to 2.46X compared to replication based approaches and without compromising prediction accuracy.

READ FULL TEXT

page 7

page 9

research
06/05/2019

Collage Inference: Achieving low tail latency during distributed image classification using coded redundancy models

Reducing the latency variance in machine learning inference is a key req...
research
05/02/2019

Parity Models: A General Framework for Coding-Based Resilience in ML Inference

Machine learning models are becoming the primary workhorses for many app...
research
08/10/2022

PROFET: Profiling-based CNN Training Latency Prophet for GPU Cloud Instances

Training a Convolutional Neural Network (CNN) model typically requires s...
research
05/06/2021

Towards Inference Delivery Networks: Distributing Machine Learning with Optimality Guarantees

We present the novel idea of inference delivery networks (IDN), networks...
research
05/31/2022

Dropbear: Machine Learning Marketplaces made Trustworthy with Byzantine Model Agreement

Marketplaces for machine learning (ML) models are emerging as a way for ...
research
07/07/2020

C2G-Net: Exploiting Morphological Properties for Image Classification

In this paper we propose C2G-Net, a pipeline for image classification th...
research
10/09/2021

Evaluation and Ranking of Replica Deployments in Geographic State Machine Replication

Geographic state machine replication (SMR) is a replication method in wh...

Please sign up or login with your details

Forgot password? Click here to reset