Run-time Deep Model Multiplexing

01/14/2020
by   Amir Erfan Eshratifar, et al.
6

We propose a framework to design a light-weight neural multiplexer that given input and resource budgets, decides upon the appropriate model to be called for the inference. Mobile devices can use this framework to offload the hard inputs to the cloud while inferring the easy ones locally. Besides, in the large scale cloud-based intelligent applications, instead of replicating the most-accurate model, a range of small and large models can be multiplexed from depending on the input's complexity and resource budgets. Our experimental results demonstrate the effectiveness of our framework benefiting both mobile users and cloud providers.

READ FULL TEXT

page 1

page 3

research
09/10/2018

Not Just Privacy: Improving Performance of Private Deep Learning in Mobile Cloud

The increasing demand for on-device deep learning services calls for a h...
research
11/12/2022

PriMask: Cascadable and Collusion-Resilient Data Masking for Mobile Cloud Inference

Mobile cloud offloading is indispensable for inference tasks based on la...
research
02/16/2020

MDInference: Balancing Inference Accuracy andLatency for Mobile Applications

Deep Neural Networks (DNNs) are allowing mobile devices to incorporate a...
research
02/16/2020

MDInference: Balancing Inference Accuracy and Latency for Mobile Applications

Deep Neural Networks (DNNs) are allowing mobile devices to incorporate a...
research
02/01/2020

Shared Mobile-Cloud Inference for Collaborative Intelligence

As AI applications for mobile devices become more prevalent, there is an...
research
08/29/2021

Edge-Cloud Collaborated Object Detection via Difficult-Case Discriminator

As one of the basic tasks of computer vision, object detection has been ...
research
09/28/2022

InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric Inference

Mobile-centric AI applications have high requirements for resource-effic...

Please sign up or login with your details

Forgot password? Click here to reset