Generative Model for Models: Rapid DNN Customization for Diverse Tasks and Resource Constraints

08/29/2023
by Wenxing Xu, et al.

Unlike cloud-based deep learning models, which are often large and uniform, edge-deployed models usually demand customization for domain-specific tasks and resource-limited environments. Such customization can be costly and time-consuming because edge scenarios are diverse and each scenario carries its own training load. Although various approaches have been proposed for rapid resource-oriented customization and for rapid task-oriented customization, achieving both at the same time is challenging. Drawing inspiration from generative AI and the modular composability of neural networks, we introduce NN-Factory, a one-for-all framework that generates customized lightweight models for diverse edge scenarios. The key idea is to use a generative model to directly produce the customized models, instead of training them. The main components of NN-Factory are a modular supernet, whose pretrained modules can be conditionally activated to accomplish different tasks, and a generative module assembler that manipulates the modules according to task and sparsity requirements. Given an edge scenario, NN-Factory can efficiently customize a compact model that is specialized for the edge task and satisfies the edge resource constraints by searching for the optimal strategy to assemble the modules. In experiments on image classification and object detection tasks with different edge devices, NN-Factory generates high-quality task- and resource-specific models within a few seconds, orders of magnitude faster than conventional model customization approaches.
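The assemble-and-search idea described in the abstract can be illustrated with a toy sketch. This is not NN-Factory's implementation: the `Module` class, the `gates_for` and `customize` functions, the per-module latency figures, and the fixed relevance scores (standing in for what a learned assembler might output for a given task) are all hypothetical. The sketch only shows the general pattern of activating a subset of pretrained modules at a given sparsity level and searching over sparsity levels until a resource budget is met.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Module:
    name: str
    latency_ms: float  # hypothetical per-module cost on the target device


def gates_for(scores, keep_ratio):
    """Stand-in for the assembler: activate the top-scoring fraction of modules."""
    k = max(1, round(keep_ratio * len(scores)))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    active = set(ranked[:k])
    return [i in active for i in range(len(scores))]


def customize(modules, scores, budget_ms, ratios=(1.0, 0.75, 0.5, 0.25)):
    """Try keep ratios from densest to sparsest; return the first that fits the budget."""
    for ratio in ratios:
        gates = gates_for(scores, ratio)
        cost = sum(m.latency_ms for m, g in zip(modules, gates) if g)
        if cost <= budget_ms:
            return [m.name for m, g in zip(modules, gates) if g], cost
    return None  # no candidate assembly satisfies the budget


# Hypothetical supernet with four modules and made-up task relevance scores.
supernet = [Module("m0", 4.0), Module("m1", 3.0), Module("m2", 2.0), Module("m3", 1.0)]
task_scores = [0.9, 0.1, 0.8, 0.4]
result = customize(supernet, task_scores, budget_ms=7.0)
```

Here no training happens at customization time: the search only evaluates candidate gate assignments against the budget, which is the property the abstract credits for the speed of model generation.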


research
02/25/2019

Modularity as a Means for Complexity Management in Neural Networks Learning

Training a Neural Network (NN) with lots of parameters or intricate arch...
research
08/03/2021

Elastic Architecture Search for Diverse Tasks with Different Resources

We study a new challenging problem of efficient deployment for diverse t...
research
03/13/2023

AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments

Deep learning models are increasingly deployed to edge devices for real-...
research
05/30/2019

Toward Runtime-Throttleable Neural Networks

As deep neural network (NN) methods have matured, there has been increas...
research
04/13/2020

Deep-Edge: An Efficient Framework for Deep Learning Model Update on Heterogeneous Edge

Deep Learning (DL) model-based AI services are increasingly offered in a...
research
05/23/2019

Accelerating DNN Training in Wireless Federated Edge Learning System

Training task in classical machine learning models, such as deep neural ...
