MIME: Adapting a Single Neural Network for Multi-task Inference with Memory-efficient Dynamic Pruning

04/11/2022
by   Abhiroop Bhattacharjee, et al.

Recent years have seen a paradigm shift towards multi-task learning. This calls for memory- and energy-efficient solutions for inference in a multi-task scenario. We propose an algorithm-hardware co-design approach called MIME. MIME reuses the weight parameters of a trained parent task and learns task-specific threshold parameters for inference on multiple child tasks. We find that MIME results in highly memory-efficient DRAM storage of neural-network parameters for multiple tasks compared to conventional multi-task inference. In addition, MIME results in input-dependent dynamic neuronal pruning, thereby enabling energy-efficient inference with higher throughput on systolic-array hardware. Our experiments with benchmark datasets as child tasks (CIFAR10, CIFAR100, and Fashion-MNIST) show that MIME achieves 3.48x memory efficiency and 2.4-3.1x energy savings compared to conventional multi-task inference in Pipelined task mode.
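The idea of sharing parent-task weights while learning only per-task thresholds can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not give the exact thresholding rule, so the rule used here (keep a neuron only when its pre-activation exceeds the task's threshold), the function names thresholded_layer and stored_params, the task labels, the layer sizes, and the assumption that a conventional baseline stores one full weight copy per task are all hypothetical choices made for illustration.

```python
import numpy as np


def thresholded_layer(x, shared_w, task_threshold):
    """Apply frozen parent-task weights, then prune neurons below a per-task threshold.

    Assumed rule (not taken from the paper): a neuron is kept only when its
    pre-activation exceeds the threshold learned for the current task.
    """
    pre = x @ shared_w              # weights are shared by every task
    keep = pre > task_threshold     # mask depends on the input, so pruning is dynamic
    return pre * keep, keep


def stored_params(n_weights, n_thresholds, n_tasks, share_weights):
    """Count parameters that must be stored for n_tasks tasks."""
    if share_weights:
        # One weight copy plus one small threshold vector per task.
        return n_weights + n_tasks * n_thresholds
    # Assumed baseline: a separate full weight copy per task.
    return n_tasks * n_weights


rng = np.random.default_rng(0)
shared_w = rng.standard_normal((8, 4))       # hypothetical 8-input, 4-neuron layer
thresholds = {                               # hypothetical per-task thresholds
    "task_a": np.full(4, 0.5),
    "task_b": np.full(4, 1.0),
}
x = rng.standard_normal((2, 8))              # batch of two inputs

out_a, keep_a = thresholded_layer(x, shared_w, thresholds["task_a"])
out_b, keep_b = thresholded_layer(x, shared_w, thresholds["task_b"])

print("fraction pruned:", 1.0 - keep_a.mean(), 1.0 - keep_b.mean())
print("params stored:", stored_params(32, 4, 2, True), "vs", stored_params(32, 4, 2, False))
```

In this toy setting, switching tasks only swaps the threshold vector, and the zeroed neurons are the ones a hardware accelerator could skip; the actual savings reported in the abstract come from the authors' experiments, not from this sketch.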


research
05/18/2019

Which Tasks Should Be Learned Together in Multi-task Learning?

Many computer vision applications require solving multiple tasks in real...
research
11/28/2020

EdgeBERT: Sentence-Level Energy Optimizations for Latency-Aware Multi-Task NLP Inference

Transformer-based language models such as BERT provide significant accur...
research
10/26/2022

M^3ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design

Multi-task learning (MTL) encapsulates multiple learned tasks in a singl...
research
01/29/2021

Towards Generalising Neural Implicit Representations

Neural implicit representations have shown substantial improvements in e...
research
04/17/2023

AdaMTL: Adaptive Input-dependent Inference for Efficient Multi-Task Learning

Modern Augmented reality applications require performing multiple tasks ...
research
05/23/2019

Disentangling Redundancy for Multi-Task Pruning

Can prior network pruning strategies eliminate redundancy in multiple co...
research
12/21/2020

Searching for Controllable Image Restoration Networks

Diverse user preferences over images have recently led to a great amount...
