Learning Compact Neural Networks with Deep Overparameterised Multitask Learning

08/25/2023
by Shen Ren, et al.

Compact neural networks offer many benefits for real-world applications. However, it is usually challenging to train compact networks, with their small parameter counts and low computational costs, to match or exceed the performance of more complex and powerful architectures. This is particularly true for multitask learning, where different tasks compete for resources. We present a simple, efficient and effective overparameterised neural network design for multitask learning: the model architecture is overparameterised during training, and the overparameterised parameters are shared more effectively across tasks, for better optimisation and generalisation. Experiments on two challenging multitask datasets (NYUv2 and COCO) demonstrate the effectiveness of the proposed method across various convolutional networks and parameter sizes.
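To make the idea of training-time overparameterisation concrete, here is a minimal sketch. The abstract does not spell out the factorisation, so everything below is an assumption made for illustration: each compact weight matrix is written as a product of two factors shared across tasks (`U`, `V`) and one task-specific core (`M`), and the product is collapsed into a single compact matrix for inference. The factor names, shapes, values and task names are hypothetical and may differ from the authors' formulation.

```python
# Illustrative sketch only. The factorisation W_task = U @ M_task @ V is an
# assumption for this example, not a formulation confirmed by the abstract.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def apply(weight, x):
    """Apply a weight matrix to a vector given as a flat list."""
    return [sum(w * v for w, v in zip(row, x)) for row in weight]

# Hypothetical factors shared by every task during training.
U = [[1.0, 0.0, 2.0],
     [0.0, 1.0, 1.0]]   # 2 x 3
V = [[1.0, 2.0],
     [0.0, 1.0],
     [1.0, 0.0]]        # 3 x 2

# Hypothetical task-specific cores (3 x 3), one per task.
M = {
    "segmentation": [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 1.0]],
    "depth":        [[0.0, 1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 3.0]],
}

def collapse(task):
    """Fold the overparameterised factors into one compact 2 x 2 matrix."""
    return matmul(matmul(U, M[task]), V)

def count(matrix):
    """Number of scalar parameters in a matrix."""
    return sum(len(row) for row in matrix)

x = [1.0, -1.0]
for task in M:
    overparameterised = apply(U, apply(M[task], apply(V, x)))  # training-time path
    compact = apply(collapse(task), x)                         # inference-time path
    print(task, overparameterised, compact)

training_params = count(U) + count(V) + sum(count(m) for m in M.values())
inference_params = sum(count(collapse(task)) for task in M)
print("training parameters:", training_params)
print("inference parameters:", inference_params)
```

Under these assumptions, both paths give the same output for every task, because a product of linear maps is itself a linear map. The extra parameters therefore exist only during training, and the deployed model keeps its compact size.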


research
05/19/2018

Learning to Multitask

Multitask learning has shown promising performance in many applications ...
research
06/09/2022

DiSparse: Disentangled Sparsification for Multitask Model Compression

Despite the popularity of Model Compression and Multitask Learning, how ...
research
04/05/2016

Deep Cross Residual Learning for Multitask Visual Recognition

Residual learning has recently surfaced as an effective means of constru...
research
12/19/2018

Multitask Painting Categorization by Deep Multibranch Neural Network

In this work we propose a new deep multibranch neural network to solve t...
research
08/09/2018

The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data

Electronic phenotyping, which is the task of ascertaining whether an ind...
research
12/02/2022

ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning

Pretraining has been shown to scale well with compute, data size and dat...
research
11/28/2017

WSNet: Compact and Efficient Networks with Weight Sampling

We present a new approach and a novel architecture, termed WSNet, for le...
