Multitask Learning with Single Gradient Step Update for Task Balancing

05/20/2020
by Sungjae Lee, et al.

Multitask learning is a methodology that can boost generalization performance while reducing computational cost and memory usage. However, learning multiple tasks simultaneously can be harder than learning a single task, because it can cause an imbalance problem among the tasks. To address this imbalance, we propose an algorithm that balances tasks at the gradient level by applying the learning procedure of gradient-based meta-learning to multitask learning. The proposed method trains the shared layer and the task-specific layers separately, so that these two kinds of layers, which play different roles in a multitask network, can each be fitted to their own purpose. In particular, the shared layer, which contains the knowledge shared among tasks, is trained with a single gradient step update and an inner/outer loop training procedure to mitigate the imbalance problem at the gradient level. We apply the proposed method to various multitask computer vision problems and achieve state-of-the-art performance.
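The abstract does not spell out the update rule, so the following is only a minimal sketch of one possible reading, not the authors' algorithm. It assumes a first-order, meta-learning-style scheme: each task takes a single gradient step on a copy of the shared parameter (inner loop), and the shared parameter is then updated with the task gradients evaluated at those adapted copies (outer loop), while each task-specific head is trained on its own loss. The toy model (one shared scalar, one scalar head per task), the names `mse_and_grads`, `train`, `total_loss`, and `TASKS`, and all learning rates are hypothetical choices made for illustration.

```python
# Toy setup (assumption): task t predicts heads[t] * shared * x, scored by MSE.

def mse_and_grads(shared, head, xs, ys):
    """Return (loss, dloss/dshared, dloss/dhead) for one task."""
    n = len(xs)
    loss = g_shared = g_head = 0.0
    for x, y in zip(xs, ys):
        err = head * shared * x - y
        loss += err * err / n
        g_shared += 2.0 * err * head * x / n
        g_head += 2.0 * err * shared * x / n
    return loss, g_shared, g_head


def train(tasks, shared=1.0, inner_lr=0.02, outer_lr=0.02, head_lr=0.02, steps=300):
    """Train the shared parameter and the task heads separately."""
    heads = [1.0 for _ in tasks]
    for _ in range(steps):
        outer_grad = 0.0
        for t, (xs, ys) in enumerate(tasks):
            _, g_shared, g_head = mse_and_grads(shared, heads[t], xs, ys)
            # Task-specific head: plain gradient step on its own task loss.
            heads[t] -= head_lr * g_head
            # Inner loop: a single gradient step on a per-task copy of the shared parameter.
            adapted = shared - inner_lr * g_shared
            # First-order approximation: use the task gradient at the adapted copy.
            _, g_adapted, _ = mse_and_grads(adapted, heads[t], xs, ys)
            outer_grad += g_adapted
        # Outer loop: one update of the real shared parameter from all tasks.
        shared -= outer_lr * outer_grad
    return shared, heads


def total_loss(tasks, shared, heads):
    """Sum of per-task MSE losses."""
    return sum(mse_and_grads(shared, h, xs, ys)[0] for h, (xs, ys) in zip(heads, tasks))


# Two toy regression tasks with different target scales (y = x and y = 3x).
XS = [0.5, 1.0, 1.5]
TASKS = [(XS, [1.0 * x for x in XS]), (XS, [3.0 * x for x in XS])]

if __name__ == "__main__":
    shared, heads = train(TASKS)
    print("loss before:", total_loss(TASKS, 1.0, [1.0, 1.0]))
    print("loss after: ", total_loss(TASKS, shared, heads))
```

In this sketch the shared parameter never receives a raw per-task gradient directly; it only sees gradients measured after each task's one-step lookahead, which is the part of the procedure the abstract associates with balancing tasks at the gradient level. A full implementation would replace the scalar model with a network and could differentiate through the inner step instead of using the first-order shortcut.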
