End-to-End Multi-Task Learning with Attention

03/28/2018
by   Shikun Liu, et al.
0

In this paper, we propose a novel multi-task learning architecture, which incorporates recent advances in attention mechanisms. Our approach, the Multi-Task Attention Network (MTAN), consists of a single shared network containing a global feature pool, together with task-specific soft-attention modules, which are trainable in an end-to-end manner. These attention modules allow for learning of task-specific features from the global pool, whilst simultaneously allowing for features to be shared across different tasks. The architecture can be built upon any feed-forward neural network, is simple to implement, and is parameter efficient. Experiments on the CityScapes dataset show that our method outperforms several baselines in both single-task and multi-task learning, and is also more robust to the various weighting schemes in the multi-task loss function. We further explore the effectiveness of our method through experiments over a range of task complexities, and show how our method scales well with task complexity compared to baselines.

READ FULL TEXT

page 3

page 5

page 9

page 11

page 12

research
07/14/2020

Knowledge Distillation for Multi-task Learning

Multi-task learning (MTL) is to learn one single model that performs mul...
research
02/18/2020

Multi-Task Learning from Videos via Efficient Inter-Frame Attention

Prior work in multi-task learning has mainly focused on predictions on a...
research
11/23/2020

Multi-task Learning for Human Settlement Extent Regression and Local Climate Zone Classification

Human Settlement Extent (HSE) and Local Climate Zone (LCZ) maps are both...
research
02/09/2020

Multi-Task Learning by a Top-Down Control Network

A general problem that received considerable recent attention is how to ...
research
06/04/2023

Top-Down Processing: Top-Down Network Combines Back-Propagation with Attention

Early neural network models relied exclusively on bottom-up processing g...
research
02/28/2018

Neural Aesthetic Image Reviewer

Recently, there is a rising interest in perceiving image aesthetics. The...
research
02/21/2023

MulGT: Multi-task Graph-Transformer with Task-aware Knowledge Injection and Domain Knowledge-driven Pooling for Whole Slide Image Analysis

Whole slide image (WSI) has been widely used to assist automated diagnos...

Please sign up or login with your details

Forgot password? Click here to reset