Do Text-to-Text Multi-Task Learners Suffer from Task Conflict?

12/13/2022
by David Mueller, et al.

Traditional multi-task learning architectures train a single model across multiple tasks through a shared encoder followed by task-specific decoders. Learning these models often requires specialized training algorithms that address task conflict in the shared parameter updates, which can otherwise lead to negative transfer. A new type of multi-task learning within NLP homogenizes multi-task architectures as a shared encoder and language model decoder, which does surprisingly well across a range of diverse tasks. Does this new architecture suffer from task conflicts that require specialized training algorithms? We study how certain factors in the shift towards text-to-text models affect multi-task conflict and negative transfer, finding that both directional conflict and transfer are surprisingly constant across architectures.
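
The abstract does not define "directional conflict". A common proxy in the multi-task learning literature, assumed here and not confirmed as the authors' exact metric, is the cosine similarity between two tasks' gradients on the shared parameters: a negative value means the updates point in opposing directions. The sketch below illustrates that proxy on plain Python lists; the function names `cosine_similarity` and `in_directional_conflict` are hypothetical and chosen only for illustration.

```python
import math


def cosine_similarity(grad_a, grad_b):
    """Cosine similarity between two flattened gradient vectors.

    Assumption: each argument is the gradient of one task's loss with
    respect to the same shared parameters, flattened to a list of floats.
    """
    dot = sum(x * y for x, y in zip(grad_a, grad_b))
    norm_a = math.sqrt(sum(x * x for x in grad_a))
    norm_b = math.sqrt(sum(y * y for y in grad_b))
    if norm_a == 0.0 or norm_b == 0.0:
        # A zero gradient has no direction; treat it as non-conflicting.
        return 0.0
    return dot / (norm_a * norm_b)


def in_directional_conflict(grad_a, grad_b):
    """Return True when the two task gradients point in opposing directions."""
    return cosine_similarity(grad_a, grad_b) < 0.0


# Toy gradients for two hypothetical tasks on three shared parameters.
task_a_grad = [0.5, -1.0, 0.25]
task_b_grad = [-0.4, 0.9, 0.1]
print(in_directional_conflict(task_a_grad, task_b_grad))
```

In practice the gradients would come from a real model's shared parameters and be averaged over many training steps; the toy vectors here only show the calculation.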


research
05/02/2020

Understanding and Improving Information Transfer in Multi-Task Learning

We investigate multi-task learning approaches that use a shared feature ...
research
07/07/2023

Mitigating Negative Transfer with Task Awareness for Sexism, Hate Speech, and Toxic Language Detection

This paper proposes a novel approach to mitigate the negative transfer...
research
11/20/2021

Safe Multi-Task Learning

In recent years, Multi-Task Learning (MTL) attracts much attention due t...
research
08/05/2020

Learning Boost by Exploiting the Auxiliary Task in Multi-task Domain

Learning two tasks in a single shared function has some benefits. Firstl...
research
08/16/2023

STEM: Unleashing the Power of Embeddings for Multi-task Recommendation

Multi-task learning (MTL) has gained significant popularity in recommend...
research
05/23/2023

When Does Aggregating Multiple Skills with Multi-Task Learning Work? A Case Study in Financial NLP

Multi-task learning (MTL) aims at achieving a better model by leveraging...
research
08/05/2020

MultiCheXNet: A Multi-Task Learning Deep Network For Pneumonia-like Diseases Diagnosis From X-ray Scans

We present MultiCheXNet, an end-to-end Multi-task learning model, that i...
