Learning to Branch for Multi-Task Learning

06/02/2020
by   Pengsheng Guo, et al.
0

Training multiple tasks jointly in one deep network yields reduced latency during inference and better performance over the single-task counterpart by sharing certain layers of a network. However, over-sharing a network could erroneously enforce over-generalization, causing negative knowledge transfer across tasks. Prior works rely on human intuition or pre-computed task relatedness scores for ad hoc branching structures. They provide sub-optimal end results and often require huge efforts for the trial-and-error process. In this work, we present an automated multi-task learning algorithm that learns where to share or branch within a network, designing an effective network topology that is directly optimized for multiple objectives across tasks. Specifically, we propose a novel tree-structured design space that casts a tree branching operation as a gumbel-softmax sampling procedure. This enables differentiable network splitting that is end-to-end trainable. We validate the proposed method on controlled synthetic data, CelebA, and Taskonomy.

READ FULL TEXT

page 8

page 12

research
04/05/2019

Branched Multi-Task Networks: Deciding What Layers To Share

In the context of deep learning, neural networks with multiple branches ...
research
12/07/2022

Tree DNN: A Deep Container Network

Multi-Task Learning (MTL) has shown its importance at user products for ...
research
05/26/2023

DynaShare: Task and Instance Conditioned Parameter Sharing for Multi-Task Learning

Multi-task networks rely on effective parameter sharing to achieve robus...
research
03/16/2023

Efficient Computation Sharing for Multi-Task Visual Scene Understanding

Solving multiple visual tasks using individual models can be resource-in...
research
03/13/2023

Dynamic Neural Network for Multi-Task Learning Searching across Diverse Network Topologies

In this paper, we present a new MTL framework that searches for structur...
research
04/29/2020

Task-Feature Collaborative Learning with Application to Personalized Attribute Prediction

As an effective learning paradigm against insufficient training samples,...
research
09/28/2016

Learning to Push by Grasping: Using multiple tasks for effective learning

Recently, end-to-end learning frameworks are gaining prevalence in the f...

Please sign up or login with your details

Forgot password? Click here to reset