Object-Centric Multi-Task Learning for Human Instances

03/13/2023
by   Hyeongseok Son, et al.

Humans are one of the most essential classes in visual recognition tasks such as detection, segmentation, and pose estimation. Although much effort has been put into each individual task, multi-task learning across these three tasks has rarely been studied. In this paper, we explore a compact multi-task network architecture that shares as many parameters as possible across the tasks via object-centric learning. To this end, we propose a novel query design, called the human-centric query (HCQ), that encodes human instance information effectively. HCQ enables the query to also learn explicit, structural information about humans, such as keypoints. In addition, we use HCQ directly in the prediction heads of the target tasks and interweave HCQ with the deformable attention in the Transformer decoders to exploit a well-learned object-centric representation. Experimental results show that the proposed multi-task network achieves accuracy comparable to state-of-the-art task-specific models in human detection, segmentation, and pose estimation, while incurring lower computational cost.
