Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing

11/27/2019
by Haoyu He, et al.

Human parsing, or human body part semantic segmentation, has been an active research topic due to its wide range of potential applications. In this paper, we propose a novel GRAph PYramid Mutual Learning (Grapy-ML) method to address the cross-dataset human parsing problem, in which the annotations are at different granularities. Starting from prior knowledge of the hierarchical structure of the human body, we devise a graph pyramid module (GPM) by stacking three levels of graph structures in sequence, from coarse granularity to fine granularity. At each level, GPM uses the self-attention mechanism to model the correlations between context nodes. It then adopts a top-down mechanism to progressively refine the hierarchical features through all the levels. GPM also enables efficient mutual learning: the network weights of the first two levels are shared, so that the learned coarse-granularity information is exchanged across different datasets. By making use of the multi-granularity labels, Grapy-ML learns a more discriminative feature representation and achieves state-of-the-art performance, as demonstrated by extensive experiments on three popular benchmarks, including the CIHP dataset. The source code is publicly available at https://github.com/Charleshhy/Grapy-ML.
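
The per-level self-attention and top-down refinement described in the abstract can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation: the node counts (2, 3, 5), the feature dimension, the parent maps, the use of plain addition for refinement, and every function name below are assumptions chosen for illustration. The attention also has no learned projections, so the cross-dataset weight sharing of the first two levels is not shown.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(nodes):
    """Scaled dot-product self-attention among the nodes of one level.

    Assumption: queries, keys, and values are the raw node features
    (no learned projection matrices).
    """
    dim = len(nodes[0])
    out = []
    for q in nodes:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in nodes]
        weights = softmax(scores)
        out.append([sum(w * v[d] for w, v in zip(weights, nodes)) for d in range(dim)])
    return out


def top_down_refine(fine, coarse, parent):
    """Add to each fine node the feature of its (hypothetical) coarse parent."""
    return [[f + c for f, c in zip(fine[i], coarse[parent[i]])]
            for i in range(len(fine))]


def graph_pyramid(levels, parents):
    """Run attention at each level, refining from coarse to fine.

    levels:  node-feature lists ordered coarse to fine.
    parents: parents[i][j] is the index, in level i, of the parent of
             node j in level i + 1 (a hypothetical body hierarchy).
    """
    refined = self_attention(levels[0])
    outputs = [refined]
    for i in range(1, len(levels)):
        attended = self_attention(levels[i])
        refined = top_down_refine(attended, refined, parents[i - 1])
        outputs.append(refined)
    return outputs


# Toy input: three levels with 2, 3, and 5 nodes, each a 2-d feature.
levels = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.5, 0.5], [1.0, 0.2], [0.1, 0.9]],
    [[0.3, 0.1], [0.2, 0.8], [0.9, 0.9], [0.4, 0.6], [0.7, 0.2]],
]
parents = [
    [0, 0, 1],        # level-2 nodes -> level-1 parents
    [0, 0, 1, 2, 2],  # level-3 nodes -> level-2 parents
]
outputs = graph_pyramid(levels, parents)
```

In this sketch, `outputs[-1]` holds the finest-level node features after they have absorbed context from both coarser levels. A trainable version would add learned projections to the attention at each level and reuse the first two levels' parameters across datasets.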


research
04/09/2019

Graphonomy: Universal Human Parsing via Graph Transfer Learning

Prior highly-tuned human parsing models tend to fit towards each dataset...
research
01/26/2021

Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer

Prior highly-tuned image parsing models are usually studied in a certain...
research
12/22/2020

Progressive One-shot Human Parsing

Prior human parsing models are limited to parsing humans into classes pr...
research
02/05/2023

Pyramid Self-attention Polymerization Learning for Semi-supervised Skeleton-based Action Recognition

Most semi-supervised skeleton-based action recognition approaches aim to...
research
09/20/2020

Renovating Parsing R-CNN for Accurate Multiple Human Parsing

Multiple human parsing aims to segment various human parts and associate...
research
10/10/2020

HCNet: Hierarchical Context Network for Semantic Segmentation

Global context information is vital in visual understanding problems, es...
research
08/04/2021

Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation

We present a novel pyramidal output representation to ensure parsimony w...
