Parsing R-CNN for Instance-Level Human Analysis

11/30/2018
by   Lu Yang, et al.
0

Instance-level human analysis is common in real-life scenarios and has multiple manifestations, such as human part segmentation, dense pose estimation, human-object interactions, etc. Models need to distinguish different human instances in the image panel and learn rich features to represent the details of each instance. In this paper, we present an end-to-end pipeline for solving the instance-level human analysis, named Parsing R-CNN. It processes a set of human instances simultaneously through comprehensive considering the characteristics of region-based approach and the appearance of a human, thus allowing representing the details of instances. Parsing R-CNN is very flexible and efficient, which is applicable to many issues in human instance analysis. Our approach outperforms all state-of-the-art methods on CIHP (Crowd Instance-level Human Parsing), MHP v2.0 (Multi-Human Parsing) and DensePose-COCO datasets. Based on the proposed Parsing R-CNN, we reach the 1st place in the COCO 2018 Challenge DensePose Estimation task. Code and models are public available.

READ FULL TEXT

page 1

page 4

page 8

research
09/20/2020

Renovating Parsing R-CNN for Accurate Multiple Human Parsing

Multiple human parsing aims to segment various human parts and associate...
research
08/01/2018

Instance-level Human Parsing via Part Grouping Network

Instance-level human parsing towards real-world human analysis scenarios...
research
03/10/2021

Quality-Aware Network for Human Parsing

How to estimate the quality of the network output is an important issue,...
research
10/07/2021

A Baseline Framework for Part-level Action Parsing and Action Recognition

This technical report introduces our 2nd place solution to Kinetics-TPS ...
research
03/11/2021

Robust 2D/3D Vehicle Parsing in CVIS

We present a novel approach to robustly detect and perceive vehicles in ...
research
07/14/2022

Fine-grained Few-shot Recognition by Deep Object Parsing

In our framework, an object is made up of K distinct parts or units, and...
research
04/10/2018

Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing

Despite the noticeable progress in perceptual tasks like detection, inst...

Please sign up or login with your details

Forgot password? Click here to reset