Describe me if you can! Characterized Instance-level Human Parsing

01/24/2022
by   Angelique Loesch, et al.
0

Several computer vision applications such as person search or online fashion rely on human description. The use of instance-level human parsing (HP) is therefore relevant since it localizes semantic attributes and body parts within a person. But how to characterize these attributes? To our knowledge, only some single-HP datasets describe attributes with some color, size and/or pattern characteristics. There is a lack of dataset for multi-HP in the wild with such characteristics. In this article, we propose the dataset CCIHP based on the multi-HP dataset CIHP, with 20 new labels covering these 3 kinds of characteristics. In addition, we propose HPTR, a new bottom-up multi-task method based on transformers as a fast and scalable baseline. It is the fastest method of multi-HP state of the art while having precision comparable to the most precise bottom-up method. We hope this will encourage research for fast and accurate methods of precise human descriptions.

READ FULL TEXT
research
08/01/2018

Instance-level Human Parsing via Part Grouping Network

Instance-level human parsing towards real-world human analysis scenarios...
research
05/19/2017

Towards Real World Human Parsing: Multiple-Human Parsing in the Wild

The recent progress of human parsing techniques has been largely driven ...
research
09/20/2020

Renovating Parsing R-CNN for Accurate Multiple Human Parsing

Multiple human parsing aims to segment various human parts and associate...
research
11/29/2019

Color inference from semantic labeling for person search in videos

We propose an explainable model to generate semantic color labels for pe...
research
08/27/2022

RepParser: End-to-End Multiple Human Parsing with Representative Parts

Existing methods of multiple human parsing usually adopt a two-stage str...
research
12/04/2020

PeR-ViS: Person Retrieval in Video Surveillance using Semantic Description

A person is usually characterized by descriptors like age, gender, heigh...
research
04/10/2018

Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing

Despite the noticeable progress in perceptual tasks like detection, inst...

Please sign up or login with your details

Forgot password? Click here to reset