Double Anchor R-CNN for Human Detection in a Crowd

09/22/2019
by   Kevin Zhang, et al.
17

Detecting human in a crowd is a challenging problem due to the uncertainties of occlusion patterns. In this paper, we propose to handle the crowd occlusion problem in human detection by leveraging the head part. Double Anchor RPN is developed to capture body and head parts in pairs. A proposal crossover strategy is introduced to generate high-quality proposals for both parts as a training augmentation. Features of coupled proposals are then aggregated efficiently to exploit the inherent relationship. Finally, a Joint NMS module is developed for robust post-processing. The proposed framework, called Double Anchor R-CNN, is able to detect the body and head for each person simultaneously in crowded scenarios. State-of-the-art results are reported on challenging human detection datasets. Our model yields log-average miss rates (MR) of 51.79pp on CrowdHuman, 55.01pp on COCOPersons (crowded sub-dataset) and 40.02pp on CrowdPose (crowded sub-dataset), which outperforms previous baseline detectors by 3.57pp, 3.82pp, and 4.24pp, respectively. We hope our simple and effective approach will serve as a solid baseline and help ease future research in crowded human detection.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 7

research
04/30/2018

CrowdHuman: A Benchmark for Detecting Human in a Crowd

Human detection has witnessed impressive progress in recent years. Howev...
research
07/23/2018

Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd

Pedestrian detection in crowded scenes is a challenging problem since th...
research
09/24/2019

Relational Learning for Joint Head and Human Detection

Head and human detection have been rapidly improved with the development...
research
11/27/2019

Semantic Head Enhanced Pedestrian Detection in a Crowd

Pedestrian detection in the crowd is a challenging task because of intra...
research
07/14/2022

AIParsing: Anchor-free Instance-level Human Parsing

Most state-of-the-art instance-level human parsing models adopt two-stag...
research
12/15/2022

Body-Part Joint Detection and Association via Extended Object Representation

The detection of human body and its related parts (e.g., face, head or h...
research
05/26/2019

Where's My Head? Definition, Dataset and Models for Numeric Fused-Heads Identification and Resolution

We provide the first computational treatment of fused-heads construction...

Please sign up or login with your details

Forgot password? Click here to reset