Where, What, Whether: Multi-modal Learning Meets Pedestrian Detection

12/20/2020
by Yan Luo, et al.

Pedestrian detection benefits greatly from deep convolutional neural networks (CNNs). However, it is inherently hard for CNNs to handle occlusion and scale variation. In this paper, we propose W^3Net, which attempts to address these challenges by decomposing the pedestrian detection task into Where, What and Whether problems, targeting pedestrian localization, scale prediction and classification respectively. Specifically, for a pedestrian instance, we formulate its feature in three steps. i) We generate a bird view map, which is naturally free from occlusion issues, and scan all points on it to look for suitable locations for each pedestrian instance. ii) Instead of utilizing pre-fixed anchors, we model the interdependency between depth and scale, aiming to generate depth-guided scales at different locations that better match instances of different sizes. iii) We learn a latent vector shared by both visual and corpus space, by which false positives that have a similar vertical structure but lack human partial features are filtered out. We achieve state-of-the-art results on widely used datasets (Citypersons and Caltech). In particular, when evaluating on the heavy occlusion subset, our results reduce MR^-2 from 49.3% to 18.7% on Citypersons, and from 45.18% to 28.33% on Caltech.
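
Step ii) relies on the interdependency between depth and scale. Under a simple pinhole camera model, the apparent height of a pedestrian in pixels is inversely proportional to depth. The sketch below illustrates only that relationship; it is not the paper's implementation, and the function name, focal length, assumed pedestrian height and box aspect ratio are all hypothetical values chosen for illustration.

```python
def depth_guided_box(depth_m, focal_px=2200.0, person_height_m=1.7, aspect_ratio=0.41):
    """Return (width_px, height_px) of a box for a pedestrian at depth_m metres.

    Pinhole model: height_px = focal_px * person_height_m / depth_m.
    All default values are illustrative assumptions, not taken from the paper.
    """
    if depth_m <= 0:
        raise ValueError("depth_m must be positive")
    # Apparent height shrinks in proportion to 1 / depth.
    height_px = focal_px * person_height_m / depth_m
    # Width follows from an assumed fixed width-to-height ratio.
    width_px = aspect_ratio * height_px
    return width_px, height_px


if __name__ == "__main__":
    # Print the assumed box size at a few depths to show the inverse relation.
    for depth in (5.0, 10.0, 20.0, 40.0):
        width, height = depth_guided_box(depth)
        print(f"depth={depth:5.1f} m  box={width:6.1f} x {height:6.1f} px")
```

Doubling the depth halves the box height, which is why a single set of pre-fixed anchors fits near and far pedestrians poorly, whereas a scale derived from depth can vary with location.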

research
09/15/2019

PedHunter: Occlusion Robust Pedestrian Detector in Crowded Scenes

Pedestrian detection in crowded scenes is a challenging problem, because...
research
07/22/2022

3D Random Occlusion and Multi-Layer Projection for Deep Multi-Camera Pedestrian Localization

Although deep-learning based methods for monocular pedestrian detection ...
research
05/08/2023

Pedestrian Behavior Maps for Safety Advisories: CHAMP Framework and Real-World Data Analysis

It is critical for vehicles to prevent any collisions with pedestrians. ...
research
04/12/2018

PCN: Part and Context Information for Pedestrian Detection with CNNs

Pedestrian detection has achieved great improvements in recent years, wh...
research
01/13/2023

DINF: Dynamic Instance Noise Filter for Occluded Pedestrian Detection

Occlusion issue is the biggest challenge in pedestrian detection. RCNN-b...
research
05/10/2022

The Impact of Partial Occlusion on Pedestrian Detectability

Robust detection of vulnerable road users is a safety critical requireme...
research
07/27/2020

NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination

Greedy-NMS inherently raises a dilemma, where a lower NMS threshold will...
