Crowd Scene Analysis by Output Encoding

01/27/2020
by Yao Xue, et al.

Crowd scene analysis is receiving growing attention because of its wide range of applications. Knowing accurate crowd locations (rather than merely the crowd count) is important for spatially identifying high-risk regions in congested scenes. In this paper, we propose a Compressed Sensing based Output Encoding (CSOE) scheme, which casts the detection of pixel coordinates of small objects as a signal regression task in an encoded signal space. CSOE helps boost localization performance when targets are highly crowded but do not vary greatly in scale. In addition, proper receptive field sizes are crucial for crowd analysis because human sizes vary. We create Multiple Dilated Convolution Branches (MDCB), which offer a set of different receptive field sizes, to improve localization accuracy when object sizes change drastically within an image. We also develop an Adaptive Receptive Field Weighting (ARFW) module, which further addresses scale variation by adaptively emphasizing informative channels that have a proper receptive field size. Experiments demonstrate the effectiveness of the proposed method, which achieves state-of-the-art performance on four mainstream datasets, with especially strong results in highly crowded scenes. More importantly, the experiments support our insights that tackling target size variation is crucial in crowd analysis, and that casting crowd localization as regression in an encoded signal space is quite effective.
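To make the idea of output encoding concrete, the following is a minimal sketch of how sparse pixel coordinates could be turned into a short regression target using a random sensing matrix. It is not the authors' implementation: the function name `encode_locations`, the Gaussian sensing matrix, the 8x8 grid, and the signal length of 24 are all assumptions chosen for illustration.

```python
import numpy as np


def encode_locations(coords, height, width, sensing_matrix):
    """Encode (row, col) pixel coordinates as a compressed signal y = A @ x.

    x is a flattened binary map with a 1 at every target pixel; because only
    a few pixels are targets, x is sparse. The sensing matrix A (hypothetical
    here) projects x to a much shorter vector y.
    """
    x = np.zeros(height * width)
    for row, col in coords:
        x[row * width + col] = 1.0
    return sensing_matrix @ x


# Assumed toy setup: an 8x8 image, 3 targets, encoded signal of length 24.
rng = np.random.default_rng(0)
height, width = 8, 8
signal_length = 24
A = rng.standard_normal((signal_length, height * width)) / np.sqrt(signal_length)

coords = [(1, 2), (5, 6), (7, 0)]
y = encode_locations(coords, height, width, A)
```

In a scheme of this kind, a network would be trained to regress `y` rather than the coordinates themselves, and a sparse recovery step (not shown here) would map a predicted signal back to pixel locations.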


