Locality-constrained Spatial Transformer Network for Video Crowd Counting

07/18/2019
by   Yanyan Fang, et al.
5

Compared with single image based crowd counting, video provides the spatial-temporal information of the crowd that would help improve the robustness of crowd counting. But translation, rotation and scaling of people lead to the change of density map of heads between neighbouring frames. Meanwhile, people walking in/out or being occluded in dynamic scenes leads to the change of head counts. To alleviate these issues in video crowd counting, a Locality-constrained Spatial Transformer Network (LSTN) is proposed. Specifically, we first leverage a Convolutional Neural Networks to estimate the density map for each frame. Then to relate the density maps between neighbouring frames, a Locality-constrained Spatial Transformer (LST) module is introduced to estimate the density map of next frame with that of current frame. To facilitate the performance evaluation, a large-scale video crowd counting dataset is collected, which contains 15K frames with about 394K annotated heads captured from 13 different scenes. As far as we know, it is the largest video crowd counting dataset. Extensive experiments on our dataset and other crowd counting datasets validate the effectiveness of our LSTN for crowd counting.

READ FULL TEXT

page 2

page 5

research
04/28/2021

Motion-guided Non-local Spatial-Temporal Network for Video Crowd Counting

We study video crowd counting, which is to estimate the number of object...
research
07/25/2017

Spatiotemporal Modeling for Crowd Counting in Videos

Region of Interest (ROI) crowd counting can be formulated as a regressio...
research
07/02/2018

Crowd Counting using Deep Recurrent Spatial-Aware Network

Crowd counting from unconstrained scene images is a crucial task in many...
research
08/02/2021

Congested Crowd Instance Localization with Dilated Convolutional Swin Transformer

Crowd localization is a new computer vision task, evolved from crowd cou...
research
07/04/2019

Video Crowd Counting via Dynamic Temporal Modeling

Crowd counting aims to count the number of instantaneous people in a cro...
research
11/22/2019

Crowd Density Forecasting by Modeling Patch-based Dynamics

Forecasting human activities observed in videos is a long-standing chall...
research
03/24/2023

Application-Driven AI Paradigm for Person Counting in Various Scenarios

Person counting is considered as a fundamental task in video surveillanc...

Please sign up or login with your details

Forgot password? Click here to reset