Stacked Pooling: Improving Crowd Counting by Boosting Scale Invariance

08/22/2018
by Siyu Huang, et al.

In this work, we explore cross-scale similarity in the crowd counting scenario, in which regions of different scales often exhibit high visual similarity. This property holds both within an image and across different images, indicating the importance of scale invariance for a crowd counting model. Motivated by this, we propose two simple but effective variants of the pooling module, multi-kernel pooling and stacked pooling, to boost the scale invariance of convolutional neural networks (CNNs), which substantially benefits crowd density estimation and counting. Specifically, multi-kernel pooling comprises pooling kernels with multiple receptive fields to capture responses over multi-scale local ranges. Stacked pooling is an equivalent form of multi-kernel pooling that considerably reduces the computing cost. The proposed pooling modules introduce no extra parameters into the model and can easily replace the vanilla pooling layer in an implementation. In an empirical study on two benchmark crowd counting datasets, stacked pooling outperforms the vanilla pooling layer in most cases.
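
The relationship between the two modules can be illustrated with a minimal 1-D sketch. The abstract does not give kernel sizes, strides, or how the per-kernel responses are combined, so everything concrete here is an assumption made for illustration: max pooling, kernels (2, 4, 8) with stride 2 for the multi-kernel form, kernels (2, 2, 3) with strides (2, 1, 1) for the stacked form, truncated windows at the border, and averaging of the resulting maps. The function names are hypothetical and not taken from the authors' code.

```python
import math


def max_pool1d(x, kernel, stride):
    """Max pooling over a list; windows running past the end are truncated."""
    n_out = math.ceil(len(x) / stride)
    return [max(x[i * stride : i * stride + kernel]) for i in range(n_out)]


def average(maps):
    """Element-wise mean of several equally long response maps."""
    return [sum(vals) / len(maps) for vals in zip(*maps)]


def multi_kernel_pool1d(x, kernels=(2, 4, 8), stride=2):
    # Assumed form: pool the input independently with each kernel size,
    # all at the same stride, then average the responses.
    return average([max_pool1d(x, k, stride) for k in kernels])


def stacked_pool1d(x, kernels=(2, 2, 3), strides=(2, 1, 1)):
    # Assumed form: pool sequentially and average every intermediate map.
    # The stacked stages see the input through windows of 2, 4, and 8.
    outputs, y = [], x
    for k, s in zip(kernels, strides):
        y = max_pool1d(y, k, s)
        outputs.append(y)
    return average(outputs)


signal = [3, 1, 4, 1, 5, 9, 2, 6, 5, 3, 5, 8]
# Under these assumptions the two forms give identical outputs.
assert multi_kernel_pool1d(signal) == stacked_pool1d(signal)
```

In this sketch each output of the multi-kernel form reads 2 + 4 + 8 = 14 input values, while the stacked form reads 2 + 2 + 3 = 7 values across its stages, which is one way to picture the reduced computing cost. Neither function has learnable parameters.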

research
05/24/2021

Multi-Level Attentive Convoluntional Neural Network for Crowd Counting

Recently the crowd counting has received more and more attention. Especi...
research
08/18/2018

In Defense of Single-column Networks for Crowd Counting

Crowd counting usually addressed by density estimation becomes an increa...
research
08/07/2019

Attend To Count: Crowd Counting with Adaptive Capacity Multi-scale CNNs

Crowd counting is a challenging task due to the large variations in crow...
research
07/29/2019

Learn to Scale: Generating Multipolar Normalized Density Map for Crowd Counting

Dense crowd counting aims to predict thousands of human instances from a...
research
01/17/2019

Scale-Aware Attention Network for Crowd Counting

In crowd counting datasets, people appear at different scales, depending...
research
12/20/2019

AutoScale: Learning to Scale for Crowd Counting

Crowd counting in images is a widely explored but challenging task. Thou...
research
11/02/2020

Receptive Field Size Optimization with Continuous Time Pooling

The pooling operation is a cornerstone element of convolutional neural n...
