Pyramid Scale Network for Crowd Counting

07/11/2020
by   Junhao Cheng, et al.
10

Crowd counting is a challenging task in computer vision due to serious occlusions, complex background and large scale variations, etc. Multi-column architecture is widely adopted to overcome these challenges, yielding state-of-the-art performance in many public benchmarks. However, there still are two issues in such design: scale limitation and feature similarity. Further performance improvements are thus restricted. In this paper, we propose a novel crowd counting framework called Pyramid Scale Network (PSNet) to explicitly address these issues. Specifically, for scale limitation, we adopt three Pyramid Scale Module (PSM) to efficiently capture multi-scale features, which integrate a message passing mechanism and an attention mechanism into multi-column architecture. Moreover, for feature similarity, a Differential loss is introduced to make the features learned by each column in PSM appropriately different from each other. To the best of our knowledge, PSNet is the first work to explicitly address scale limitation and feature similarity in multi-column design. Extensive experiments on five benchmark datasets demonstrate the effectiveness of the proposed innovations as well as the superior performance over the state-of-the-art. Our code is publicly available at: https://github.com/JunhaoCheng/Pyramid_Scale_Network

READ FULL TEXT

page 1

page 4

page 6

research
08/18/2018

In Defense of Single-column Networks for Crowd Counting

Crowd counting usually addressed by density estimation becomes an increa...
research
06/24/2019

Dense Scale Network for Crowd Counting

Crowd counting has been widely studied by computer vision community in r...
research
11/07/2018

PaDNet: Pan-Density Crowd Counting

Crowd counting in varying density scenes is a challenging problem in art...
research
08/23/2019

Crowd Counting with Deep Structured Scale Integration Network

Automatic estimation of the number of people in unconstrained crowded sc...
research
05/25/2020

Interlayer and Intralayer Scale Aggregation for Scale-invariant Crowd Counting

Crowd counting is an important vision task, which faces challenges on co...
research
05/28/2023

MixDehazeNet : Mix Structure Block For Image Dehazing Network

Image dehazing is a typical task in the low-level vision field. Previous...
research
09/16/2019

Perspective-Guided Convolution Networks for Crowd Counting

In this paper, we propose a novel perspective-guided convolution (PGC) f...

Please sign up or login with your details

Forgot password? Click here to reset