Boosting Crowd Counting via Multifaceted Attention

03/05/2022
by   Hui Lin, et al.

This paper focuses on the challenging crowd counting task. Because large-scale variations often exist within crowd images, neither the fixed-size convolution kernels of CNNs nor the fixed-size attention of recent vision transformers can handle them well. To address this problem, we propose a Multifaceted Attention Network (MAN) to improve transformer models in local spatial relation encoding. MAN incorporates global attention from a vanilla transformer, learnable local attention, and instance attention into a counting model. First, the local Learnable Region Attention (LRA) is proposed to dynamically assign attention exclusively to each feature location. Second, we design the Local Attention Regularization to supervise the training of LRA by minimizing the deviation among the attention for different feature locations. Finally, we provide an Instance Attention mechanism to focus dynamically on the most important instances during training. Extensive experiments on four challenging crowd counting datasets, namely ShanghaiTech, UCF-QNRF, JHU++, and NWPU, have validated the proposed method. Code: https://github.com/LoraLinH/Boosting-Crowd-Counting-via-Multifaceted-Attention.
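The idea behind the Local Attention Regularization, penalizing deviation among the attention assigned to different feature locations, can be illustrated with a small sketch. This is not the authors' implementation: the function name `local_attention_regularization`, the nested-list layout of the region maps, and the choice of population variance as the measure of "deviation" are all assumptions made for illustration only.

```python
from statistics import pvariance


def local_attention_regularization(region_maps):
    """Illustrative penalty that grows as per-location attention budgets diverge.

    region_maps: one 2-D list of attention weights per feature location
    (hypothetical layout, not taken from the paper).
    """
    # Total attention mass that each feature location spends on its region.
    budgets = [sum(sum(row) for row in region) for region in region_maps]
    # Assumption: "deviation" is measured as the population variance of the budgets.
    return pvariance(budgets)


# Three locations that all spend the same total attention (1.0 each).
uniform = [[[0.25, 0.25], [0.25, 0.25]] for _ in range(3)]

# Three locations whose total attention differs (1.0, 4.0, and 0.0).
skewed = [
    [[0.25, 0.25], [0.25, 0.25]],
    [[1.0, 1.0], [1.0, 1.0]],
    [[0.0, 0.0], [0.0, 0.0]],
]

uniform_penalty = local_attention_regularization(uniform)
skewed_penalty = local_attention_regularization(skewed)
```

Under these assumptions, the penalty is zero when every location allocates the same amount of attention and positive otherwise, so minimizing it pushes the learnable regions toward comparable attention budgets.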


