Boosting Crowd Counting with Transformers

05/23/2021
by   Guolei Sun, et al.
0

Significant progress on the crowd counting problem has been achieved by integrating larger context into convolutional neural networks (CNNs). This indicates that global scene context is essential, despite the seemingly bottom-up nature of the problem. This may be explained by the fact that context knowledge can adapt and improve local feature extraction to a given scene. In this paper, we therefore investigate the role of global context for crowd counting. Specifically, a pure transformer is used to extract features with global information from overlapping image patches. Inspired by classification, we add a context token to the input sequence, to facilitate information exchange with tokens corresponding to image patches throughout transformer layers. Due to the fact that transformers do not explicitly model the tried-and-true channel-wise interactions, we propose a token-attention module (TAM) to recalibrate encoded features through channel-wise attention informed by the context token. Beyond that, it is adopted to predict the total person count of the image through regression-token module (RTM). Extensive experiments demonstrate that our method achieves state-of-the-art performance on various datasets, including ShanghaiTech, UCF-QNRF, JHU-CROWD++ and NWPU. On the large-scale JHU-CROWD++ dataset, our method improves over the previous best results by 26.9

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/07/2022

CrowdFormer: Weakly-supervised Crowd counting with Improved Generalizability

Convolutional neural networks (CNNs) have dominated the field of compute...
research
03/12/2022

Joint CNN and Transformer Network via weakly supervised Learning for efficient crowd counting

Currently, for crowd counting, the fully supervised methods via density ...
research
09/29/2021

CCTrans: Simplifying and Improving Crowd Counting with Transformer

Most recent methods used for crowd counting are based on the convolution...
research
08/10/2019

SCAR: Spatial-/Channel-wise Attention Regression Networks for Crowd Counting

Recently, crowd counting is a hot topic in crowd analysis. Many CNN-base...
research
12/31/2021

Scene-Adaptive Attention Network for Crowd Counting

In recent years, significant progress has been made on the research of c...
research
08/02/2021

Congested Crowd Instance Localization with Dilated Convolutional Swin Transformer

Crowd localization is a new computer vision task, evolved from crowd cou...
research
07/24/2019

HA-CCN: Hierarchical Attention-based Crowd Counting Network

Single image-based crowd counting has recently witnessed increased focus...

Please sign up or login with your details

Forgot password? Click here to reset