TransCrowd: Weakly-Supervised Crowd Counting with Transformer

04/19/2021
by   Dingkang Liang, et al.
0

The mainstream crowd counting methods usually utilize the convolution neural network (CNN) to regress a density map, requiring point-level annotations. However, annotating each person with a point is an expensive and laborious process. During the testing phase, the point-level annotations are not considered to evaluate the counting accuracy, which means the point-level annotations are redundant. Hence, it is desirable to develop weakly-supervised counting methods that just rely on count level annotations, a more economical way of labeling. Current weakly-supervised counting methods adopt the CNN to regress a total count of the crowd by an image-to-count paradigm. However, having limited receptive fields for context modeling is an intrinsic limitation of these weakly-supervised CNN-based methods. These methods thus can not achieve satisfactory performance, limited applications in the real-word. The Transformer is a popular sequence-to-sequence prediction model in NLP, which contains a global receptive field. In this paper, we propose TransCrowd, which reformulates the weakly-supervised crowd counting problem from the perspective of sequence-to-count based on Transformer. We observe that the proposed TransCrowd can effectively extract the semantic crowd information by using the self-attention mechanism of Transformer. To the best of our knowledge, this is the first work to adopt a pure Transformer for crowd counting research. Experiments on five benchmark datasets demonstrate that the proposed TransCrowd achieves superior performance compared with all the weakly-supervised CNN-based counting methods and gains highly competitive counting performance compared with some popular fully-supervised counting methods. Code is available at https://github.com/dk-liang/TransCrowd.

READ FULL TEXT

page 1

page 3

page 6

research
03/12/2022

Joint CNN and Transformer Network via weakly supervised Learning for efficient crowd counting

Currently, for crowd counting, the fully supervised methods via density ...
research
03/07/2022

CrowdFormer: Weakly-supervised Crowd counting with Improved Generalizability

Convolutional neural networks (CNNs) have dominated the field of compute...
research
02/29/2020

Towards Using Count-level Weak Supervision for Crowd Counting

Most existing crowd counting methods require object location-level annot...
research
04/09/2023

CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model

Supervised crowd counting relies heavily on costly manual labeling, whic...
research
09/29/2021

CCTrans: Simplifying and Improving Crowd Counting with Transformer

Most recent methods used for crowd counting are based on the convolution...
research
02/22/2022

Reinforcing Local Feature Representation for Weakly-Supervised Dense Crowd Counting

Fully-supervised crowd counting is a laborious task due to the large amo...
research
03/15/2022

CrowdMLP: Weakly-Supervised Crowd Counting via Multi-Granularity MLP

Existing state-of-the-art crowd counting algorithms rely excessively on ...

Please sign up or login with your details

Forgot password? Click here to reset