An End-to-End Transformer Model for Crowd Localization

02/26/2022
by   Dingkang Liang, et al.
0

Crowd localization, predicting head positions, is a more practical and high-level task than simply counting. Existing methods employ pseudo-bounding boxes or pre-designed localization maps, relying on complex post-processing to obtain the head positions. In this paper, we propose an elegant, end-to-end Crowd Localization TRansformer named CLTR that solves the task in the regression-based paradigm. The proposed method views the crowd localization as a direct set prediction problem, taking extracted features and trainable embeddings as input of the transformer-decoder. To achieve good matching results, we introduce a KMO-based Hungarian, which innovatively revisits the label assignment from a context view instead of an independent instance view. Extensive experiments conducted on five datasets in various data settings show the effectiveness of our method. In particular, the proposed method achieves the best localization performance on the NWPU-Crowd, UCF-QNRF, and ShanghaiTech Part A datasets.

READ FULL TEXT

page 1

page 6

page 8

research
08/02/2021

Congested Crowd Instance Localization with Dilated Convolutional Swin Transformer

Crowd localization is a new computer vision task, evolved from crowd cou...
research
09/29/2021

CCTrans: Simplifying and Improving Crowd Counting with Transformer

Most recent methods used for crowd counting are based on the convolution...
research
04/26/2021

Dense Point Prediction: A Simple Baseline for Crowd Counting and Localization

In this paper, we propose a simple yet effective crowd counting and loca...
research
08/30/2023

Boosting Detection in Crowd Analysis via Underutilized Output Features

Detection-based methods have been viewed unfavorably in crowd analysis d...
research
09/24/2022

NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields

Neural Radiance Fields (NeRFs) have been successfully used for scene rep...
research
07/27/2021

Rethinking Counting and Localization in Crowds:A Purely Point-Based Framework

Localizing individuals in crowds is more in accordance with the practica...
research
01/27/2020

Crowd Scene Analysis by Output Encoding

Crowd scene analysis receives growing attention due to its wide applicat...

Please sign up or login with your details

Forgot password? Click here to reset