Coarse- and Fine-grained Attention Network with Background-aware Loss for Crowd Density Map Estimation

11/07/2020
by   Liangzi Rong, et al.
7

In this paper, we present a novel method Coarse- and Fine-grained Attention Network (CFANet) for generating high-quality crowd density maps and people count estimation by incorporating attention maps to better focus on the crowd area. We devise a from-coarse-to-fine progressive attention mechanism by integrating Crowd Region Recognizer (CRR) and Density Level Estimator (DLE) branch, which can suppress the influence of irrelevant background and assign attention weights according to the crowd density levels, because generating accurate fine-grained attention maps directly is normally difficult. We also employ a multi-level supervision mechanism to assist the backpropagation of gradient and reduce overfitting. Besides, we propose a Background-aware Structural Loss (BSL) to reduce the false recognition ratio while improving the structural similarity to groundtruth. Extensive experiments on commonly used datasets show that our method can not only outperform previous state-of-the-art methods in terms of count accuracy but also improve the image quality of density maps as well as reduce the false recognition ratio.

READ FULL TEXT

page 1

page 3

page 6

page 7

research
08/07/2019

Attend To Count: Crowd Counting with Adaptive Capacity Multi-scale CNNs

Crowd counting is a challenging task due to the large variations in crow...
research
08/06/2021

Fine-grained Domain Adaptive Crowd Counting via Point-derived Segmentation

Existing domain adaptation methods for crowd counting view each crowd im...
research
07/13/2020

Fine-Grained Crowd Counting

Current crowd counting algorithms are only concerned about the number of...
research
01/16/2020

PDANet: Pyramid Density-aware Attention Net for Accurate Crowd Counting

Crowd counting, i.e., estimating the number of people in a crowded area,...
research
09/05/2022

REQA: Coarse-to-fine Assessment of Image Quality to Alleviate the Range Effect

Blind image quality assessment (BIQA) of user generated content (UGC) su...
research
08/02/2017

Generating High-Quality Crowd Density Maps using Contextual Pyramid CNNs

We present a novel method called Contextual Pyramid CNN (CP-CNN) for gen...
research
11/26/2019

Distraction-Aware Feature Learning for Human Attribute Recognition via Coarse-to-Fine Attention Mechanism

Recently, Human Attribute Recognition (HAR) has become a hot topic due t...

Please sign up or login with your details

Forgot password? Click here to reset