Cascade Transformers for End-to-End Person Search

03/17/2022
by   Rui Yu, et al.
26

The goal of person search is to localize a target person from a gallery set of scene images, which is extremely challenging due to large scale variations, pose/viewpoint changes, and occlusions. In this paper, we propose the Cascade Occluded Attention Transformer (COAT) for end-to-end person search. Our three-stage cascade design focuses on detecting people in the first stage, while later stages simultaneously and progressively refine the representation for person detection and re-identification. At each stage the occluded attention transformer applies tighter intersection over union thresholds, forcing the network to learn coarse-to-fine pose/scale invariant features. Meanwhile, we calculate each detection's occluded attention to differentiate a person's tokens from other people or the background. In this way, we simulate the effect of other objects occluding a person of interest at the token-level. Through comprehensive experiments, we demonstrate the benefits of our method by achieving state-of-the-art performance on two benchmark datasets.

READ FULL TEXT

page 1

page 4

page 6

page 7

research
11/06/2022

Sequential Transformer for End-to-End Person Search

Person Search aims to simultaneously localize and recognize a target per...
research
03/01/2022

Robots Autonomously Detecting People: A Multimodal Deep Contrastive Learning Method Robust to Intraclass Variations

Robotic detection of people in crowded and/or cluttered human-centered e...
research
04/04/2023

Attention Map Guided Transformer Pruning for Edge Device

Due to its significant capability of modeling long-range dependencies, v...
research
09/22/2018

Cascade Attention Network for Person Search: Both Image and Text-Image Similarity Selection

Person search with natural language aims to retrieve the corresponding p...
research
09/10/2023

Towards Fully Decoupled End-to-End Person Search

End-to-end person search aims to jointly detect and re-identify a target...
research
10/07/2022

PS-ARM: An End-to-End Attention-aware Relation Mixer Network for Person Search

Person search is a challenging problem with various real-world applicati...
research
10/24/2022

Gallery Filter Network for Person Search

In person search, we aim to localize a query person from one scene in ot...

Please sign up or login with your details

Forgot password? Click here to reset