Text-based Person Search in Full Images via Semantic-Driven Proposal Generation

09/27/2021
by Shizhou Zhang, et al.

Finding target persons in full scene images from a text description query has important practical applications in intelligent video surveillance. However, existing text-based person retrieval methods mainly focus on cross-modal matching between the query text and a gallery of cropped pedestrian images, whereas in real-world scenarios bounding boxes are not available. To close the gap, we study the problem of text-based person search in full images and propose a new end-to-end learning framework that jointly optimizes the pedestrian detection, identification, and visual-semantic feature embedding tasks. To take full advantage of the query text, the semantic features are leveraged to instruct the Region Proposal Network to pay more attention to the text-described proposals. In addition, a cross-scale visual-semantic embedding mechanism is utilized to improve performance. To validate the proposed method, we collect and annotate two large-scale benchmark datasets based on the widely adopted image-based person search datasets CUHK-SYSU and PRW. Comprehensive experiments on the two datasets show that our method achieves state-of-the-art performance compared with the baseline methods.
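
The idea of letting the query text steer attention toward matching proposals can be illustrated with a small sketch. This is not the paper's mechanism, which the abstract does not specify in detail; it is a simplified stand-in that re-ranks candidate boxes by blending a detector's objectness score with the cosine similarity between each proposal's feature and the text embedding. The function names, the dictionary layout, and the blending weight `alpha` are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rerank_proposals(proposals, text_embedding, alpha=0.5):
    """Re-rank proposals using the query text (hypothetical helper).

    proposals: list of dicts with keys 'box', 'objectness', 'feature'
               (an assumed layout, not taken from the paper).
    alpha:     assumed blending weight; 0 ignores the text, 1 ignores objectness.
    """
    scored = []
    for p in proposals:
        sim = cosine(p["feature"], text_embedding)
        sim01 = (sim + 1.0) / 2.0  # map cosine from [-1, 1] to [0, 1]
        score = (1.0 - alpha) * p["objectness"] + alpha * sim01
        scored.append({**p, "score": score})
    # Highest blended score first
    return sorted(scored, key=lambda p: p["score"], reverse=True)


if __name__ == "__main__":
    candidates = [
        {"box": (0, 0, 10, 20), "objectness": 0.9, "feature": [1.0, 0.0]},
        {"box": (5, 5, 15, 25), "objectness": 0.6, "feature": [0.0, 1.0]},
    ]
    query = [0.0, 1.0]  # toy text embedding
    for p in rerank_proposals(candidates, query):
        print(p["box"], round(p["score"], 3))
```

In this toy setup the second box has the lower objectness score but matches the query embedding, so it moves to the top once the text similarity is blended in. In an end-to-end framework such as the one described above, the guidance would be learned jointly with detection rather than applied as a post-hoc re-ranking step.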


research
04/07/2016

Joint Detection and Identification Feature Learning for Person Search

Existing person re-identification benchmarks and methods mainly focus on...
research
02/10/2023

End-to-end Semantic Object Detection with Cross-Modal Alignment

Traditional semantic image search methods aim to retrieve images that ma...
research
05/24/2017

Attention-based Natural Language Person Retrieval

Following the recent progress in image classification and captioning usi...
research
08/24/2023

Ground-to-Aerial Person Search: Benchmark Dataset and Approach

In this work, we construct a large-scale dataset for Ground-to-Aerial Pe...
research
05/16/2017

IAN: The Individual Aggregation Network for Person Search

Person search in real-world scenarios is a new challenging computer vers...
research
03/06/2019

Large-Scale Pedestrian Retrieval Competition

The Large-Scale Pedestrian Retrieval Competition (LSPRC) mainly focuses ...
research
12/04/2020

PeR-ViS: Person Retrieval in Video Surveillance using Semantic Description

A person is usually characterized by descriptors like age, gender, heigh...
