Retrieve Anyone: A General-purpose Person Re-identification Task with Instructions

06/13/2023
by   Weizhen He, et al.
0

Human intelligence can retrieve any person according to both visual and language descriptions. However, the current computer vision community studies specific person re-identification (ReID) tasks in different scenarios separately, which limits the applications in the real world. This paper strives to resolve this problem by proposing a new instruct-ReID task that requires the model to retrieve images according to the given image or language instructions.Our instruct-ReID is a more general ReID setting, where existing ReID tasks can be viewed as special cases by designing different instructions. We propose a large-scale OmniReID benchmark and an adaptive triplet loss as a baseline method to facilitate research in this new setting. Experimental results show that the baseline model trained on our OmniReID benchmark can improve +0.5 +0.2 mAP on COCAS+ real2 for clothestemplate based clothes-changing ReID when using only RGB images, +25.5 language-instructed ReID. The dataset, model, and code will be available at https://github.com/hwz-zju/Instruct-ReID.

READ FULL TEXT

page 5

page 9

page 14

page 15

page 16

page 17

research
04/06/2022

Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification

Recently, large-scale synthetic datasets are shown to be very useful for...
research
05/18/2023

VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

Large language models (LLMs) have notably accelerated progress towards a...
research
02/09/2018

Triplet-based Deep Similarity Learning for Person Re-Identification

In recent years, person re-identification (re-id) catches great attentio...
research
10/23/2022

Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions

The "Patient Instruction" (PI), which contains critical instructional in...
research
01/26/2017

Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in vitro

The main contribution of this paper is a simple semi-supervised pipeline...
research
04/29/2022

A Challenging Benchmark of Anime Style Recognition

Given two images of different anime roles, anime style recognition (ASR)...
research
08/07/2019

SpatialSense: An Adversarially Crowdsourced Benchmark for Spatial Relation Recognition

Understanding the spatial relations between objects in images is a surpr...

Please sign up or login with your details

Forgot password? Click here to reset