Tasks Integrated Networks: Joint Detection and Retrieval for Image Search

09/03/2020
by   Lei Zhang, et al.
7

The traditional object retrieval task aims to learn a discriminative feature representation with intra-similarity and inter-dissimilarity, which supposes that the objects in an image are manually or automatically pre-cropped exactly. However, in many real-world searching scenarios (e.g., video surveillance), the objects (e.g., persons, vehicles, etc.) are seldom accurately detected or annotated. Therefore, object-level retrieval becomes intractable without bounding-box annotation, which leads to a new but challenging topic, i.e. image-level search. In this paper, to address the image search issue, we first introduce an end-to-end Integrated Net (I-Net), which has three merits: 1) A Siamese architecture and an on-line pairing strategy for similar and dissimilar objects in the given images are designed. 2) A novel on-line pairing (OLP) loss is introduced with a dynamic feature dictionary, which alleviates the multi-task training stagnation problem, by automatically generating a number of negative pairs to restrict the positives. 3) A hard example priority (HEP) based softmax loss is proposed to improve the robustness of classification task by selecting hard categories. With the philosophy of divide and conquer, we further propose an improved I-Net, called DC-I-Net, which makes two new contributions: 1) two modules are tailored to handle different tasks separately in the integrated framework, such that the task specification is guaranteed. 2) A class-center guided HEP loss (C2HEP) by exploiting the stored class centers is proposed, such that the intra-similarity and inter-dissimilarity can be captured for ultimate retrieval. Extensive experiments on famous image-level search oriented benchmark datasets demonstrate that the proposed DC-I-Net outperforms the state-of-the-art tasks-integrated and tasks-separated image search models.

READ FULL TEXT

page 2

page 3

page 5

page 11

page 12

page 13

page 14

research
04/02/2018

End-to-End Detection and Re-identification Integrated Net for Person Search

This paper proposes a pedestrian detection and re-identification (re-id)...
research
10/27/2022

MMFL-Net: Multi-scale and Multi-granularity Feature Learning for Cross-domain Fashion Retrieval

Instance-level image retrieval in fashion is a challenging issue owing t...
research
11/21/2018

Angular Triplet-Center Loss for Multi-view 3D Shape Retrieval

How to obtain the desirable representation of a 3D shape, which is discr...
research
08/26/2020

Keypoint-Aligned Embeddings for Image Retrieval and Re-identification

Learning embeddings that are invariant to the pose of the object is cruc...
research
03/26/2020

Correspondence Networks with Adaptive Neighbourhood Consensus

In this paper, we tackle the task of establishing dense visual correspon...
research
12/22/2017

Deep Hashing with Category Mask for Fast Video Retrieval

This paper proposes an end-to-end deep hashing framework with category m...

Please sign up or login with your details

Forgot password? Click here to reset