Deep Robust Multilevel Semantic Cross-Modal Hashing

02/07/2020
by   Ge Song, et al.
20

Hashing based cross-modal retrieval has recently made significant progress. But straightforward embedding data from different modalities into a joint Hamming space will inevitably produce false codes due to the intrinsic modality discrepancy and noises. We present a novel Robust Multilevel Semantic Hashing (RMSH) for more accurate cross-modal retrieval. It seeks to preserve fine-grained similarity among data with rich semantics, while explicitly require distances between dissimilar points to be larger than a specific value for strong robustness. For this, we give an effective bound of this value based on the information coding-theoretic analysis, and the above goals are embodied into a margin-adaptive triplet loss. Furthermore, we introduce pseudo-codes via fusing multiple hash codes to explore seldom-seen semantics, alleviating the sparsity problem of similarity information. Experiments on three benchmarks show the validity of the derived bounds, and our method achieves state-of-the-art performance.

READ FULL TEXT
research
04/04/2019

Triplet-Based Deep Hashing Network for Cross-Modal Retrieval

Given the benefits of its low storage requirements and high retrieval ef...
research
11/06/2020

Deep Cross-modal Proxy Hashing

Due to their high retrieval efficiency and low storage cost for cross-mo...
research
04/03/2020

Error-Corrected Margin-Based Deep Cross-Modal Hashing for Facial Image Retrieval

Cross-modal hashing facilitates mapping of heterogeneous multimedia data...
research
09/26/2022

Information-Theoretic Hashing for Zero-Shot Cross-Modal Retrieval

Zero-shot cross-modal retrieval (ZS-CMR) deals with the retrieval proble...
research
11/30/2022

Improving Cross-Modal Retrieval with Set of Diverse Embeddings

Cross-modal retrieval across image and text modalities is a challenging ...
research
05/15/2021

FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal Retrieval

Cross-modal hashing, favored for its effectiveness and efficiency, has r...
research
08/08/2017

Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual Cross Retrieval

Cross-modal hashing is usually regarded as an effective technique for la...

Please sign up or login with your details

Forgot password? Click here to reset