Deep Binary Reconstruction for Cross-modal Hashing

08/17/2017
by   Xuelong Li, et al.
0

With the increasing demand of massive multimodal data storage and organization, cross-modal retrieval based on hashing technique has drawn much attention nowadays. It takes the binary codes of one modality as the query to retrieve the relevant hashing codes of another modality. However, the existing binary constraint makes it difficult to find the optimal cross-modal hashing function. Most approaches choose to relax the constraint and perform thresholding strategy on the real-value representation instead of directly solving the original objective. In this paper, we first provide a concrete analysis about the effectiveness of multimodal networks in preserving the inter- and intra-modal consistency. Based on the analysis, we provide a so-called Deep Binary Reconstruction (DBRC) network that can directly learn the binary hashing codes in an unsupervised fashion. The superiority comes from a proposed simple but efficient activation function, named as Adaptive Tanh (ATanh). The ATanh function can adaptively learn the binary codes and be trained via back-propagation. Extensive experiments on three benchmark datasets demonstrate that DBRC outperforms several state-of-the-art methods in both image2text and text2image retrieval task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/01/2020

Task-adaptive Asymmetric Deep Cross-modal Hashing

Supervised cross-modal hashing aims to embed the semantic correlations o...
research
12/25/2020

Comprehensive Graph-conditional Similarity Preserving Network for Unsupervised Cross-modal Hashing

Unsupervised cross-modal hashing (UCMH) has become a hot topic recently....
research
09/26/2022

Deep Manifold Hashing: A Divide-and-Conquer Approach for Semi-Paired Unsupervised Cross-Modal Retrieval

Hashing that projects data into binary codes has shown extraordinary tal...
research
03/02/2018

Learning Decorrelated Hashing Codes for Multimodal Retrieval

In social networks, heterogeneous multimedia data correlate to each othe...
research
11/07/2011

Multimodal diff-hash

Many applications require comparing multimodal data with different struc...
research
07/13/2017

Discrete Multi-modal Hashing with Canonical Views for Robust Mobile Landmark Search

Mobile landmark search (MLS) recently receives increasing attention for ...
research
07/06/2012

Multimodal similarity-preserving hashing

We introduce an efficient computational framework for hashing data belon...

Please sign up or login with your details

Forgot password? Click here to reset