Discriminative Cross-View Binary Representation Learning

04/04/2018
by   Liu Liu, et al.
0

Learning compact representation is vital and challenging for large scale multimedia data. Cross-view/cross-modal hashing for effective binary representation learning has received significant attention with exponentially growing availability of multimedia content. Most existing cross-view hashing algorithms emphasize the similarities in individual views, which are then connected via cross-view similarities. In this work, we focus on the exploitation of the discriminative information from different views, and propose an end-to-end method to learn semantic-preserving and discriminative binary representation, dubbed Discriminative Cross-View Hashing (DCVH), in light of learning multitasking binary representation for various tasks including cross-view retrieval, image-to-image retrieval, and image annotation/tagging. The proposed DCVH has the following key components. First, it uses convolutional neural network (CNN) based nonlinear hashing functions and multilabel classification for both images and texts simultaneously. Such hashing functions achieve effective continuous relaxation during training without explicit quantization loss by using Direct Binary Embedding (DBE) layers. Second, we propose an effective view alignment via Hamming distance minimization, which is efficiently accomplished by bit-wise XOR operation. Extensive experiments on two image-text benchmark datasets demonstrate that DCVH outperforms state-of-the-art cross-view hashing algorithms as well as single-view image hashing algorithms. In addition, DCVH can provide competitive performance for image annotation/tagging.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/15/2017

End-to-end Binary Representation Learning via Direct Binary Embedding

Learning binary representation is essential to large-scale computer visi...
research
02/15/2022

Efficient Cross-Modal Retrieval via Deep Binary Hashing and Quantization

Cross-modal retrieval aims to search for data with similar semantic mean...
research
12/06/2018

Discriminative Supervised Hashing for Cross-Modal Retrieval

With the advantage of low storage cost and high retrieval efficiency, ha...
research
04/03/2020

Error-Corrected Margin-Based Deep Cross-Modal Hashing for Facial Image Retrieval

Cross-modal hashing facilitates mapping of heterogeneous multimedia data...
research
08/26/2019

Embarrassingly Simple Binary Representation Learning

Recent binary representation learning models usually require sophisticat...
research
11/26/2017

HashGAN:Attention-aware Deep Adversarial Hashing for Cross Modal Retrieval

As the rapid growth of multi-modal data, hashing methods for cross-modal...
research
03/08/2018

Learning Effective Binary Visual Representations with Deep Networks

Although traditionally binary visual representations are mainly designed...

Please sign up or login with your details

Forgot password? Click here to reset