Shared Predictive Cross-Modal Deep Quantization

04/16/2019
by   Erkun Yang, et al.
0

With explosive growth of data volume and ever-increasing diversity of data modalities, cross-modal similarity search, which conducts nearest neighbor search across different modalities, has been attracting increasing interest. This paper presents a deep compact code learning solution for efficient cross-modal similarity search. Many recent studies have proven that quantization-based approaches perform generally better than hashing-based approaches on single-modal similarity search. In this paper, we propose a deep quantization approach, which is among the early attempts of leveraging deep neural networks into quantization-based cross-modal similarity search. Our approach, dubbed shared predictive deep quantization (SPDQ), explicitly formulates a shared subspace across different modalities and two private subspaces for individual modalities, and representations in the shared subspace and the private subspaces are learned simultaneously by embedding them to a reproducing kernel Hilbert space, where the mean embedding of different modality distributions can be explicitly compared. In addition, in the shared subspace, a quantizer is learned to produce the semantics preserving compact codes with the help of label alignment. Thanks to this novel network architecture in cooperation with supervised quantization training, SPDQ can preserve intramodal and intermodal similarities as much as possible and greatly reduce quantization error. Experiments on two popular benchmarks corroborate that our approach outperforms state-of-the-art methods.

READ FULL TEXT

page 1

page 2

page 4

research
02/02/2019

Collaborative Quantization for Cross-Modal Similarity Search

Cross-modal similarity search is a problem about designing a search syst...
research
02/15/2022

Efficient Cross-Modal Retrieval via Deep Binary Hashing and Quantization

Cross-modal retrieval aims to search for data with similar semantic mean...
research
11/07/2021

Cross-modal Zero-shot Hashing by Label Attributes Embedding

Cross-modal hashing (CMH) is one of the most promising methods in cross-...
research
01/26/2022

Discriminative Supervised Subspace Learning for Cross-modal Retrieval

Nowadays the measure between heterogeneous data is still an open problem...
research
03/26/2019

Cross-modal subspace learning with Kernel correlation maximization and Discriminative structure preserving

The measure between heterogeneous data is still an open problem. Many re...
research
07/12/2018

Disjoint Mapping Network for Cross-modal Matching of Voices and Faces

We propose a novel framework, called Disjoint Mapping Network (DIMNet), ...
research
10/08/2019

A Test for Shared Patterns in Cross-modal Brain Activation Analysis

Determining the extent to which different cognitive modalities (understo...

Please sign up or login with your details

Forgot password? Click here to reset