Collaborative Quantization for Cross-Modal Similarity Search

02/02/2019
by   Ting Zhang, et al.
0

Cross-modal similarity search is a problem about designing a search system supporting querying across content modalities, e.g., using an image to search for texts or using a text to search for images. This paper presents a compact coding solution for efficient search, with a focus on the quantization approach which has already shown the superior performance over the hashing solutions in the single-modal similarity search. We propose a cross-modal quantization approach, which is among the early attempts to introduce quantization into cross-modal search. The major contribution lies in jointly learning the quantizers for both modalities through aligning the quantized representations for each pair of image and text belonging to a document. In addition, our approach simultaneously learns the common space for both modalities in which quantization is conducted to enable efficient and effective search using the Euclidean distance computed in the common space with fast distance table lookup. Experimental results compared with several competitive algorithms over three benchmark datasets demonstrate that the proposed approach achieves the state-of-the-art performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/16/2019

Shared Predictive Cross-Modal Deep Quantization

With explosive growth of data volume and ever-increasing diversity of da...
research
02/15/2022

Efficient Cross-Modal Retrieval via Deep Binary Hashing and Quantization

Cross-modal retrieval aims to search for data with similar semantic mean...
research
10/31/2018

Semantic Modeling of Textual Relationships in Cross-Modal Retrieval

Feature modeling of different modalities is a basic problem in current r...
research
04/30/2019

Effective and Efficient Indexing in Cross-Modal Hashing-Based Datasets

To overcome the barrier of storage and computation, the hashing techniqu...
research
08/28/2019

Adversarial Representation Learning for Text-to-Image Matching

For many computer vision applications such as image captioning, visual q...
research
11/23/2017

Geometric Cross-Modal Comparison of Heterogeneous Sensor Data

In this work, we address the problem of cross-modal comparison of aerial...
research
02/02/2019

Supervised Quantization for Similarity Search

In this paper, we address the problem of searching for semantically simi...

Please sign up or login with your details

Forgot password? Click here to reset