MIHash: Online Hashing with Mutual Information

03/27/2017
by   Fatih Cakir, et al.
0

Learning-based hashing methods are widely used for nearest neighbor retrieval, and recently, online hashing methods have demonstrated good performance-complexity trade-offs by learning hash functions from streaming data. In this paper, we first address a key challenge for online hashing: the binary codes for indexed data must be recomputed to keep pace with updates to the hash functions. We propose an efficient quality measure for hash functions, based on an information-theoretic quantity, mutual information, and use it successfully as a criterion to eliminate unnecessary hash table updates. Next, we also show how to optimize the mutual information objective using stochastic gradient descent. We thus develop a novel hashing method, MIHash, that can be used in both online and batch settings. Experiments on image retrieval benchmarks (including a 2.5M image dataset) confirm the effectiveness of our formulation, both in reducing hash table recomputations and in learning high-quality hash functions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/02/2018

Hashing with Mutual Information

Binary vector embeddings enable fast nearest neighbor retrieval in large...
research
11/10/2015

Online Supervised Hashing for Ever-Growing Datasets

Supervised hashing methods are widely-used for nearest neighbor search i...
research
04/26/2018

Dialogue Modeling Via Hash Functions: Applications to Psychotherapy

We propose a novel machine-learning framework for dialogue modeling whic...
research
01/16/2019

Deep Supervised Hashing leveraging Quadratic Spherical Mutual Information for Content-based Image Retrieval

Several deep supervised hashing techniques have been proposed to allow f...
research
11/20/2020

Shuffle and Learn: Minimizing Mutual Information for Unsupervised Hashing

Unsupervised binary representation allows fast data retrieval without an...
research
12/01/2020

Fast Class-wise Updating for Online Hashing

Online image hashing has received increasing research attention recently...
research
09/07/2021

Refining BERT Embeddings for Document Hashing via Mutual Information Maximization

Existing unsupervised document hashing methods are mostly established on...

Please sign up or login with your details

Forgot password? Click here to reset