Unsupervised Hashing with Contrastive Information Bottleneck

05/13/2021
by   Zexuan Qiu, et al.
0

Many unsupervised hashing methods are implicitly established on the idea of reconstructing the input data, which basically encourages the hashing codes to retain as much information of original data as possible. However, this requirement may force the models spending lots of their effort on reconstructing the unuseful background information, while ignoring to preserve the discriminative semantic information that is more important for the hashing task. To tackle this problem, inspired by the recent success of contrastive learning in learning continuous representations, we propose to adapt this framework to learn binary hashing codes. Specifically, we first propose to modify the objective function to meet the specific requirement of hashing and then introduce a probabilistic binary representation layer into the model to facilitate end-to-end training of the entire model. We further prove the strong connection between the proposed contrastive-learning-based hashing method and the mutual information, and show that the proposed model can be considered under the broader framework of the information bottleneck (IB). Under this perspective, a more general hashing model is naturally obtained. Extensive experimental results on three benchmark image datasets demonstrate that the proposed hashing method significantly outperforms existing baselines.

READ FULL TEXT
research
09/28/2022

Weighted Contrastive Hashing

The development of unsupervised hashing is advanced by the recent popula...
research
09/07/2021

Refining BERT Embeddings for Document Hashing via Mutual Information Maximization

Existing unsupervised document hashing methods are mostly established on...
research
10/31/2022

Efficient Document Retrieval by End-to-End Refining and Quantizing BERT Embedding with Contrastive Product Quantization

Efficient document retrieval heavily relies on the technique of semantic...
research
10/10/2018

CRH: A Simple Benchmark Approach to Continuous Hashing

In recent years, the distinctive advancement of handling huge data promo...
research
01/31/2022

Learning to Hash Naturally Sorts

Locality sensitive hashing pictures a list-wise sorting problem. Its tes...
research
10/13/2017

End-to-end Network for Twitter Geolocation Prediction and Hashing

We propose an end-to-end neural network to predict the geolocation of a ...
research
05/14/2018

NASH: Toward End-to-End Neural Architecture for Generative Semantic Hashing

Semantic hashing has become a powerful paradigm for fast similarity sear...

Please sign up or login with your details

Forgot password? Click here to reset