DistillHash: Unsupervised Deep Hashing by Distilling Data Pairs

05/09/2019
by   Erkun Yang, et al.
0

Due to the high storage and search efficiency, hashing has become prevalent for large-scale similarity search. Particularly, deep hashing methods have greatly improved the search performance under supervised scenarios. In contrast, unsupervised deep hashing models can hardly achieve satisfactory performance due to the lack of reliable supervisory similarity signals. To address this issue, we propose a novel deep unsupervised hashing model, dubbed DistillHash, which can learn a distilled data set consisted of data pairs, which have confidence similarity signals. Specifically, we investigate the relationship between the initial noisy similarity signals learned from local structures and the semantic similarity labels assigned by a Bayes optimal classifier. We show that under a mild assumption, some data pairs, of which labels are consistent with those assigned by the Bayes optimal classifier, can be potentially distilled. Inspired by this fact, we design a simple yet effective strategy to distill data pairs automatically and further adopt a Bayesian learning framework to learn hash functions from the distilled data set. Extensive experimental results on three widely used benchmark datasets show that the proposed DistillHash consistently accomplishes the state-of-the-art search performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/13/2021

Deep Unsupervised Hashing by Distilled Smooth Guidance

Hashing has been widely used in approximate nearest neighbor search for ...
research
08/16/2021

Deep Self-Adaptive Hashing for Image Retrieval

Hashing technology has been widely used in image retrieval due to its co...
research
10/15/2020

CIMON: Towards High-quality Hash Codes

Recently, hashing is widely-used in approximate nearest neighbor search ...
research
07/01/2020

Unsupervised Semantic Hashing with Pairwise Reconstruction

Semantic Hashing is a popular family of methods for efficient similarity...
research
02/26/2020

Generalized Product Quantization Network for Semi-supervised Hashing

Learning to hash has achieved great success in image retrieval due to it...
research
07/25/2022

Deep Forest with Hashing Screening and Window Screening

As a novel deep learning model, gcForest has been widely used in various...
research
03/10/2015

Short Text Hashing Improved by Integrating Multi-Granularity Topics and Tags

Due to computational and storage efficiencies of compact binary codes, h...

Please sign up or login with your details

Forgot password? Click here to reset