Online Supervised Hashing for Ever-Growing Datasets

11/10/2015
by   Fatih Cakir, et al.
0

Supervised hashing methods are widely-used for nearest neighbor search in computer vision applications. Most state-of-the-art supervised hashing approaches employ batch-learners. Unfortunately, batch-learning strategies can be inefficient when confronted with large training datasets. Moreover, with batch-learners, it is unclear how to adapt the hash functions as a dataset continues to grow and diversify over time. Yet, in many practical scenarios the dataset grows and diversifies; thus, both the hash functions and the indexing must swiftly accommodate these changes. To address these issues, we propose an online hashing method that is amenable to changes and expansions of the datasets. Since it is an online algorithm, our approach offers linear complexity with the dataset size. Our solution is supervised, in that we incorporate available label information to preserve the semantic neighborhood. Such an adaptive hashing method is attractive; but it requires recomputing the hash table as the hash functions are updated. If the frequency of update is high, then recomputing the hash table entries may cause inefficiencies in the system, especially for large indexes. Thus, we also propose a framework to reduce hash table updates. We compare our method to state-of-the-art solutions on two benchmarks and demonstrate significant improvements over previous work.

READ FULL TEXT

page 3

page 6

research
03/27/2017

MIHash: Online Hashing with Mutual Information

Learning-based hashing methods are widely used for nearest neighbor retr...
research
07/26/2017

Asymmetric Deep Supervised Hashing

Hashing has been widely used for large-scale approximate nearest neighbo...
research
11/02/2017

Set-to-Set Hashing with Applications in Visual Recognition

Visual data, such as an image or a sequence of video frames, is often na...
research
02/10/2023

Unified Functional Hashing in Automatic Machine Learning

The field of Automatic Machine Learning (AutoML) has recently attained i...
research
01/29/2019

Towards Optimal Discrete Online Hashing with Balanced Similarity

When facing large-scale image datasets, online hashing serves as a promi...
research
12/01/2020

Fast Class-wise Updating for Online Hashing

Online image hashing has received increasing research attention recently...
research
06/24/2022

VIP Hashing – Adapting to Skew in Popularity of Data on the Fly (extended version)

All data is not equally popular. Often, some portion of data is more fre...

Please sign up or login with your details

Forgot password? Click here to reset