Constant Sequence Extension for Fast Search Using Weighted Hamming Distance

06/06/2023
by   Zhenyu Weng, et al.
0

Representing visual data using compact binary codes is attracting increasing attention as binary codes are used as direct indices into hash table(s) for fast non-exhaustive search. Recent methods show that ranking binary codes using weighted Hamming distance (WHD) rather than Hamming distance (HD) by generating query-adaptive weights for each bit can better retrieve query-related items. However, search using WHD is slower than that using HD. One main challenge is that the complexity of extending a monotone increasing sequence using WHD to probe buckets in hash table(s) for existing methods is at least proportional to the square of the sequence length, while that using HD is proportional to the sequence length. To overcome this challenge, we propose a novel fast non-exhaustive search method using WHD. The key idea is to design a constant sequence extension algorithm to perform each sequence extension in constant computational complexity and the total complexity is proportional to the sequence length, which is justified by theoretical analysis. Experimental results show that our method is faster than other WHD-based search methods. Also, compared with the HD-based non-exhaustive search method, our method has comparable efficiency but retrieves more query-related items for the dataset of up to one billion items.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/21/2019

Efficient Querying from Weighted Binary Codes

Binary codes are widely used to represent the data due to their small st...
research
09/26/2017

FSL-BM: Fuzzy Supervised Learning with Binary Meta-Feature for Classification

This paper introduces a novel real-time Fuzzy Supervised Learning with B...
research
09/18/2020

Accelerating Search on Binary Codes in Weighted Hamming Space

Compared to Hamming distance, weighted Hamming distance as a similarity ...
research
07/11/2013

Fast Exact Search in Hamming Space with Multi-Index Hashing

There is growing interest in representing image data and feature descrip...
research
09/04/2021

Representation Learning for Efficient and Effective Similarity Search and Recommendation

How data is represented and operationalized is critical for building com...
research
09/19/2022

On Relaxed Locally Decodable Codes for Hamming and Insertion-Deletion Errors

Locally Decodable Codes (LDCs) are error-correcting codes C:Σ^n→Σ^m with...
research
05/15/2018

Efficient end-to-end learning for quantizable representations

Embedding representation learning via neural networks is at the core fou...

Please sign up or login with your details

Forgot password? Click here to reset