Hierarchical One Permutation Hashing: Efficient Multimedia Near Duplicate Detection

05/29/2018
by   Chengyuan Zhang, et al.
0

With advances in multimedia technologies and the proliferation of smart phone, digital cameras, storage devices, there are a rapidly growing massive amount of multimedia data collected in many applications such as multimedia retrieval and management system, in which the data element is composed of text, image, video and audio. Consequently, the study of multimedia near duplicate detection has attracted significant concern from research organizations and commercial communities. Traditional solution minwish hashing () faces two challenges: expensive preprocessing time and lower comparison speed. Thus, this work first introduce a hashing method called one permutation hashing () to shun the costly preprocessing time. Based on , a more efficient strategy group based one permutation hashing () is developed to deal with the high comparison time. Based on the fact that the similarity of most multimedia data is not very high, this work design an new hashing method namely hierarchical one permutation hashing () to further improve the performance. Comprehensive experiments on real multimedia datasets clearly show that with similar accuracy is five to seven times faster than

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/06/2012

One Permutation Hashing for Efficient Search and Learning

Recently, the method of b-bit minwise hashing has been applied to large-...
research
11/18/2021

C-OPH: Improving the Accuracy of One Permutation Hashing (OPH) with Circulant Permutations

Minwise hashing (MinHash) is a classical method for efficiently estimati...
research
09/08/2018

Efficient Multimedia Similarity Measurement Using Similar Elements

Online social networking techniques and large-scale multimedia systems a...
research
12/16/2019

PDQ TMK + PDQF – A Test Drive of Facebook's Perceptual Hashing Algorithms

Efficient and reliable automated detection of modified image and multime...
research
07/06/2012

Multimodal similarity-preserving hashing

We introduce an efficient computational framework for hashing data belon...
research
09/10/2021

C-MinHash: Practically Reducing Two Permutations to Just One

Traditional minwise hashing (MinHash) requires applying K independent pe...
research
08/29/2018

Efficient Region of Visual Interests Search for Geo-multimedia Data

With the proliferation of online social networking services and mobile s...

Please sign up or login with your details

Forgot password? Click here to reset