Kernelized Locality-Sensitive Hashing for Semi-Supervised Agglomerative Clustering

01/16/2013
by   Boyi Xie, et al.
0

Large scale agglomerative clustering is hindered by computational burdens. We propose a novel scheme where exact inter-instance distance calculation is replaced by the Hamming distance between Kernelized Locality-Sensitive Hashing (KLSH) hashed values. This results in a method that drastically decreases computation time. Additionally, we take advantage of certain labeled data points via distance metric learning to achieve a competitive precision and recall comparing to K-Means but in much less computation time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2022

Falconn++: A Locality-sensitive Filtering Approach for Approximate Nearest Neighbor Search

We present Falconn++, a novel locality-sensitive filtering (LSF) approac...
research
12/26/2012

Hyperplane Arrangements and Locality-Sensitive Hashing with Lift

Locality-sensitive hashing converts high-dimensional feature vectors, su...
research
07/25/2018

Robust Set Reconciliation via Locality Sensitive Hashing

We consider variations of set reconciliation problems where two parties,...
research
09/25/2017

Non-iterative Label Propagation on Optimal Leading Forest

Graph based semi-supervised learning (GSSL) has intuitive representation...
research
07/17/2018

Supermodular Locality Sensitive Hashes

In this work, we show deep connections between Locality Sensitive Hashab...
research
03/16/2021

An Efficient Vectorization Scheme for Stencil Computation

Stencil computation is one of the most important kernels in various scie...
research
06/07/2022

Locality-sensitive bucketing functions for the edit distance

Many bioinformatics applications involve bucketing a set of sequences wh...

Please sign up or login with your details

Forgot password? Click here to reset