Clustering is Efficient for Approximate Maximum Inner Product Search

07/21/2015
by Alex Auvolat, et al.

Efficient Maximum Inner Product Search (MIPS) is an important task with wide applicability in recommendation systems and in classification with a large number of classes. Solutions based on locality-sensitive hashing (LSH), as well as tree-based solutions, have been investigated in the recent literature to perform approximate MIPS in sublinear time. In this paper, we compare these to another, extremely simple approach for solving approximate MIPS, based on variants of the k-means clustering algorithm. Specifically, we propose to train a spherical k-means after having reduced the MIPS problem to a Maximum Cosine Similarity Search (MCSS). Experiments on two standard recommendation system benchmarks, as well as on large-vocabulary word embeddings, show that this simple approach yields much higher speedups, for the same retrieval precision, than current state-of-the-art hashing-based and tree-based methods. This simple method also yields more robust retrievals when the query is corrupted by noise.
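
The two steps named in the abstract (reduce MIPS to MCSS, then cluster with spherical k-means) can be illustrated with a minimal NumPy sketch. The abstract does not say which reduction the authors use, so the sketch assumes one well-known construction: scale the database so the largest norm is 1, then append one coordinate so that every database vector has unit norm, while the query gets a 0 appended. All function names and the parameters `k`, `n_iter` and `n_probe` are hypothetical and chosen for illustration; this is not the authors' implementation.

```python
import numpy as np

def augment_database(X):
    """Assumed reduction: rescale so the largest row norm is 1, then append
    sqrt(1 - ||x||^2) so that every augmented row has unit norm."""
    Xs = X / np.linalg.norm(X, axis=1).max()
    extra = np.sqrt(np.clip(1.0 - np.sum(Xs ** 2, axis=1), 0.0, None))
    return np.hstack([Xs, extra[:, None]])

def augment_query(q):
    """Append a zero so the extra database coordinate never affects q . x."""
    return np.append(q, 0.0)

def spherical_kmeans(P, k, n_iter=20, seed=0):
    """Plain spherical k-means on unit-norm rows: assign by largest dot
    product, then set each centroid to the normalized sum of its members."""
    rng = np.random.default_rng(seed)
    centroids = P[rng.choice(len(P), size=k, replace=False)].copy()
    for _ in range(n_iter):
        assign = np.argmax(P @ centroids.T, axis=1)
        for j in range(k):
            total = P[assign == j].sum(axis=0)
            norm = np.linalg.norm(total)
            if norm > 0:  # leave empty clusters unchanged
                centroids[j] = total / norm
    assign = np.argmax(P @ centroids.T, axis=1)
    return centroids, assign

def approx_mips(q, X, centroids, assign, n_probe=1):
    """Score only the points in the n_probe clusters whose centroids are
    closest (in cosine) to the augmented query; return the best row index."""
    best_clusters = np.argsort(-(centroids @ augment_query(q)))[:n_probe]
    candidates = np.flatnonzero(np.isin(assign, best_clusters))
    if candidates.size == 0:  # all probed clusters empty: fall back to exact search
        candidates = np.arange(len(X))
    return candidates[np.argmax(X[candidates] @ q)]
```

Because every augmented database vector has unit norm, the cosine between the augmented query and an augmented vector is the original inner product divided by a quantity that depends only on the query, so the ranking is unchanged. Probing fewer clusters (`n_probe` smaller than `k`) trades retrieval precision for speed; probing all of them recovers the exact search.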
