To Index or Not to Index: Optimizing Maximum Inner Product Search

06/05/2017
by   Firas Abuzaid, et al.
0

Making top-K predictions for state-of-the-art Matrix Factorization models requires solving the Maximum Inner Product Search problem. Solving MIPS can be computationally expensive, thus spurring the recent development of several indexing techniques for this task. These techniques generally exploit similarity between user or item weights in the models to accelerate predictions. In this paper, we show that the current state of the art does not always outperform brute-force matrix multiplication: these models may have significantly less similarity than can be exploited by these techniques. To address this problem, we propose RecOpt, a system that uses an efficient, sampling-based estimation technique to automatically choose between indexing or brute force. In addition, we propose a new baseline indexing scheme, RecDex, that can leverage blocked linear algebra to improve indexing-based serving performance. Together, RecOpt and RecDex outperform state-of-the-art indexes by 3.2× on average, and up to 10.9×, on widely studied models for recommendations and MIPS.

READ FULL TEXT

page 9

page 11

research
06/05/2017

SimDex: Exploiting Model Similarity in Exact Matrix Factorization Recommendations

We present SimDex, a new technique for serving exact top-K recommendatio...
research
10/14/2021

Reverse Maximum Inner Product Search: How to efficiently find users who would like to buy my item?

The MIPS (maximum inner product search), which finds the item with the h...
research
07/07/2019

Search-Based Serving Architecture of Embeddings-Based Recommendations

Over the past 10 years, many recommendation techniques have been based o...
research
07/21/2015

Clustering is Efficient for Approximate Maximum Inner Product Search

Efficient Maximum Inner Product Search (MIPS) is an important task that ...
research
02/28/2017

Billion-scale similarity search with GPUs

Similarity search finds application in specialized database systems hand...
research
06/20/2020

Coconut Palm: Static and Streaming Data Series Exploration Now in your Palm

Many modern applications produce massive streams of data series and main...
research
08/23/2019

Revisiting Wedge Sampling for Budgeted Maximum Inner Product Search

Top-k maximum inner product search (MIPS) is a central task in many mach...

Please sign up or login with your details

Forgot password? Click here to reset