Revisiting Wedge Sampling for Budgeted Maximum Inner Product Search

08/23/2019
by   Stephan S. Lorenzen, et al.
0

Top-k maximum inner product search (MIPS) is a central task in many machine learning applications. This paper extends top-k MIPS with a budgeted setting, that asks for the best approximate top-k MIPS given a limit of B computational operations. We investigate recent advanced sampling algorithms, including wedge and diamond sampling to solve it. Though the design of these sampling schemes naturally supports budgeted top-k MIPS, they suffer from the linear cost from scanning all data points to retrieve top-k results and the performance degradation for handling negative inputs. This paper makes two main contributions. First, we show that diamond sampling is essentially a combination between wedge sampling and basic sampling for top-k MIPS. Our theoretical analysis and empirical evaluation show that wedge is competitive (often superior) to diamond on approximating top-k MIPS regarding both efficiency and accuracy. Second, we propose a series of algorithmic engineering techniques to deploy wedge sampling on budgeted top-k MIPS. Our novel deterministic wedge-based algorithm runs significantly faster than the state-of-the-art methods for budgeted and exact top-k MIPS while maintaining the top-5 precision at least 80

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/15/2018

A Bandit Approach to Maximum Inner Product Search

There has been substantial research on sub-linear time approximate algor...
research
12/14/2022

Faster Maximum Inner Product Search in High Dimensions

Maximum Inner Product Search (MIPS) is a popular problem in the machine ...
research
09/04/2015

Quantization based Fast Inner Product Search

We propose a quantization based approach for fast approximate Maximum In...
research
07/02/2020

Climbing the WOL: Training for Cheaper Inference

Efficient inference for wide output layers (WOLs) is an essential yet ch...
research
01/25/2023

An Approximate Algorithm for Maximum Inner Product Search over Streaming Sparse Vectors

Maximum Inner Product Search or top-k retrieval on sparse vectors is wel...
research
06/05/2017

To Index or Not to Index: Optimizing Maximum Inner Product Search

Making top-K predictions for state-of-the-art Matrix Factorization model...
research
05/20/2017

Ensemble Sampling

Thompson sampling has emerged as an effective heuristic for a broad rang...

Please sign up or login with your details

Forgot password? Click here to reset