Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner Product Search

11/12/2019
by   Xinyan Dai, et al.
0

Vector quantization (VQ) techniques are widely used in similarity search for data compression, fast metric computation and etc. Originally designed for Euclidean distance, existing VQ techniques (e.g., PQ, AQ) explicitly or implicitly minimize the quantization error. In this paper, we present a new angle to analyze the quantization error, which decomposes the quantization error into norm error and direction error. We show that quantization errors in norm have much higher influence on inner products than quantization errors in direction, and small quantization error does not necessarily lead to good performance in maximum inner product search (MIPS). Based on this observation, we propose norm-explicit quantization (NEQ) — a general paradigm that improves existing VQ techniques for MIPS. NEQ quantizes the norms of items in a dataset explicitly to reduce errors in norm, which is crucial for MIPS. For the direction vectors, NEQ can simply reuse an existing VQ technique to quantize them without modification. We conducted extensive experiments on a variety of datasets and parameter configurations. The experimental results show that NEQ improves the performance of various VQ techniques for MIPS, including PQ, OPQ, RQ and AQ.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/27/2019

New Loss Functions for Fast Maximum Inner Product Search

Quantization based methods are popular for solving large scale maximum i...
research
09/04/2015

Quantization based Fast Inner Product Search

We propose a quantization based approach for fast approximate Maximum In...
research
09/30/2019

Understanding and Improving Proximity Graph based Maximum Inner Product Search

The inner-product navigable small world graph (ip-NSW) represents the st...
research
04/16/2021

Search-oriented Differentiable Product Quantization

Product quantization (PQ) is a popular approach for maximum inner produc...
research
06/30/2017

Bolt: Accelerated Data Mining with Fast Vector Compression

Vectors of data are at the heart of machine learning and data mining. Re...
research
08/28/2010

On Euclidean Norm Approximations

Euclidean norm calculations arise frequently in scientific and engineeri...
research
04/10/2017

Pyramid Vector Quantization for Deep Learning

This paper explores the use of Pyramid Vector Quantization (PVQ) to redu...

Please sign up or login with your details

Forgot password? Click here to reset