Custom 8-bit floating point value format for reducing shared memory bank conflict in approximate nearest neighbor search

01/17/2023
by   Hiroyuki Ootomo, et al.
0

The k-nearest neighbor search is used in various applications such as machine learning, computer vision, database search, and information retrieval. While the computational cost of the exact nearest neighbor search is enormous, an approximate nearest neighbor search (ANNS) has been attracting much attention. IVFPQ is one of the ANNS methods. Although we can leverage the high bandwidth and low latency of shared memory to compute the search phase of the IVFPQ on NVIDIA GPUs, the throughput can degrade due to shared memory bank conflict. To reduce the bank conflict and improve the search throughput, we propose a custom 8-bit floating point value format. This format doesn't have a sign bit and can be converted from/to FP32 with a few instructions. We use this format for IVFPQ on GPUs and achieved better performance without significant recall loss compared to FP32 and FP16.

READ FULL TEXT

page 1

page 2

page 3

research
10/19/2020

LANNS: A Web-Scale Approximate Nearest Neighbor Lookup System

Nearest neighbor search (NNS) has a wide range of applications in inform...
research
08/05/2020

Fast top-K Cosine Similarity Search through XOR-Friendly Binary Quantization on GPUs

We explore the use of GPU for accelerating large scale nearest neighbor ...
research
08/29/2023

CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs

Approximate Nearest Neighbor Search (ANNS) plays a critical role in vari...
research
11/05/2021

SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search

The in-memory algorithms for approximate nearest neighbor search (ANNS) ...
research
02/20/2019

Empowering Elasticsearch with Exact and Fast r-Neighbor Search in Hamming Space

A growing interest has been witnessed recently in building nearest neigh...
research
07/12/2022

Accelerating Large-Scale Graph-based Nearest Neighbor Search on a Computational Storage Platform

K-nearest neighbor search is one of the fundamental tasks in various app...
research
02/08/2022

Orthogonal Matrices for MBAT Vector Symbolic Architectures, and a "Soft" VSA Representation for JSON

Vector Symbolic Architectures (VSAs) give a way to represent a complex o...

Please sign up or login with your details

Forgot password? Click here to reset