Toward Metric Indexes for Incremental Insertion and Querying

01/12/2018
by   Edward Raff, et al.
0

In this work we explore the use of metric index structures, which accelerate nearest neighbor queries, in the scenario where we need to interleave insertions and queries during deployment. This use-case is inspired by a real-life need in malware analysis triage, and is surprisingly understudied. Existing literature tends to either focus on only final query efficiency, often does not support incremental insertion, or does not support arbitrary distance metrics. We modify and improve three algorithms to support our scenario of incremental insertion and querying with arbitrary metrics, and evaluate them on multiple datasets and distance metrics while varying the value of k for the desired number of nearest neighbors. In doing so we determine that our improved Vantage-Point tree of Minimum-Variance performs best for this scenario.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/31/2019

The SuperM-Tree: Indexing metric spaces with sized objects

A common approach to implementing similarity search applications is the ...
research
01/25/2019

Metric Spaces with Expensive Distances

In algorithms for finite metric spaces, it is common to assume that the ...
research
09/22/2017

Efficient Nearest-Neighbor Search for Dynamical Systems with Nonholonomic Constraints

Nearest-neighbor search dominates the asymptotic complexity of sampling-...
research
05/28/2020

A Practical Index Structure Supporting Fréchet Proximity Queries Among Trajectories

We present a scalable approach for range and k nearest neighbor queries ...
research
03/18/2021

Nearest-Neighbor Queries in Customizable Contraction Hierarchies and Applications

Customizable contraction hierarchies are one of the most popular route p...
research
12/20/2021

The Cascading Metric Tree

This paper presents the Cascaded Metric Tree (CMT) for efficient satisfa...
research
08/08/2021

Fairest Neighbors: Tradeoffs Between Metric Queries

Metric search commonly involves finding objects similar to a given sampl...

Please sign up or login with your details

Forgot password? Click here to reset