DeDrift: Robust Similarity Search under Content Drift

08/05/2023
by   Dmitry Baranchuk, et al.
0

The statistical distribution of content uploaded and searched on media sharing sites changes over time due to seasonal, sociological and technical factors. We investigate the impact of this "content drift" for large-scale similarity search tools, based on nearest neighbor search in embedding space. Unless a costly index reconstruction is performed frequently, content drift degrades the search accuracy and efficiency. The degradation is especially severe since, in general, both the query and database distributions change. We introduce and analyze real-world image and video datasets for which temporal information is available over a long time period. Based on the learnings, we devise DeDrift, a method that updates embedding quantizers to continuously adapt large-scale indexing structures on-the-fly. DeDrift almost eliminates the accuracy degradation due to the query and database content drift while being up to 100x faster than a full index reconstruction.

READ FULL TEXT

page 3

page 4

page 11

page 12

page 13

research
06/11/2019

Similarity Problems in High Dimensions

The main contribution of this dissertation is the introduction of new or...
research
12/18/2018

Efficient Autotuning of Hyperparameters in Approximate Nearest Neighbor Search

Approximate nearest neighbor algorithms are used to speed up nearest nei...
research
04/18/2018

HD-Index: Pushing the Scalability-Accuracy Boundary for Approximate kNN Search in High-Dimensional Spaces

Nearest neighbor searching of large databases in high-dimensional spaces...
research
02/20/2014

Survey on Sparse Coded Features for Content Based Face Image Retrieval

Content based image retrieval, a technique which uses visual contents of...
research
09/21/2010

Balancing clusters to reduce response time variability in large scale image search

Many algorithms for approximate nearest neighbor search in high-dimensio...
research
08/25/2020

Multi-Face: Self-supervised Multiview Adaptation for Robust Face Clustering in Videos

Robust face clustering is a key step towards computational understanding...
research
05/15/2020

Near-duplicate video detection featuring coupled temporal and perceptual visual structures and logical inference based matching

We propose in this paper an architecture for near-duplicate video detect...

Please sign up or login with your details

Forgot password? Click here to reset