FRESH: Fréchet Similarity with Hashing

09/07/2018
by   Matteo Ceccarello, et al.
0

Massive datasets of curves, such as time series and trajectories, are continuously generated by mobile and sensing devices. A relevant operation on curves is similarity search: given a dataset S of curves, construct a data structure that, for any query curve q, finds the curves in S similar to q. Similarity search is a computational demanding task, in particular when a robust distance function is used, such as the continuous Fréchet distance. In this paper, we propose FRESH, a novel approximate solution to find similar curves under the continuous Fréchet distance. FRESH leverages on a locality sensitive hashing scheme for detecting candidate near neighbors of the query curve, and a subsequent pruning step based on a pipeline of curve simplifications. By relaxing the requirement of exact and deterministic solutions, FRESH reaches high performance and outperforms the state-of-the-art approaches. The experiments indeed show that, with a recall larger than 80 precision 100 baseline given by the best solutions developed for the ACM SIGSPATIAL 2017 challenge on the Fréchet distance. Furthermore, the improvement peaks up to two orders of magnitude, and even more, by relaxing the precision.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/20/2019

Approximate Nearest Neighbor for Curves --- Simple, Efficient, and Deterministic

In the (1+ε,r)-approximate-near-neighbor problem for curves (ANNC) under...
research
08/21/2020

(2+ε)-ANN for time series under the Fréchet distance

We study approximate-near-neighbor data structures for time series under...
research
04/17/2020

On the Approximate Nearest Neighbor Queries among Curves under the Fréchet Distance

Approximate nearest neighbor search (ANNS) is a long-studied problem in ...
research
10/06/2011

Bayesian Locality Sensitive Hashing for Fast Similarity Search

Given a collection of objects and an associated similarity measure, the ...
research
12/07/2012

Similarity of Polygonal Curves in the Presence of Outliers

The Fréchet distance is a well studied and commonly used measure to capt...
research
03/02/2018

A fast implementation of near neighbors queries for Fréchet distance (GIS Cup)

This paper describes an implementation of fast near-neighbours queries (...
research
06/21/2020

Equivalence of several curves assessing the similarity between probability distributions

The recent advent of powerful generative models has triggered the renewe...

Please sign up or login with your details

Forgot password? Click here to reset