Distributed Subtrajectory Join on Massive Datasets

03/18/2019
by   Panagiotis Tampakis, et al.

Joining trajectory datasets is a significant operation in mobility data analytics and the cornerstone of various methods that aim to extract knowledge from them. In the era of Big Data, the production of mobility data has become massive and, consequently, performing such an operation in a centralized way is not feasible. In this paper, we address the problem of Distributed Subtrajectory Join processing by utilizing the MapReduce programming model. Compared to traditional trajectory join queries, this problem is even more challenging, since the goal is to retrieve all the "maximal" portions of trajectories that are "similar". We propose three solutions: (i) a well-designed basic solution, coined DTJb, (ii) a solution that uses a preprocessing step that repartitions the data, labeled DTJr, and (iii) a solution that, additionally, employs an indexing scheme, named DTJi. In our experimental study, we utilize a 56GB dataset of real trajectories from the maritime domain, which, to the best of our knowledge, is the largest real dataset used for experimentation in the literature of trajectory data management. The results show that DTJi performs up to 16x faster than DTJb, 10x faster than DTJr, and 3x faster than the closest related state-of-the-art algorithm.
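
To make the notion of "maximal" similar portions concrete, the following is a minimal single-machine sketch, not the DTJb, DTJr, or DTJi algorithms. It rests on assumptions the abstract does not state: both trajectories are sampled at the same timestamps, two points are "similar" when they share a timestamp and lie within a Euclidean distance `eps`, and a match must span at least `min_len` consecutive samples. The function name `maximal_similar_runs` and all parameters are hypothetical.

```python
from math import hypot

def maximal_similar_runs(r, s, eps, min_len=2):
    """Return (t_start, t_end) intervals where r and s stay within eps.

    r, s: lists of (t, x, y) tuples sorted by t (assumed common sampling).
    eps: assumed spatial threshold; min_len: assumed minimum run length.
    """
    # Index the second trajectory by timestamp for constant-time lookup.
    s_by_t = {t: (x, y) for t, x, y in s}
    runs, current = [], []
    for t, x, y in r:
        other = s_by_t.get(t)
        if other is not None and hypot(x - other[0], y - other[1]) <= eps:
            # Extend the current run of consecutive matching samples.
            current.append(t)
        else:
            # The run ends here; keep it only if it is long enough.
            if len(current) >= min_len:
                runs.append((current[0], current[-1]))
            current = []
    # Flush a run that reaches the end of the trajectory.
    if len(current) >= min_len:
        runs.append((current[0], current[-1]))
    return runs

r = [(0, 0, 0), (1, 1, 0), (2, 2, 0), (3, 3, 0), (4, 4, 0)]
s = [(0, 0, 0.5), (1, 1, 0.5), (2, 2, 5), (3, 3, 0.2), (4, 4, 0.1)]
result = maximal_similar_runs(r, s, eps=1.0)
print(result)
```

Each reported interval is maximal in the sense that it cannot be extended by one more sample on either side without violating the distance threshold. A distributed variant would additionally have to handle runs that cross partition boundaries, which this sketch does not attempt.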


