Odyssey: A Journey in the Land of Distributed Data Series Similarity Search

01/26/2023
by   Manos Chatzakis, et al.
0

This paper presents Odyssey, a novel distributed data-series processing framework that efficiently addresses the critical challenges of exhibiting good speedup and ensuring high scalability in data series processing by taking advantage of the full computational capacity of modern clusters comprised of multi-core servers. Odyssey addresses a number of challenges in designing efficient and highly scalable distributed data series index, including efficient scheduling, and load-balancing without paying the prohibitive cost of moving data around. It also supports a flexible partial replication scheme, which enables Odyssey to navigate through a fundamental trade-off between data scalability and good performance during query answering. Through a wide range of configurations and using several real and synthetic datasets, our experimental analysis demonstrates that Odyssey achieves its challenging goals.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/14/2021

Fast Data Series Indexing for In-Memory Data

Data series similarity search is a core operation for several data serie...
research
09/10/2020

Rocket: Efficient and Scalable All-Pairs Computations on Heterogeneous Platforms

All-pairs compute problems apply a user-defined function to each combina...
research
09/22/2020

Scalable Data Series Subsequence Matching with ULISSE

Data series similarity search is an important operation and at the core ...
research
02/28/2019

Speeding up Deep Learning with Transient Servers

Distributed training frameworks, like TensorFlow, have been proposed as ...
research
03/02/2020

Graph3S: A Simple, Speedy and Scalable Distributed Graph Processing System

Graph is a ubiquitous structure in many domains. The rapidly increasing ...
research
03/07/2023

Scalable and Cost-effective Data Flow Analysis for Distributed Software: Algorithms and Applications

More and more distributed software systems are being developed and deplo...
research
03/02/2021

An Analysis of Distributed Systems Syllabi With a Focus on Performance-Related Topics

We analyze a dataset of 51 current (2019-2020) Distributed Systems sylla...

Please sign up or login with your details

Forgot password? Click here to reset