Sharon: Shared Online Event Sequence Aggregation

10/06/2020
by   Olga Poppe, et al.
0

Streaming systems evaluate massive workloads of event sequence aggregation queries. State-of-the-art approaches suffer from long delays caused by not sharing intermediate results of similar queries and by constructing event sequences prior to their aggregation. To overcome these limitations, our Shared Online Event Sequence Aggregation (Sharon) approach shares intermediate aggregates among multiple queries while avoiding the expensive construction of event sequences. Our Sharon optimizer faces two challenges. One, a sharing decision is not always beneficial. Two, a sharing decision may exclude other sharing opportunities. To guide our Sharon optimizer, we compactly encode sharing candidates, their benefits, and conflicts among candidates into the Sharon graph. Based on the graph, we map our problem of finding an optimal sharing plan to the Maximum Weight Independent Set (MWIS) problem. We then use the guaranteed weight of a greedy algorithm for the MWIS problem to prune the search of our sharing plan finder without sacrificing its optimality. The Sharon optimizer is shown to produce sharing plans that achieve up to an 18-fold speed-up compared to state-of-the-art approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/02/2021

To Share, or not to Share Online Event Trend Aggregation Over Bursty Event Streams

Complex event processing (CEP) systems continuously evaluate large workl...
research
10/06/2020

GRETA: Graph-based Real-time Event Trend Aggregation

Streaming applications from algorithmic trading to traffic management de...
research
10/06/2020

Event Trend Aggregation Under Rich Event Matching Semantics

Streaming applications from health care analytics to algorithmic trading...
research
03/07/2018

Compact Representations of Event Sequences

We introduce a new technique for the efficient management of large seque...
research
02/28/2022

Efficient Massively Parallel Join Optimization for Large Queries

Modern data analytical workloads often need to run queries over a large ...
research
07/16/2023

Real-Time Analytics by Coordinating Reuse and Work Sharing

Analytical tools often require real-time responses for highly concurrent...

Please sign up or login with your details

Forgot password? Click here to reset