Sequences, yet Functions: The Dual Nature of Data-Stream Processing

01/16/2018
by Sebastian Herbst, et al.

Data-stream processing has continuously risen in importance as the amount of available data has been steadily increasing over the last decade. Besides traditional domains such as data-center monitoring and click analytics, there is an increasing number of network-enabled production machines that generate continuous streams of data. Due to their continuous nature, queries on data-streams can be more complex, and distinctly harder to understand, than database queries. As users have to consider operational details, maintenance and debugging become challenging. Current approaches model data-streams as sequences, because this is the way they are physically received. These models result in an implementation-focused perspective. We explore an alternative way of modeling data-streams by focusing on time-slicing semantics. This focus results in a model based on functions, which is better suited for reasoning about query semantics. By adapting the definitions of relevant concepts in stream processing to our model, we illustrate the practical usefulness of our approach. Thereby, we link data-streams and query primitives to concepts in functional programming and mathematics. Most notably, we prove that data-streams are monads, and show how to derive monad definitions for current data-stream models. We provide an abstract, yet practical perspective on data-stream related subjects based on a sound, consistent query model. Our work can serve as a solid foundation for future data-stream query-languages.
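
To make the function-based view concrete, here is a minimal sketch. It is not taken from the paper. It assumes that a stream can be modeled as a plain function from a timestamp to the value valid at that instant, in which case the monad operations coincide with those of the familiar reader monad. The names `Stream`, `unit`, `fmap` and `bind`, and the example streams, are hypothetical and chosen only for illustration.

```python
from typing import Callable, TypeVar

T = TypeVar("T")
U = TypeVar("U")

# Assumption (not from the paper): timestamps are floats, and a stream is a
# total function from a timestamp to the value valid at that instant.
Stream = Callable[[float], T]


def unit(value: T) -> Stream:
    """Constant stream: the same value at every point in time."""
    return lambda t: value


def fmap(f: Callable[[T], U], s: Stream) -> Stream:
    """Apply f to the value of s at every time slice."""
    return lambda t: f(s(t))


def bind(s: Stream, k: Callable[[T], Stream]) -> Stream:
    """Monadic bind: take the value of s at time t, build a stream from it
    with k, and slice that stream at the same time t."""
    return lambda t: k(s(t))(t)


# Hypothetical example data: a temperature stream and a time-varying threshold.
celsius: Stream = lambda t: 20.0 + t
threshold: Stream = lambda t: 25.0 if t < 10 else 30.0

# A per-slice transformation, and a query that combines two streams.
fahrenheit = fmap(lambda c: c * 9 / 5 + 32, celsius)
alarm = bind(celsius, lambda c: fmap(lambda th: c > th, threshold))
```

Under these assumptions a query such as `alarm` is itself a stream, so it can be sliced at any time (`alarm(6.0)`) without reference to arrival order or buffering, which is the kind of reasoning about query semantics that the abstract attributes to a function-based model.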

research
07/16/2020

Data Stream Clustering: A Review

Number of connected devices is steadily increasing and these devices con...
research
08/22/2017

Strider-lsa: Massive RDF Stream Reasoning in the Cloud

Reasoning over semantically annotated data is an emerging trend in strea...
research
02/04/2022

Monoidal Streams for Dataflow Programming

We introduce monoidal streams: a generalization of causal stream functio...
research
05/23/2019

Hypothetical answers to continuous queries over data streams

Continuous queries over data streams may suffer from blocking operations...
research
04/04/2023

Diba: A Re-configurable Stream Processor

Stream processing acceleration is driven by the continuously increasing ...
research
06/15/2021

Deep Neural Networks for Approximating Stream Reasoning with C-SPARQL

The amount of information produced, whether by newspapers, blogs and soc...
research
09/12/2023

OmniSketch: Efficient Multi-Dimensional High-Velocity Stream Analytics with Arbitrary Predicates

A key need in different disciplines is to perform analytics over fast-pa...
