Scalable real-time processing with Spark Streaming: implementation and design of a Car Information System

09/14/2017
by   Philipp M. Grulich, et al.
0

Streaming data processing is a hot topic in big data these days, because it made it possible to process a huge amount of events within a low latency. One of the most common used open-source stream processing platforms is Spark Streaming, which is demonstrated and discussed based on a real-world use-case in this paper. The use-case is about a Car Information System, which is an example for a classic stream processing system. First the System is de- signed and engineered, whereby the application architecture is created carefully, because it should be adaptable for similar use-cases. At the end of this paper the CIS and Spark Streaming is evaluated by the use of the Goal Question Metric model. The evaluation proves that Spark Streaming is capable to create stream processing in a scalable and fault tolerant manner. But it also shows that Spark is a very fast moving project, which could cause problems during the development and maintenance of a software project.

READ FULL TEXT

page 1

page 33

research
06/18/2018

AlertMix: A Big Data platform for multi-source streaming data

The demand for stream processing is increasing at an unprecedented rate....
research
12/08/2019

A study on Modern Messaging Systems- Kafka, RabbitMQ and NATS Streaming

Distributed messaging systems form the core of big data streaming, cloud...
research
07/31/2019

Distributed Streaming Analytics on Large-scale Oceanographic Data using Apache Spark

Real-world data from diverse domains require real-time scalable analysis...
research
12/05/2022

Whale Casting: Remote mobile streaming humpback whale vocalizations to the world

Over several days in early August 2021, while at sea in Chatham Strait, ...
research
02/23/2018

Benchmarking Distributed Stream Processing Engines

Over the last years, stream data processing has been gaining attention b...
research
05/02/2018

Architecture for Analysis of Streaming Data

While several attempts have been made to construct a scalable and flexib...
research
10/29/2021

Parallel-and-stream accelerator for computationally fast supervised learning

Two dominant distributed computing strategies have emerged to overcome t...

Please sign up or login with your details

Forgot password? Click here to reset