Distributed Kafka Clusters: A Novel Approach to Global Message Ordering

09/10/2023
by   Shashank Kumar, et al.
0

In contemporary distributed systems, logs are produced at an astounding rate, generating terabytes of data within mere seconds. These logs, containing pivotal details like system metrics, user actions, and diverse events, are foundational to the system's consistent and accurate operations. Precise log ordering becomes indispensable to avert potential ambiguities and discordances in system functionalities. Apache Kafka, a prevalent distributed message queue, offers significant solutions to various distributed log processing challenges. However, it presents an inherent limitation while Kafka ensures the in-order delivery of messages within a single partition to the consumer, it falls short in guaranteeing a global order for messages spanning multiple partitions. This research delves into innovative methodologies to achieve global ordering of messages within a Kafka topic, aiming to bolster the integrity and consistency of log processing in distributed systems. Our code is available on GitHub.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/17/2023

2P-BFT-Log: 2-Phase Single-Author Append-Only Log for Adversarial Environments

Replicated append-only logs sequentially order messages from the same au...
research
10/08/2020

Diagnosing Distributed Systems through Log Data Analysis

The log-based analysis and trouble-shooting has remained prevalent and c...
research
11/10/2016

Length Matters: Clustering System Log Messages using Length of Words

The analysis techniques of system log messages (syslog messages) have a ...
research
06/22/2022

Kafka Consumer Group Autoscaler

Message brokers enable asynchronous communication between data producers...
research
08/17/2023

Are They All Good? Studying Practitioners' Expectations on the Readability of Log Messages

Developers write logging statements to generate logs that provide run-ti...
research
08/25/2023

The Arrangement of Marks Impacts Afforded Messages: Ordering, Partitioning, Spacing, and Coloring in Bar Charts

Data visualizations present a massive number of potential messages to an...
research
04/21/2022

Message Flow Analysis with Complex Causal Links for Distributed ROS 2 Systems

Distributed robotic systems rely heavily on publish-subscribe frameworks...

Please sign up or login with your details

Forgot password? Click here to reset