Memory-Efficient Performance Monitoring on Programmable Switches with Lean Algorithms

11/16/2019
by Zaoxing Liu, et al.

Network performance problems are notoriously difficult to diagnose. Prior profiling systems collect performance statistics by keeping information about each network flow, but maintaining per-flow state is not scalable on resource-constrained NIC and switch hardware. Instead, we propose sketch-based performance monitoring using memory that is sublinear in the number of flows. Existing sketches estimate flow monitoring metrics based on flow sizes. In contrast, performance monitoring typically requires combining information across pairs of packets, such as matching a data packet with its acknowledgment to compute a round-trip time. We define a new class of lean algorithms that use memory sublinear in both the size of input data and the number of flows. We then introduce lean algorithms for a set of important statistics, such as identifying flows with high latency, loss, out-of-order, or retransmitted packets. We implement prototypes of our lean algorithms on a commodity programmable switch using the P4 language. Our experiments show that lean algorithms detect ∼82% of top 100 problematic flows among real-world packet traces using just 40KB memory.
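To make the idea of combining information across pairs of packets concrete, here is a minimal sketch of matching a data packet with its acknowledgment in a fixed-size table. It is not the paper's algorithm: the class name LeanRttTable, the slot count, the CRC32 hash, and the overwrite-on-collision policy are all assumptions chosen for illustration. The only property it is meant to show is that memory is bounded by the table size rather than by the number of flows, at the cost of losing some samples.

```python
import zlib


class LeanRttTable:
    """Hypothetical fixed-size table that pairs data packets with ACKs.

    Memory is bounded by `slots`, not by the number of flows. A colliding
    entry overwrites the older one, so some RTT samples are simply lost.
    """

    def __init__(self, slots=1024):
        self.slots = slots
        # Each slot holds (flow, expected_ack, send_time) or None.
        self.table = [None] * slots

    def _index(self, flow, expected_ack):
        # Hash the flow key together with the ACK number we expect to see.
        key = f"{flow}|{expected_ack}".encode()
        return zlib.crc32(key) % self.slots

    def on_data(self, flow, seq, payload_len, now):
        # The ACK for this segment is assumed to carry seq + payload_len.
        expected_ack = seq + payload_len
        self.table[self._index(flow, expected_ack)] = (flow, expected_ack, now)

    def on_ack(self, flow, ack, now):
        # `flow` must be the key of the data direction, i.e. the caller
        # reverses the ACK packet's addresses and ports before the lookup.
        i = self._index(flow, ack)
        entry = self.table[i]
        if entry is None or entry[0] != flow or entry[1] != ack:
            return None  # no stored match: evicted, never seen, or already used
        self.table[i] = None  # free the slot once the pair is matched
        return now - entry[2]  # round-trip time sample


table = LeanRttTable(slots=8)
table.on_data("10.0.0.1:5000->10.0.0.2:80", seq=1000, payload_len=500, now=1.0)
rtt = table.on_ack("10.0.0.1:5000->10.0.0.2:80", ack=1500, now=1.25)
```

A real switch pipeline would implement the table as register arrays with a hardware hash. This sketch also stores the full flow key in each slot for clarity, where a more compact design might keep only a short fingerprint.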

