Towards Intelligent Load Balancing in Data Centers

10/27/2021
by   Zhiyuan Yao, et al.
0

Network load balancers are important components in data centers to provide scalable services. Workload distribution algorithms are based on heuristics, e.g., Equal-Cost Multi-Path (ECMP), Weighted-Cost Multi-Path (WCMP) or naive machine learning (ML) algorithms, e.g., ridge regression. Advanced ML-based approaches help achieve performance gain in different networking and system problems. However, it is challenging to apply ML algorithms on networking problems in real-life systems. It requires domain knowledge to collect features from low-latency, high-throughput, and scalable networking systems, which are dynamic and heterogenous. This paper proposes Aquarius to bridge the gap between ML and networking systems and demonstrates its usage in the context of network load balancers. This paper demonstrates its ability of conducting both offline data analysis and online model deployment in realistic systems. The results show that the ML model trained and deployed using Aquarius improves load balancing performance yet they also reveals more challenges to be resolved to apply ML for networking systems.

READ FULL TEXT
research
06/03/2022

Learning Distributed and Fair Policies for Network Load Balancing as Markov Potential Game

This paper investigates the network load balancing problem in data cente...
research
08/24/2022

Efficient Data-Driven Network Functions

Cloud environments require dynamic and adaptive networking policies. It ...
research
10/13/2021

Competitive Multi-Agent Load Balancing with Adaptive Policies in Wireless Networks

Using Machine Learning (ML) techniques for the next generation wireless ...
research
01/27/2022

Multi-Agent Reinforcement Learning for Network Load Balancing in Data Center

This paper presents the network load balancing problem, a challenging re...
research
06/10/2022

Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning Models

We study how networking corruptions–data corruptions caused by networkin...
research
02/28/2022

Machine Learning Empowered Intelligent Data Center Networking: A Survey

To support the needs of ever-growing cloud-based services, the number of...
research
06/19/2023

Modular Simulation Environment Towards OTN AI-based Solutions

The current trend for highly dynamic and virtualized networking infrastr...

Please sign up or login with your details

Forgot password? Click here to reset