Iroko: A Framework to Prototype Reinforcement Learning for Data Center Traffic Control

12/24/2018
by   Fabian Ruffy, et al.
0

Recent networking research has identified that data-driven congestion control (CC) can be more efficient than traditional CC in TCP. Deep reinforcement learning (RL), in particular, has the potential to learn optimal network policies. However, RL suffers from instability and over-fitting, deficiencies which so far render it unacceptable for use in datacenter networks. In this paper, we analyze the requirements for RL to succeed in the datacenter context. We present a new emulator, Iroko, which we developed to support different network topologies, congestion control algorithms, and deployment scenarios. Iroko interfaces with the OpenAI gym toolkit, which allows for fast and fair evaluation of different RL and traditional CC algorithms under the same conditions. We present initial benchmarks on three deep RL algorithms compared to TCP New Vegas and DCTCP. Our results show that these algorithms are able to learn a CC policy which exceeds the performance of TCP New Vegas on a dumbbell and fat-tree topology. We make our emulator open-source and publicly available: https://github.com/dcgym/iroko

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2019

MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions

Effective network congestion control strategies are key to keeping the I...
research
10/22/2020

When Machine Learning Meets Congestion Control: A Survey and Comparison

Machine learning (ML) has seen a significant surge and uptake across man...
research
06/27/2023

Learning to Sail Dynamic Networks: The MARLIN Reinforcement Learning Framework for Congestion Control in Tactical Environments

Conventional Congestion Control (CC) algorithms,such as TCP Cubic, strug...
research
02/12/2022

Automatic Curriculum Generation for Learning Adaptation in Networking

As deep reinforcement learning (RL) showcases its strengths in networkin...
research
07/22/2021

Accelerating Quadratic Optimization with Reinforcement Learning

First-order methods for quadratic optimization such as OSQP are widely u...
research
02/18/2021

Reinforcement Learning for Datacenter Congestion Control

We approach the task of network congestion control in datacenters using ...
research
12/08/2021

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives

We present ShinRL, an open-source library specialized for the evaluation...

Please sign up or login with your details

Forgot password? Click here to reset