DeepConfig: Automating Data Center Network Topologies Management with Machine Learning

12/11/2017
by   Christopher Streiffer, et al.
0

In recent years, many techniques have been developed to improve the performance and efficiency of data center networks. While these techniques provide high accuracy, they are often designed using heuristics that leverage domain-specific properties of the workload or hardware. In this vision paper, we argue that many data center networking techniques, e.g., routing, topology augmentation, energy savings, with diverse goals actually share design and architectural similarity. We present a design for developing general intermediate representations of network topologies using deep learning that is amenable to solving classes of data center problems. We develop a framework, DeepConfig, that simplifies the processing of configuring and training deep learning agents that use the intermediate representation to learns different tasks. To illustrate the strength of our approach, we configured, implemented, and evaluated a DeepConfig-Agent that tackles the data center topology augmentation problem. Our initial results are promising --- DeepConfig performs comparably to the optimal.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/02/2022

Scalable Tail Latency Estimation for Data Center Networks

In this paper, we consider how to provide fast estimates of flow-level t...
research
02/28/2022

Machine Learning Empowered Intelligent Data Center Networking: A Survey

To support the needs of ever-growing cloud-based services, the number of...
research
02/07/2022

Optimal Direct-Connect Topologies for Collective Communications

We consider the problem of distilling optimal network topologies for col...
research
01/21/2022

Nearest Class-Center Simplification through Intermediate Layers

Recent advances in theoretical Deep Learning have introduced geometric p...
research
05/01/1997

Connectionist Theory Refinement: Genetically Searching the Space of Network Topologies

An algorithm that learns from a set of examples should ideally be able t...
research
10/03/2020

TCLNet: Learning to Locate Typhoon Center Using Deep Neural Network

The task of typhoon center location plays an important role in typhoon i...
research
09/28/2020

DCFIT: Initial Trigger-Based PFC Deadlock Detection in the Data Plane

Recent data center applications rely on lossless networks to achieve hig...

Please sign up or login with your details

Forgot password? Click here to reset