The Softwarised Network Data Zoo

05/13/2019
by   Manuel Peuster, et al.
0

More and more management and orchestration approaches for (software) networks are based on machine learning paradigms and solutions. These approaches depend not only on their program code to operate properly, but also require enough input data to train their internal models. However, such training data is barely available for the software networking domain and most presented solutions rely on their own, sometimes not even published, data sets. This makes it hard, or even infeasible, to reproduce and compare many of the existing solutions. As a result, it ultimately slows down the adoption of machine learning approaches in softwarised networks. To this end, we introduce the "softwarised network data zoo" (SNDZoo), an open collection of software networking data sets aiming to streamline and ease machine learning research in the software networking domain. We present a general methodology to collect, archive, and publish those data sets for use by other researches and, as an example, eight initial data sets, focusing on the performance of virtualised network functions.

READ FULL TEXT

page 1

page 2

research
04/21/2021

Applications of Artificial Intelligence, Machine Learning and related techniques for Computer Networking Systems

This article presents a primer/overview of applications of Artificial In...
research
11/22/2017

No Classification without Representation: Assessing Geodiversity Issues in Open Data Sets for the Developing World

Modern machine learning systems such as image classifiers rely heavily o...
research
08/24/2022

Efficient Data-Driven Network Functions

Cloud environments require dynamic and adaptive networking policies. It ...
research
09/09/2020

RapidLearn: A General Purpose Toolkit for Autonomic Networking

Software Defined Networking has unfolded a new area of opportunity in di...
research
02/23/2021

Data Engineering for Everyone

Data engineering is one of the fastest-growing fields within machine lea...
research
01/31/2020

Automatic lung segmentation in routine imaging is a data diversity problem, not a methodology problem

Automated segmentation of anatomical structures is a crucial step in man...
research
11/06/2019

Searching to Exploit Memorization Effect in Learning from Corrupted Labels

Sample-selection approaches, which attempt to pick up clean instances fr...

Please sign up or login with your details

Forgot password? Click here to reset