Network Report: A Structured Description for Network Datasets

06/08/2022
by   Xinyi Zheng, et al.
0

The rapid development of network science and technologies depends on shareable datasets. Currently, there is no standard practice for reporting and sharing network datasets. Some network dataset providers only share links, while others provide some contexts or basic statistics. As a result, critical information may be unintentionally dropped, and network dataset consumers may misunderstand or overlook critical aspects. Inappropriately using a network dataset can lead to severe consequences (e.g., discrimination) especially when machine learning models on networks are deployed in high-stake domains. Challenges arise as networks are often used across different domains (e.g., network science, physics, etc) and have complex structures. To facilitate the communication between network dataset providers and consumers, we propose network report. A network report is a structured description that summarizes and contextualizes a network dataset. Network report extends the idea of dataset reports (e.g., Datasheets for Datasets) from prior work with network-specific descriptions of the non-i.i.d. nature, demographic information, network characteristics, etc. We hope network reports encourage transparency and accountability in network research and development across different fields.

READ FULL TEXT
research
09/23/2022

A Digital Twin Description Framework and its Mapping to Asset Administration Shell

The pace of reporting on Digital Twin (DT) projects continues to acceler...
research
04/07/2023

Machine Learning with Requirements: a Manifesto

In the recent years, machine learning has made great advancements that h...
research
04/26/2022

Symlink: A New Dataset for Scientific Symbol-Description Linking

Mathematical symbols and descriptions appear in various forms across doc...
research
02/04/2022

Structured Prediction Problem Archive

Structured prediction problems are one of the fundamental tools in machi...
research
10/05/2018

Model Cards for Model Reporting

Trained machine learning models are increasingly used to perform high-im...
research
10/21/2019

Trouble with the Curve: Predicting Future MLB Players Using Scouting Reports

In baseball, a scouting report profiles a player's characteristics and t...
research
04/21/2020

Have you forgotten? A method to assess if machine learning models have forgotten data

In the era of deep learning, aggregation of data from several sources is...

Please sign up or login with your details

Forgot password? Click here to reset