Data Readiness Levels

05/05/2017
by   Neil D. Lawrence, et al.
0

Application of models to data is fraught. Data-generating collaborators often only have a very basic understanding of the complications of collating, processing and curating data. Challenges include: poor data collection practices, missing values, inconvenient storage mechanisms, intellectual property, security and privacy. All these aspects obstruct the sharing and interconnection of data, and the eventual interpretation of data through machine learning or other approaches. In project reporting, a major challenge is in encapsulating these problems and enabling goals to be built around the processing of data. Project overruns can occur due to failure to account for the amount of time required to curate and collate. But to understand these failures we need to have a common language for assessing the readiness of a particular data set. This position paper proposes the use of data readiness levels: it gives a rough outline of three stages of data preparedness and speculates on how formalisation of these levels into a common language for data readiness could facilitate project management.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/07/2022

Assessing Project-Level Fine-Tuning of ML4SE Models

Machine Learning for Software Engineering (ML4SE) is an actively growing...
research
09/23/2022

"My Privacy for their Security": Employees' Privacy Perspectives and Expectations when using Enterprise Security Software

Employees are often required to use Enterprise Security Software ("ESS")...
research
02/07/2019

PAI Data, Summary of the Project PAI Data Protocol

The Project PAI Data Protocol ("PAI Data") is a specification that exten...
research
06/13/2022

Consent verification monitoring

Advances in service personalization are driven by low-cost data collecti...
research
02/07/2019

A Japanese translation of "PAI Data, Summary of the Project PAI Data Protocol" by Jincheng Du, Dan Fang, Mark Harvilla

The Project PAI Data Protocol ("PAI Data") is a specification that exten...
research
02/07/2019

A Simplified Chinese translation of "PAI Data, Summary of the Project PAI Data Protocol" by Jincheng Du, Dan Fang, Mark Harvilla

The Project PAI Data Protocol ("PAI Data") is a specification that exten...
research
11/20/2020

Resolving the cybersecurity Data Sharing Paradox to scale up cybersecurity via a co-production approach towards data sharing

As cybercriminals scale up their operations to increase their profits or...

Please sign up or login with your details

Forgot password? Click here to reset