First Study on Data Readiness Level

01/18/2017
by   Hui Guan, et al.
0

We introduce the idea of Data Readiness Level (DRL) to measure the relative richness of data to answer specific questions often encountered by data scientists. We first approach the problem in its full generality explaining its desired mathematical properties and applications and then we propose and study two DRL metrics. Specifically, we define DRL as a function of at least four properties of data: Noisiness, Believability, Relevance, and Coherence. The information-theoretic based metrics, Cosine Similarity and Document Disparity, are proposed as indicators of Relevance and Coherence for a piece of data. The proposed metrics are validated through a text-based experiment using Twitter data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/19/2020

Minimax Strikes Back

Deep Reinforcement Learning (DRL) reaches a superhuman level of play in ...
research
06/30/2021

Evaluation of Thematic Coherence in Microblogs

Collecting together microblogs representing opinions about the same topi...
research
06/08/2020

Evaluation Criteria for Instance-based Explanation

Explaining predictions made by complex machine learning models helps use...
research
09/14/2021

Dependability Analysis of Deep Reinforcement Learning based Robotics and Autonomous Systems

While Deep Reinforcement Learning (DRL) provides transformational capabi...
research
08/23/2023

Deploying Deep Reinforcement Learning Systems: A Taxonomy of Challenges

Deep reinforcement learning (DRL), leveraging Deep Learning (DL) in rein...
research
04/21/2023

Exogenous Data in Forecasting: FARM – A New Measure for Relevance Evaluation

Evaluating the relevance of an exogenous data series is the first step i...
research
02/20/2022

A framework for spatial heat risk assessment using a generalized similarity measure

In this study, we develop a novel framework to assess health risks due t...

Please sign up or login with your details

Forgot password? Click here to reset