Towards a context-dependent numerical data quality evaluation framework

10/22/2018
by   Milen S. Marev, et al.
0

This paper focuses on numeric data, with emphasis on distinct characteristics like varying significance, unstructured format, mass volume and real-time processing. We propose a novel, context-dependent valuation framework specifically devised to assess quality in numeric datasets. Our framework uses eight relevant data quality dimensions, and provide a simple metric to evaluate dataset quality along each dimension. We argue that the proposed set of dimensions and corresponding metrics adequately captures the unique quality antipatterns that are typically associated with numerical data. The introduction of our framework is part of a wider research effort that aims at developing an articulated numerical data quality improvement approach for Oil and Gas exploration and production workflows that is based on artificial intelligence techniques.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/04/2023

A Topological Approach to Measuring Training Data Quality

Data quality is crucial for the successful training, generalization and ...
research
02/18/2020

Quantitative Evaluation of Time-Dependent Multidimensional Projection Techniques

Dimensionality reduction methods are an essential tool for multidimensio...
research
11/17/2020

Towards evaluating and eliciting high-quality documentation for intelligent systems

A vital component of trust and transparency in intelligent systems built...
research
12/17/2018

Report on Data Quality in Biobanks: Problems, Issues, State-of-the-Art

This report discusses the issues of data quality in biobanks. It present...
research
03/03/2023

Interoperability-oriented Quality Assessment for Czech Open Data

With the rapid increase of published open datasets, it is crucial to sup...
research
02/13/2023

One Transformer for All Time Series: Representing and Training with Time-Dependent Heterogeneous Tabular Data

There is a recent growing interest in applying Deep Learning techniques ...
research
07/26/2018

General Context-Aware Data Matching and Merging Framework

Due to numerous public information sources and services, many methods to...

Please sign up or login with your details

Forgot password? Click here to reset