Theoretical Model and Practical Considerations for Data Lineage Reconstruction

01/30/2020
by   Egor Pushkin, et al.
0

We live in a world driven by data. The amount of it outgrows anyone's ability to oversee it or even observe its scope. Along with all the advances in the space of data management, there is still a significant lack of formalism and standardization around defining data ecosystems and processes occurring within those. In order to address the issue we propose a notation for data flow modeling and evaluate some of the most common applications of it based on real-world use cases. To facilitate future work, we provide detailed reference of the data model we defined and consider potential programming paradigms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/16/2021

Towards a Provenance Management System for Astronomical Observatories

We present here a provenance management system adapted to astronomical p...
research
04/12/2022

Towards Polyglot Data Stores – Overview and Open Research Questions

Nowadays, data-intensive applications face the problem of handling heter...
research
05/21/2021

RFID-based Article-to-Fixture Predictions in Real-World Fashion Stores

In recent years, Radio Frequency Identification (RFID) technology has be...
research
03/28/2019

Towards 6G Networks: Use Cases and Technologies

As the digital world becomes increasingly intelligent, automated and ubi...
research
10/05/2021

An Ample Approach to Data and Modeling

In the present work, we describe a framework for modeling how models can...
research
01/02/2013

MANCaLog: A Logic for Multi-Attribute Network Cascades (Technical Report)

The modeling of cascade processes in multi-agent systems in the form of ...
research
05/05/2023

Scope Restriction for Scalable Real-Time Railway Rescheduling: An Exploratory Study

With the aim to stimulate future research, we describe an exploratory st...

Please sign up or login with your details

Forgot password? Click here to reset