Modeling Data Lake Metadata with a Data Vault

07/11/2018
by Iuri Nogueira, et al.
With the rise of big data, business intelligence had to find solutions for managing greater data volumes and variety than data warehouses, which proved ill-adapted, could handle. Data lakes answer these needs from a storage point of view, but they require adequate metadata management to guarantee efficient access to data. Starting from a multidimensional metadata model that was designed for an industrial heritage data lake and that lacks schema evolvability, we propose in this paper to use ensemble modeling, and more precisely a data vault, to address this issue. To illustrate the feasibility of this approach, we instantiate our conceptual metadata model into relational and document-oriented logical and physical models. We also compare the physical models in terms of metadata storage and query response time.
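To make the schema-evolution argument concrete, the following is a minimal sketch of a relational data vault for metadata, written against Python's built-in sqlite3 module. It is not the paper's model: every table and column name (hub_document, hub_author, link_document_author, sat_document_descr, sat_document_geo) and every inserted value is a hypothetical chosen for illustration. It only shows the general data vault pattern, where hubs hold business keys, links hold relationships, and satellites hold descriptive attributes with a load date.

```python
import sqlite3

# Hypothetical data vault schema for data lake metadata (illustrative names only).
SCHEMA = """
CREATE TABLE hub_document (
    doc_key       TEXT PRIMARY KEY,   -- business key of a document in the lake
    load_date     TEXT NOT NULL,
    record_source TEXT NOT NULL
);
CREATE TABLE hub_author (
    author_key    TEXT PRIMARY KEY,
    load_date     TEXT NOT NULL,
    record_source TEXT NOT NULL
);
CREATE TABLE link_document_author (
    doc_key    TEXT NOT NULL REFERENCES hub_document(doc_key),
    author_key TEXT NOT NULL REFERENCES hub_author(author_key),
    load_date  TEXT NOT NULL,
    PRIMARY KEY (doc_key, author_key)
);
CREATE TABLE sat_document_descr (
    doc_key     TEXT NOT NULL REFERENCES hub_document(doc_key),
    load_date   TEXT NOT NULL,        -- one row per version of the description
    title       TEXT,
    file_format TEXT,
    PRIMARY KEY (doc_key, load_date)
);
"""


def table_names(conn):
    """Return the names of all tables in the database, sorted."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    return [row[0] for row in rows]


conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Load one document and two successive versions of its descriptive metadata.
conn.execute("INSERT INTO hub_document VALUES (?, ?, ?)",
             ("doc-1", "2018-01-01", "demo"))
conn.execute("INSERT INTO sat_document_descr VALUES (?, ?, ?, ?)",
             ("doc-1", "2018-01-01", "Old title", "pdf"))
conn.execute("INSERT INTO sat_document_descr VALUES (?, ?, ?, ?)",
             ("doc-1", "2018-02-01", "New title", "pdf"))

before = table_names(conn)

# Schema evolution: a new kind of metadata becomes a new satellite.
# No existing table is altered.
conn.execute("""
CREATE TABLE sat_document_geo (
    doc_key   TEXT NOT NULL REFERENCES hub_document(doc_key),
    load_date TEXT NOT NULL,
    latitude  REAL,
    longitude REAL,
    PRIMARY KEY (doc_key, load_date)
)
""")

after = table_names(conn)

# Current view of a document: join the hub to the latest satellite row.
current = conn.execute("""
SELECT h.doc_key, s.title
FROM hub_document AS h
JOIN sat_document_descr AS s ON s.doc_key = h.doc_key
WHERE s.load_date = (
    SELECT MAX(load_date) FROM sat_document_descr WHERE doc_key = h.doc_key
)
""").fetchall()
```

In this sketch, adding geographic metadata only creates a table, which is the property a data vault is usually chosen for when a fixed multidimensional schema is hard to evolve. A document-oriented variant would typically embed satellite versions inside the hub's document instead of joining tables.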


