Data lake concept and systems: a survey

06/17/2021
by   Rihan Hai, et al.
0

Although big data has been discussed for some years, it still has many research challenges, especially the variety of data. It poses a huge difficulty to efficiently integrate, access, and query the large volume of diverse data in information silos with the traditional 'schema-on-write' approaches such as data warehouses. Data lakes have been proposed as a solution to this problem. They are repositories storing raw data in its original formats and providing a common access interface. This survey reviews the development, definition, and architectures of data lakes. We provide a comprehensive overview of research questions for designing and building data lakes. We classify the existing data lake systems based on their provided functions, which makes this survey a useful technical reference for designing, implementing and applying data lakes. We hope that the thorough comparison of existing solutions and the discussion of open research challenges in this survey would motivate the future development of data lake research and practice.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/29/2018

Big Data Meet Cyber-Physical Systems: A Panoramic Survey

The world is witnessing an unprecedented growth of cyber-physical system...
research
10/28/2022

Big Data Meets Metaverse: A Survey

We are living in the era of big data. The Metaverse is an emerging techn...
research
09/02/2019

Big Data Analytics for Large Scale Wireless Networks: Challenges and Opportunities

The wide proliferation of various wireless communication systems and wir...
research
04/13/2022

DL4SciVis: A State-of-the-Art Survey on Deep Learning for Scientific Visualization

Since 2016, we have witnessed the tremendous growth of artificial intell...
research
09/30/2010

A Comprehensive Survey of Data Mining-based Fraud Detection Research

This survey paper categorises, compares, and summarises from almost all ...
research
03/10/2022

Collaborative Learning and Patterns of Practice

In this article, an overview of the background, the research approaches ...
research
08/30/2015

Computational Sociolinguistics: A Survey

Language is a social phenomenon and variation is inherent to its social ...

Please sign up or login with your details

Forgot password? Click here to reset