Building a serverless Data Lakehouse from spare parts

08/10/2023
by   Jacopo Tagliabue, et al.
0

The recently proposed Data Lakehouse architecture is built on open file formats, performance, and first-class support for data transformation, BI and data science: while the vision stresses the importance of lowering the barrier for data work, existing implementations often struggle to live up to user expectations. At Bauplan, we decided to build a new serverless platform to fulfill the Lakehouse vision. Since building from scratch is a challenge unfit for a startup, we started by re-using (sometimes unconventionally) existing projects, and then investing in improving the areas that would give us the highest marginal gains for the developer experience. In this work, we review user experience, high-level architecture and tooling decisions, and conclude by sharing plans for future development.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/16/2020

Starting with data: advancing spatial data science by building and sharing high-quality datasets

Spatial data science has emerged in recent years as an interdisciplinary...
research
11/25/2021

Federated Data Science to Break Down Silos [Vision]

Similar to Open Data initiatives, data science as a community has launch...
research
03/25/2019

Comparative Analysis of Distributed and Parallel File Systems' Internal Techniques

A file system optimization is the most common task in the file system fi...
research
03/27/2022

mdx: A Cloud Platform for Supporting Data Science and Cross-Disciplinary Research Collaborations

The growing amount of data and advances in data science have created a n...
research
03/08/2021

Leveraging Data Scientists and Business Expectations During the COVID-19 Pandemic

The COVID-19 pandemic presented itself as a challenge for separate socie...
research
08/13/2021

HPTMT Parallel Operators for High Performance Data Science Data Engineering

Data-intensive applications are becoming commonplace in all science disc...
research
02/05/2020

If I Hear You Correctly: Building and Evaluating Interview Chatbots with Active Listening Skills

Interview chatbots engage users in a text-based conversation to draw out...

Please sign up or login with your details

Forgot password? Click here to reset