You Say 'What', I Hear 'Where' and 'Why' --- (Mis-)Interpreting SQL to Derive Fine-Grained Provenance

05/29/2018
by   Tobias Müller, et al.
0

SQL declaratively specifies what (not how) the desired output of a query is. This work shows that a non-standard interpretation of the SQL semantics can, instead, disclose where a piece of the output originated in the input and why that piece found its way into the result. We derive such data provenance for very rich SQL dialects---including recursion, windowed aggregates, and user-defined functions---at the fine-grained level of individual table cells. The approach is non-invasive and implemented as a compositional source-level SQL rewrite: an input SQL query is transformed into its own interpreter that wields data dependencies instead of regular values. We deliberately design this transformation to preserve the shape of both data and query, which allows provenance derivation to scale to complex queries without overwhelming the underlying database system.

READ FULL TEXT
research
09/07/2019

Compiling PL/SQL Away

"PL/SQL functions are slow," is common developer wisdom that derives fro...
research
09/12/2022

Optimising Fine-Grained Access Control Policy Enforcement for Database Queries. A Model-Driven Approach

Recently, we have proposed a model-driven approach for enforcing fine-gr...
research
08/31/2018

Implementing WHERE and ORDER BY as spreadsheet formulas

The WHERE and ORDER BY clauses of the SQL SELECT statement select a subs...
research
09/28/2022

Improving Text-to-SQL Semantic Parsing with Fine-grained Query Understanding

Most recent research on Text-to-SQL semantic parsing relies on either pa...
research
10/12/2020

PATSQL: Efficient Synthesis of SQL Queries from Example Tables with Quick Inference of Projected Columns

SQL is one of the most popular tools for data analysis and used by an in...
research
03/09/2022

CUBES: A Parallel Synthesizer for SQL Using Examples

In recent years, more and more people see their work depend on data mani...
research
05/28/2019

One SQL to Rule Them All

Real-time data analysis and management are increasingly critical for tod...

Please sign up or login with your details

Forgot password? Click here to reset