Provenance-based Data Skipping (TechReport)

04/26/2021
by   Xing Niu, et al.
0

Database systems analyze queries to determine upfront which data is needed for answering them and use indexes and other physical design techniques to speed-up access to that data. However, for important classes of queries, e.g., HAVING and top-k queries, it is impossible to determine up-front what data is relevant. To overcome this limitation, we develop provenance-based data skipping (PBDS), a novel approach that generates provenance sketches to concisely encode what data is relevant for a query. Once a provenance sketch has been captured it is used to speed up subsequent queries. PBDS can exploit physical design artifacts such as indexes and zone maps. Our approach significantly improves performance for both disk-based and main-memory database systems.

READ FULL TEXT
research
01/28/2013

Ontology-based Data Access: A Study through Disjunctive Datalog, CSP, and MMSNP

Ontology-based data access is concerned with querying incomplete data so...
research
03/24/2022

Efficient Answering of Historical What-if Queries

We introduce historical what-if queries, a novel type of what-if analysi...
research
06/17/2021

A probabilistic database approach to autoencoder-based data cleaning

Data quality problems are a large threat in data science. In this paper,...
research
12/17/2019

Mosaic: A Sample-Based Database System for Open World Query Processing

Data scientists have relied on samples to analyze populations of interes...
research
07/18/2022

Don't Be a Tattle-Tale: Preventing Leakages through Data Dependencies on Access Control Protected Data

We study the problem of answering queries when (part of) the data may be...
research
03/23/2020

RoboMem: Giving Long Term Memory to Robots

Robots have the potential to improve health monitoring outcomes for the ...
research
03/05/2015

Mapping-equivalence and oid-equivalence of single-function object-creating conjunctive queries

Conjunctive database queries have been extended with a mechanism for obj...

Please sign up or login with your details

Forgot password? Click here to reset