Efficient Approximation of Certain and Possible Answers for Ranking and Window Queries over Uncertain Data (Extended version)

02/17/2023
by   Su Feng, et al.

Uncertainty arises naturally in many application domains due to, e.g., data entry errors and ambiguity in data cleaning. Prior work in incomplete and probabilistic databases has investigated the semantics and efficient evaluation of ranking and top-k queries over uncertain data. However, most approaches deal with top-k and ranking in isolation and represent uncertain input data and query results using separate, incompatible data models. We present an efficient approach for under- and over-approximating results of ranking, top-k, and window queries over uncertain data. Our approach integrates well with existing techniques for querying uncertain data, is efficient, and is, to the best of our knowledge, the first to support windowed aggregation. We design algorithms for physical operators for uncertain sorting and windowed aggregation, and implement them in PostgreSQL. We evaluated our approach on synthetic and real-world datasets, demonstrating that it outperforms all competitors and often produces more accurate results.
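To make the idea of under- and over-approximating a top-k result concrete, the following is a minimal sketch and not the paper's algorithm. It assumes a simplified model in which each tuple's sort key is known only as an interval `(lo, hi)`; the names `rank_bounds` and `top_k_bounds` and the sample data are hypothetical. A tuple is reported as certainly in the top-k if it is there no matter how the intervals resolve, and as possibly in the top-k if some resolution could place it there.

```python
from typing import Dict, List, Tuple

# Assumed model: (lowest possible sort key, highest possible sort key).
Interval = Tuple[float, float]


def rank_bounds(values: Dict[str, Interval]) -> Dict[str, Tuple[int, int]]:
    """Return 0-based (min_pos, max_pos) per tuple for an ascending sort."""
    bounds = {}
    for t, (lo_t, hi_t) in values.items():
        # Tuples whose largest possible key is below t's smallest key
        # precede t in every possible world.
        certainly_before = sum(
            1 for s, (_, hi_s) in values.items() if s != t and hi_s < lo_t
        )
        # Tuples whose smallest possible key does not exceed t's largest key
        # may precede t in some world (ties are counted conservatively).
        possibly_before = sum(
            1 for s, (lo_s, _) in values.items() if s != t and lo_s <= hi_t
        )
        bounds[t] = (certainly_before, possibly_before)
    return bounds


def top_k_bounds(values: Dict[str, Interval], k: int) -> Tuple[List[str], List[str]]:
    """Return (certain, possible) tuple ids for the first k sort positions."""
    b = rank_bounds(values)
    certain = sorted(t for t, (_, max_pos) in b.items() if max_pos < k)
    possible = sorted(t for t, (min_pos, _) in b.items() if min_pos < k)
    return certain, possible


# Hypothetical sample data: four tuples with interval-valued sort keys.
sample = {"a": (1.0, 2.0), "b": (1.5, 4.0), "c": (3.0, 5.0), "d": (6.0, 7.0)}
certain, possible = top_k_bounds(sample, k=2)
print(certain, possible)
```

In this sketch the certain set is always contained in the possible set, which mirrors the under-approximation and over-approximation roles described in the abstract; the quadratic pairwise comparison is chosen for clarity, not efficiency.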


research
02/23/2021

Efficient Uncertainty Tracking for Complex Queries with Attribute-level Bounds (extended version)

Certain answers are a principled method for coping with the uncertainty ...
research
03/30/2019

Uncertainty Annotated Databases - A Lightweight Approach for Approximating Certain Answers (extended version)

Certain answers are a principled method for coping with uncertainty that...
research
09/02/2020

Uncertain Spatial Data Management: An Overview

Both the current trends in technology such as smartphones, general mobil...
research
05/10/2021

Probabilistic Top-k Dominating Queries in Distributed Uncertain Databases (Technical Report)

In many real-world applications such as business planning and sensor dat...
research
12/12/2021

Probabilistic Counting in Uncertain Spatial Databases using Generating Functions

Location data is inherently uncertain for many reasons including 1) impr...
research
10/01/2019

Lineage-Aware Temporal Windows: Supporting Set Operations in Temporal-Probabilistic Databases

In temporal-probabilistic (TP) databases, the combination of the tempora...
research
03/27/2020

Identification of Choquet capacity in multicriteria sorting problems through stochastic inverse analysis

In multicriteria decision aiding (MCDA), the Choquet integral has been u...
