Sampling Based Approximate Skyline Calculation on Big Data

08/12/2020
by   Xingxing Xiao, et al.
0

The existing algorithms for processing skyline queries cannot adapt to big data. This paper proposes two approximate skyline algorithms based on sampling. The first algorithm obtains a fixed size sample and computes the approximate skyline on the sample. The error of the first algorithm is relatively small in most cases, and is almost independent of the input relation size. The second algorithm returns an (ϵ,δ)-approximation for the exact skyline. The size of sample required by the second algorithm can be regarded as a constant relative to the input relation size, so is the running time. Experiments verify the error analysis of the first algorithm and show that the second algorithm is much faster than the existing skyline algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/29/2018

Sampling techniques for big data analysis in finite population inference

In analyzing big data for finite population inference, it is critical to...
research
08/14/2022

Sharp Frequency Bounds for Sample-Based Queries

A data sketch algorithm scans a big data set, collecting a small amount ...
research
07/29/2018

MISS: Finding Optimal Sample Sizes for Approximate Analytics

Nowadays, sampling-based Approximate Query Processing (AQP) is widely re...
research
11/25/2022

Towards Better Bounds for Finding Quasi-Identifiers

We revisit the problem of finding small ϵ-separation keys introduced by ...
research
09/21/2023

Robust Approximation Algorithms for Non-monotone k-Submodular Maximization under a Knapsack Constraint

The problem of non-monotone k-submodular maximization under a knapsack c...
research
08/16/2020

DeepSampling: Selectivity Estimation with Predicted Error and Response Time

The rapid growth of spatial data urges the research community to find ef...
research
04/29/2023

Subdata selection for big data regression: an improved approach

In the big data era researchers face a series of problems. Even standard...

Please sign up or login with your details

Forgot password? Click here to reset