Highly Efficient Indexing Scheme for k-Dominant Skyline Processing over Uncertain Data Streams

11/16/2021
by Chuan-Chi Lai, et al.

Skyline queries are widely used to solve multi-criteria problems such as environmental monitoring and business decision-making. A data item is said to dominate another if it is not worse on any criterion and is better on at least one criterion. A data item that is not dominated by any other data item is a member of the skyline. However, as the number of criteria increases, the likelihood that one data item dominates another decreases, so the skyline set ends up with too many members. To address this problem, the concept of the k-dominant skyline was proposed, which reduces the number of skyline members by relaxing the dominance condition. When the data are uncertain, each data item has a probability of appearing, and therefore a probability of being a member of the k-dominant skyline. When a new data item arrives, the probability that other data items belong to the k-dominant skyline may change. Updating the k-dominant skyline quickly enough for real-time applications is therefore a serious problem. This paper proposes an effective method, Middle Indexing (MI), which sorts the data in a specific way to filter out a large amount of irrelevant data in the uncertain data stream, thereby improving the efficiency of updating the k-dominant skyline. Experiments show that the proposed MI outperforms the existing method by approximately 13
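
To make the definitions in the abstract concrete, here is a minimal Python sketch of dominance, k-dominance, and a brute-force k-dominant skyline probability. It is not the paper's Middle Indexing method, which the abstract does not describe in detail. The following are assumptions made for illustration: smaller values are better on every criterion; k-dominance follows the commonly used definition (not worse on at least k criteria and strictly better on at least one of them); data items exist independently of one another; and all function names are hypothetical.

```python
from typing import List, Sequence

Point = Sequence[float]


def dominates(p: Point, q: Point) -> bool:
    """p dominates q: not worse on every criterion, better on at least one.

    Assumption: smaller values are better.
    """
    return all(a <= b for a, b in zip(p, q)) and any(a < b for a, b in zip(p, q))


def k_dominates(p: Point, q: Point, k: int) -> bool:
    """p k-dominates q: some k criteria exist on which p is not worse than q,
    with p strictly better on at least one of those k criteria.

    Any strictly-better criterion is also a not-worse criterion, so counting
    both quantities over all criteria is enough.
    """
    not_worse = sum(a <= b for a, b in zip(p, q))
    better = sum(a < b for a, b in zip(p, q))
    return not_worse >= k and better >= 1


def k_dominant_skyline_probabilities(
    points: List[Point], probs: List[float], k: int
) -> List[float]:
    """Brute-force O(n^2) probability that each item is in the k-dominant skyline.

    Assumption (not stated in the abstract): items appear independently, so an
    item is in the skyline when it appears and no item that k-dominates it does.
    """
    result = []
    for i, p in enumerate(points):
        prob = probs[i]
        for j, q in enumerate(points):
            if i != j and k_dominates(q, p, k):
                prob *= 1.0 - probs[j]
        result.append(prob)
    return result


if __name__ == "__main__":
    pts = [(1, 2, 3), (2, 1, 4), (3, 3, 3)]
    pr = [0.5, 0.8, 0.9]
    print(k_dominant_skyline_probabilities(pts, pr, k=2))
```

In this toy input, the second item is not dominated by the first in the ordinary sense, but it is 2-dominated by it, which illustrates how relaxing the condition shrinks the skyline. The quadratic loop also shows why every arrival in a stream can change the probabilities of many existing items, which is the update cost an indexing scheme aims to reduce.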


