Efficient Processing of k-regret Minimization Queries with Theoretical Guarantees

03/22/2021
by   Jiping Zheng, et al.

Helping end users identify desired results in a large dataset is an important problem in multi-criteria decision making. Top-k and skyline queries have been widely adopted to address it, but both have inherent drawbacks: the user either has to provide a specific utility function (top-k) or is faced with too many results (skyline). The k-regret minimization query was proposed to integrate the merits of top-k and skyline queries. Because the problem is NP-hard, answering a k-regret minimization query is time consuming, and the greedy framework is widely adopted. However, a formal theoretical analysis of the quality of the results returned by greedy approaches is still lacking. In this paper, we first fill this gap by conducting a nontrivial theoretical analysis of the approximation ratio of the returned results. To speed up query processing, a sampling-based method, StocPreGreed, is developed to reduce the evaluation cost. In addition, a theoretical analysis of the required sample size is conducted to bound the quality of the returned results. Finally, comprehensive experiments on both real and synthetic datasets demonstrate the efficiency and effectiveness of the proposed methods.
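To make the greedy framework mentioned in the abstract concrete, the following is a minimal sketch of greedy regret minimization in which the maximum regret ratio is estimated over randomly sampled linear utility functions. It is not the paper's StocPreGreed algorithm, whose details are not given here. The function names (`sample_utilities`, `max_regret_ratio`, `greedy_regret_set`), the restriction to non-negative linear utilities, and the sample size are all assumptions made for illustration.

```python
import random


def sample_utilities(dim, n_samples, seed=0):
    """Draw random non-negative weight vectors, each normalized to sum to 1.

    Assumption: users have linear utility functions f(p) = w . p.
    """
    rng = random.Random(seed)
    utilities = []
    for _ in range(n_samples):
        w = [rng.random() for _ in range(dim)]
        total = sum(w) or 1.0  # guard against an all-zero draw
        utilities.append([x / total for x in w])
    return utilities


def score(weights, point):
    """Linear utility of a point under one weight vector."""
    return sum(w * x for w, x in zip(weights, point))


def max_regret_ratio(points, subset, utilities):
    """Largest relative utility loss from seeing `subset` instead of `points`,
    taken over the sampled utility functions only (an estimate, not the exact value)."""
    worst = 0.0
    for w in utilities:
        best_all = max(score(w, p) for p in points)
        if best_all <= 0:
            continue  # this utility cannot distinguish any point
        best_sub = max((score(w, p) for p in subset), default=0.0)
        worst = max(worst, (best_all - best_sub) / best_all)
    return worst


def greedy_regret_set(points, k, utilities):
    """Repeatedly add the point that most reduces the estimated maximum regret ratio."""
    chosen = []
    remaining = list(range(len(points)))
    for _ in range(min(k, len(points))):
        best_idx = min(
            remaining,
            key=lambda i: max_regret_ratio(
                points, [points[j] for j in chosen] + [points[i]], utilities
            ),
        )
        chosen.append(best_idx)
        remaining.remove(best_idx)
    return chosen  # indices into `points`, in selection order


if __name__ == "__main__":
    data = [(1.0, 0.0), (0.0, 1.0), (0.6, 0.6), (0.2, 0.2)]
    utils = sample_utilities(dim=2, n_samples=200)
    picked = greedy_regret_set(data, k=2, utilities=utils)
    print(picked, max_regret_ratio(data, [data[i] for i in picked], utils))
```

Each greedy step here re-evaluates every remaining candidate against every sampled utility, so the cost grows with both the dataset size and the sample size. That trade-off between sample size and result quality is the kind of question the abstract says the paper analyzes.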


research
11/16/2021

Rank-Regret Minimization

Multi-criteria decision-making often requires finding a small representa...
research
11/08/2021

The Hardness Analysis of Thompson Sampling for Combinatorial Semi-bandits with Greedy Oracle

Thompson sampling (TS) has attracted a lot of interest in the bandit are...
research
05/31/2023

Theoretical Analysis on the Efficiency of Interleaved Comparisons

This study presents a theoretical analysis on the efficiency of interlea...
research
01/25/2022

Flexible skylines, regret minimization and skyline ranking: a comparison to know how to select the right approach

Recent studies pointed out some limitations about classic top-k queries ...
research
12/26/2018

Dynamic Online Gradient Descent with Improved Query Complexity: A Theoretical Revisit

We provide a new theoretical analysis framework to investigate online gr...
research
01/29/2019

Semantic and Influence aware k-Representative Queries over Social Streams

Massive volumes of data continuously generated on social platforms have ...
research
02/03/2022

QueryER: A Framework for Fast Analysis-Aware Deduplication over Dirty Data

In this work, we explore the problem of correctly and efficiently answer...
