On Range Summary Queries

05/04/2023
by   Peyman Afshani, et al.
0

We study the query version of the approximate heavy hitter and quantile problems. In the former problem, the input is a parameter ε and a set P of n points in ℝ^d where each point is assigned a color from a set C, and we want to build a structure s.t. given any geometric range γ, we can efficiently find a list of approximate heavy hitters in γ∩ P, i.e., colors that appear at least ε |γ∩ P| times in γ∩ P, as well as their frequencies with an additive error of ε |γ∩ P|. In the latter problem, each point is assigned a weight from a totally ordered universe and the query must output a sequence S of 1+1/ε weights s.t. the i-th weight in S has approximate rank iε|γ∩ P|, meaning, rank iε|γ∩ P| up to an additive error of ε|γ∩ P|. Previously, optimal results were only known in 1D [WY11] but a few sub-optimal methods were available in higher dimensions [AW17, ACH+12]. We study the problems for 3D halfspace and dominance queries. We consider the real RAM model with integer registers of size w=Θ(log n) bits. For dominance queries, we show optimal solutions for both heavy hitter and quantile problems: using linear space, we can answer both queries in time O(log n + 1/ε). Note that as the output size is 1/ε, after investing the initial O(log n) searching time, our structure takes on average O(1) time to find a heavy hitter or a quantile! For more general halfspace heavy hitter queries, the same optimal query time can be achieved by increasing the space by an extra log_w1/ε (resp. loglog_w1/ε) factor in 3D (resp. 2D). By spending extra log^O(1)1/ε factors in time and space, we can also support quantile queries.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2022

Simplex Range Searching Revisited: How to Shave Logs in Multi-Level Data Structures

We revisit the classic problem of simplex range searching and related pr...
research
03/11/2018

Approximate Range Queries for Clustering

We study the approximate range searching for three variants of the clust...
research
03/11/2023

Generalizing Greenwald-Khanna Streaming Quantile Summaries for Weighted Inputs

Estimating quantiles, like the median or percentiles, is a fundamental t...
research
12/01/2020

Lower Bounds for Semialgebraic Range Searching and Stabbing Problems

In the semialgebraic range searching problem, we are to preprocess n poi...
research
12/03/2021

Point Enclosure Problem for Homothetic Polygons

In this paper, we investigate the homothetic point enclosure problem: gi...
research
04/28/2018

Heavy Hitters over Interval Queries

Heavy hitters and frequency measurements are fundamental in many network...
research
05/25/2021

Minmax-optimal list searching with O(log_2log_2 n) average cost

We find a searching method on ordered lists that surprisingly outperform...

Please sign up or login with your details

Forgot password? Click here to reset