Sliding window order statistics in sublinear space

07/12/2018
by Dhruv Rohatgi, et al.
We extend the multi-pass streaming model to sliding window problems, and address the problem of computing order statistics on fixed-size sliding windows, in the multi-pass streaming model as well as the closely related communication complexity model. In the 2-pass streaming model, we show that on an input of length $N$ with values in the range $[0, R]$ and a window of length $K$, sliding window minimums can be computed in $O(\sqrt{N})$ space. We show that this bound is nearly optimal (for any constant number of passes) when $R \geq K$, but that it can be improved to $O(\sqrt{NR/K})$ when $R = o(K)$. Furthermore, we show that there is an $(l+1)$-pass streaming algorithm which computes the $l$-th smallest elements in $O(l^{3/2}\sqrt{N})$ space. In the communication complexity model, we describe a simple $O(pN^{1/p})$ algorithm to compute minimums in $p$ rounds of communication for odd $p$, and a more involved algorithm which computes the $l$-th smallest elements in $O(pl^{2}N^{1/(p-2l-1)})$ space. Finally, we prove that the majority statistic on boolean streams cannot be computed in sublinear space, implying that the $l$-th smallest elements cannot be computed in space both sublinear in $N$ and independent of $l$.
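To make the problem statement concrete, the following is a minimal Python sketch of the two quantities discussed above: the minimum and the $l$-th smallest element of every length-$K$ window. It is not the paper's algorithm. The first function is the standard single-pass monotonic-deque baseline, which can use up to $O(K)$ space, and the second is a naive sort-per-window reference. The function names and the sample input are assumptions chosen for illustration only.

```python
from collections import deque
from typing import List, Sequence


def sliding_window_minimums(values: Sequence[int], k: int) -> List[int]:
    """Return the minimum of every length-k window, left to right.

    Baseline only: a single pass with a monotonic deque, which may hold
    up to k indices. This is NOT the sublinear-space method of the paper.
    """
    if k <= 0 or k > len(values):
        raise ValueError("k must satisfy 1 <= k <= len(values)")

    candidates = deque()  # indices whose values are strictly increasing
    result = []
    for i, v in enumerate(values):
        # Drop candidates that can never be a minimum again.
        while candidates and values[candidates[-1]] >= v:
            candidates.pop()
        candidates.append(i)
        # Drop the front candidate once it leaves the window.
        if candidates[0] <= i - k:
            candidates.popleft()
        # The first full window ends at index k - 1.
        if i >= k - 1:
            result.append(values[candidates[0]])
    return result


def sliding_window_lth_smallest(values: Sequence[int], k: int, l: int) -> List[int]:
    """Return the l-th smallest element (1-indexed) of every length-k window.

    Naive reference: sorts each window, so it stores a full window at a time.
    """
    if not 1 <= l <= k <= len(values):
        raise ValueError("need 1 <= l <= k <= len(values)")
    return [sorted(values[i:i + k])[l - 1] for i in range(len(values) - k + 1)]


if __name__ == "__main__":
    sample = [4, 2, 5, 1, 3, 6]  # hypothetical input with N = 6
    print(sliding_window_minimums(sample, 3))
    print(sliding_window_lth_smallest(sample, 3, 2))
```

With $l = 1$ the second function agrees with the first, which gives a simple cross-check. Both baselines keep on the order of a full window in memory, which is the cost the multi-pass results above aim to avoid.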

research
04/16/2019

Almost-Smooth Histograms and Sliding-Window Graph Algorithms

We study algorithms for the sliding-window model, an important variant o...
research
02/20/2018

Periodicity in Data Streams with Wildcards

We investigate the problem of detecting periodic trends within a string ...
research
11/29/2020

AWLCO: All-Window Length Co-Occurrence

Analyzing patterns in a sequence of events has applications in text anal...
research
02/08/2021

Throwing a Sofa Through the Window

We study several variants of the problem of moving a convex polytope K, ...
research
01/18/2023

Pseudorandom Generators for Sliding-Window Algorithms

A sliding-window algorithm of window size t is an algorithm whose curren...
research
11/21/2019

S-RASTER: Contraction Clustering for Evolving Data Streams

Contraction Clustering (RASTER) is a very fast algorithm for density-bas...
research
10/06/2019

Fast Detection of Outliers in Data Streams with the Q_n Estimator

We present FQN (Fast Q_n), a novel algorithm for fast detection of outli...
