Faster Update Time for Turnstile Streaming Algorithms

11/04/2019
by   Josh Alman, et al.
0

In this paper, we present a new algorithm for maintaining linear sketches in turnstile streams with faster update time. As an application, we show that log nCount sketches or CountMin sketches with a constant number of columns (i.e., buckets) can be implicitly maintained in worst-caseO(log^0.582 n) update time using O(log n) words of space, on a standard word RAM with word-size w=Θ(log n). The exponent 0.582≈ 2ω/3-1, where ω is the current matrix multiplication exponent. Due to the numerous applications of linear sketches, our algorithm improves the update time for many streaming problems in turnstile streams, in the high success probability setting, without using more space, including ℓ_2 norm estimation, ℓ_2 heavy hitters, point query with ℓ_1 or ℓ_2 error, etc. Our algorithm generalizes, with the same update time and space, to maintaining log n linear sketches, where each sketch partitions the coordinates into k<log^o(1) n buckets using a c-wise independent hash function for constant c, and maintains the sum of coordinates for each bucket. Moreover, if arbitrary word operations are allowed, the update time can be further improved to O(log^0.187 n), where 0.187≈ω/2-1. Our update algorithm is adaptive, and it circumvents the non-adaptive cell-probe lower bounds for turnstile streaming algorithms by Larsen, Nelson and Nguyên (STOC'15). On the other hand, our result also shows that proving unconditional cell-probe lower bound for the update time seems very difficult, even if the space is restricted to be (nearly) the optimum. If ω=2, the cell-probe update time of our algorithm would be log^o(1) n. Hence, proving any higher lower bound would imply ω>2.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/04/2023

Tight Cell-Probe Lower Bounds for Dynamic Succinct Dictionaries

A dictionary data structure maintains a set of at most n keys from the u...
research
04/13/2023

Pseudorandom Hashing for Space-bounded Computation with Applications in Streaming

We revisit Nisan's classical pseudorandom generator (PRG) for space-boun...
research
01/14/2020

Tight Static Lower Bounds for Non-Adaptive Data Structures

In this paper, we study the static cell probe complexity of non-adaptive...
research
07/17/2018

Tracking the ℓ_2 Norm with Constant Update Time

The ℓ_2 tracking problem is the task of obtaining a streaming algorithm ...
research
06/29/2019

Streaming Quantiles Algorithms with Small Space and Update Time

Approximating quantiles and distributions over streaming data has been s...
research
04/09/2019

Lower Bounds for Oblivious Near-Neighbor Search

We prove an Ω(d n/ ( n)^2) lower bound on the dynamic cell-probe comple...
research
11/25/2022

Towards Better Bounds for Finding Quasi-Identifiers

We revisit the problem of finding small ϵ-separation keys introduced by ...

Please sign up or login with your details

Forgot password? Click here to reset