Universal Streaming of Subset Norms

12/01/2018
by   Vladimir Braverman, et al.

Most known algorithms in the streaming model of computation aim to approximate a single function, such as an ℓ_p-norm. In 2009, Nelson [<https://sublinear.info>, Open Problem 30] asked whether it is possible to design universal algorithms that simultaneously approximate multiple functions of the stream. In this paper we answer Nelson's question for the class of subset-ℓ_0-norms in the insertion-only frequency-vector model. Given a family of subsets 𝒮 ⊂ 2^[n], we provide a single streaming algorithm that can (1±ϵ)-approximate the subset-norm for every S ∈ 𝒮. Here, the subset-ℓ_p-norm of v ∈ R^n with respect to a set S ⊆ [n] is the ℓ_p-norm of the vector v_|S (the restriction of v to S, obtained by zeroing all other coordinates). Our main result is a near-tight characterization of the space complexity of every family 𝒮 ⊂ 2^[n] of subset-ℓ_0-norms in insertion-only streams, expressed in terms of the "heavy-hitter dimension" of 𝒮, a new combinatorial quantity that is related to the VC-dimension of 𝒮. In contrast, we show that the more general turnstile and sliding-window models require much larger space. All these results easily extend to ℓ_1. In addition, we design algorithms for two other subset-ℓ_p-norm variants. These can be compared to the Priority Sampling algorithm of Duffield, Lund and Thorup [JACM 2007], which achieves additive approximation ϵ‖v‖ for all possible subsets (𝒮 = 2^[n]) in the entry-wise update model. One of our algorithms extends this algorithm to handle turnstile updates, and another achieves multiplicative approximation given a family 𝒮.
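To make the definition of the subset-ℓ_p-norm concrete, the following is a minimal sketch that computes it exactly from an insertion-only stream. It is not the paper's algorithm: it stores the whole frequency vector (linear space) and serves only as a brute-force baseline against which a small-space (1±ϵ)-approximation could be checked. The function names `frequency_vector` and `subset_norm`, the example stream, and the example family are all hypothetical.

```python
from collections import Counter
from typing import Iterable, Set


def frequency_vector(stream: Iterable[int]) -> Counter:
    """Insertion-only model: each arriving item i increments v[i] by one."""
    return Counter(stream)


def subset_norm(v: Counter, subset: Set[int], p: float) -> float:
    """Exact l_p-norm of v restricted to `subset` (other coordinates zeroed).

    For p == 0 this is the number of nonzero coordinates of v inside `subset`.
    """
    restricted = [abs(v[i]) for i in subset if v[i] != 0]
    if p == 0:
        return float(len(restricted))
    return sum(x ** p for x in restricted) ** (1.0 / p)


# Hypothetical stream over the universe [n]; items 3, 1, 7 appear 3, 2, 1 times.
stream = [3, 1, 3, 7, 1, 3]
v = frequency_vector(stream)

# Hypothetical family of subsets; a universal algorithm must answer for each.
family = [{1, 3}, {2, 7}, {1, 2, 3, 7}]
norms_l0 = [subset_norm(v, s, 0) for s in family]
norms_l1 = [subset_norm(v, s, 1) for s in family]
```

The point of the universal setting is that the family is fixed in advance, but the queried subset S is chosen only after the stream has been processed, so a single summary has to serve every member of the family.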


research
09/03/2021

Symmetric Norm Estimation and Regression on Sliding Windows

The sliding window model generalizes the standard streaming model and of...
research
07/09/2023

Private Data Stream Analysis for Universal Symmetric Norm Estimation

We study how to release summary statistics on a data stream subject to t...
research
07/16/2021

Streaming and Distributed Algorithms for Robust Column Subset Selection

We give the first single-pass streaming algorithm for Column Subset Sele...
research
06/17/2018

On Sketching the q to p norms

We initiate the study of data dimensionality reduction, or sketching, fo...
research
07/06/2018

Leveraging Well-Conditioned Bases: Streaming & Distributed Summaries in Minkowski p-Norms

Work on approximate linear algebra has led to efficient distributed and ...
research
08/03/2023

One Partition Approximating All ℓ_p-norm Objectives in Correlation Clustering

This paper considers correlation clustering on unweighted complete graph...
research
07/01/2019

Approximate F_2-Sketching of Valuation Functions

We study the problem of constructing a linear sketch of minimum dimensio...
