Tight Bounds for the Subspace Sketch Problem with Applications

04/11/2019
by   Yi Li, et al.
0

In the subspace sketch problem one is given an n× d matrix A with O((nd)) bit entries, and would like to compress it in an arbitrary way to build a small space data structure Q_p, so that for any given x ∈R^d, with probability at least 2/3, one has Q_p(x)=(1±ϵ)Ax_p, where p≥ 0, and where the randomness is over the construction of Q_p. The central question is: How many bits are necessary to store Q_p? This problem has applications to the communication of approximating the number of non-zeros in a matrix product, the size of coresets in projective clustering, the memory of streaming algorithms for regression in the row-update model, and embedding subspaces of L_p in functional analysis. A major open question is the dependence on the approximation factor ϵ. We show if p≥ 0 is not a positive even integer and d=Ω((1/ϵ)), then Ω̃(ϵ^-2d) bits are necessary. On the other hand, if p is a positive even integer, then there is an upper bound of O(d^p(nd)) bits independent of ϵ. Our results are optimal up to logarithmic factors, and show in particular that one cannot compress A to O(d) "directions" v_1,...,v_O(d), such that for any x, Ax_1 can be well-approximated from 〈 v_1,x〉,...,〈 v_O(d),x〉. Our lower bound rules out arbitrary functions of these inner products (and in fact arbitrary data structures built from A), and thus rules out the possibility of a singular value decomposition for ℓ_1 in a very strong sense. Indeed, as ϵ→ 0, for p = 1 the space complexity becomes arbitrarily large, while for p = 2 it is at most O(d^2 (nd)). As corollaries of our main lower bound, we obtain new lower bounds for a wide range of applications, including the above, which in many cases are optimal.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/14/2022

The ℓ_p-Subspace Sketch Problem in Small Dimensions with Applications to Support Vector Machines

In the ℓ_p-subspace sketch problem, we are given an n× d matrix A with n...
research
07/12/2019

Towards Optimal Moment Estimation in Streaming and Distributed Models

One of the oldest problems in the data stream model is to approximate th...
research
07/17/2022

Streaming Algorithms with Large Approximation Factors

We initiate a broad study of classical problems in the streaming model w...
research
11/03/2021

Nearly Tight Lower Bounds for Succinct Range Minimum Query

Given an array of distinct integers A[1… n], the Range Minimum Query (RM...
research
04/18/2019

An extremal problem for integer sparse recovery

Motivated by the problem of integer sparse recovery we study the followi...
research
12/21/2021

Lower Bounds for Sparse Oblivious Subspace Embeddings

An oblivious subspace embedding (OSE), characterized by parameters m,n,d...
research
12/24/2019

The aBc Problem and Equator Sampling Renyi Divergences

We investigate the problem of approximating the product a^TBc, where a,c...

Please sign up or login with your details

Forgot password? Click here to reset