Tight Bounds for the Subspace Sketch Problem with Applications
Abstract
In the subspace sketch problem one is given an n× d matrix A with O((nd)) bit entries, and would like to compress it in an arbitrary way to build a small space data structure Qp, so that for any given x ∈ Rd, with probability at least 2/3, one has Qp(x)=(1ε)\|Ax\|p, where p≥ 0, and where the randomness is over the construction of Qp. The central question is: How many bits are necessary to store Qp? This problem has applications to the communication of approximating the number of non-zeros in a matrix product, the size of coresets in projective clustering, the memory of streaming algorithms for regression in the row-update model, and embedding subspaces of Lp in functional analysis. A major open question is the dependence on the approximation factor ε. We show if p≥ 0 is not a positive even integer and d=((1/ε)), then (ε-2d) bits are necessary. On the other hand, if p is a positive even integer, then there is an upper bound of O(dp(nd)) bits independent of ε. Our results are optimal up to logarithmic factors, and show in particular that one cannot compress A to O(d) "directions" v1,…,vO(d), such that for any x, \|Ax\|1 can be well-approximated from v1,x,…, vO(d),x. Our lower bound rules out arbitrary functions of these inner products (and in fact arbitrary data structures built from A), and thus rules out the possibility of a singular value decomposition for 1 in a very strong sense. Indeed, as ε 0, for p = 1 the space complexity becomes arbitrarily large, while for p = 2 it is at most O(d2 (nd)). As corollaries of our main lower bound, we obtain new lower bounds for a wide range of applications, including the above, which in many cases are optimal.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.