Subspace Embeddings and p-Regression Using Exponential Random Variables
Abstract
Oblivious low-distortion subspace embeddings are a crucial building block for numerical linear algebra problems. We show for any real p, 1 ≤ p < ∞, given a matrix M ∈ Rn × d with n d, with constant probability we can choose a matrix with (1, n1-2/p) (d) rows and n columns so that simultaneously for all x ∈ Rd, \|Mx\|p ≤ \| Mx\|∞ ≤ (d) \|Mx\|p. Importantly, M can be computed in the optimal O((M)) time, where (M) is the number of non-zero entries of M. This generalizes all previous oblivious subspace embeddings which required p ∈ [1,2] due to their use of p-stable random variables. Using our matrices , we also improve the best known distortion of oblivious subspace embeddings of 1 into 1 with O(d) target dimension in O((M)) time from O(d3) to O(d2), which can further be improved to O(d3/2) 1/2 n if d = ( n), answering a question of Meng and Mahoney (STOC, 2013). We apply our results to p-regression, obtaining a (1+)-approximation in O((M) n) + (d/) time, improving the best known (d/) factors for every p ∈ [1, ∞) \2\. If one is just interested in a (d) rather than a (1+)-approximation to p-regression, a corollary of our results is that for all p ∈ [1, ∞) we can solve the p-regression problem without using general convex programming, that is, since our subspace embeds into ∞ it suffices to solve a linear programming problem. Finally, we give the first protocols for the distributed p-regression problem for every p ≥ 1 which are nearly optimal in communication and computation.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.