Low-distortion Subspace Embeddings in Input-sparsity Time and Applications to Robust Linear Regression
Abstract
Low-distortion embeddings are critical building blocks for developing random sampling and random projection algorithms for linear algebra problems. We show that, given a matrix A ∈ n × d with n d and a p ∈ [1, 2), with a constant probability, we can construct a low-distortion embedding matrix ∈ O((d)) × n that embeds p, the p subspace spanned by A's columns, into (O((d)), \| · \|p); the distortion of our embeddings is only O((d)), and we can compute A in O((A)) time, i.e., input-sparsity time. Our result generalizes the input-sparsity time 2 subspace embedding by Clarkson and Woodruff [STOC'13]; and for completeness, we present a simpler and improved analysis of their construction for 2. These input-sparsity time p embeddings are optimal, up to constants, in terms of their running time; and the improved running time propagates to applications such as (1 ε)-distortion p subspace embedding and relative-error p regression. For 2, we show that a (1+ε)-approximate solution to the 2 regression problem specified by the matrix A and a vector b ∈ n can be computed in O((A) + d3 (d/ε) /ε2) time; and for p, via a subspace-preserving sampling procedure, we show that a (1 ε)-distortion embedding of p into O((d)) can be computed in O((A) · n) time, and we also show that a (1+ε)-approximate solution to the p regression problem x ∈ d \|A x - b\|p can be computed in O((A) · n + (d) (1/ε)/ε2) time. Moreover, we can improve the embedding dimension or equivalently the sample size to O(d3+p/2 (1/ε) / ε2) without increasing the complexity.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.