Exploiting Numerical Sparsity for Efficient Learning : Faster Eigenvector Computation and Regression
Abstract
In this paper, we obtain improved running times for regression and top eigenvector computation for numerically sparse matrices. Given a data matrix A ∈ Rn × d where every row a ∈ Rd has \|a\|22 ≤ L and numerical sparsity at most s, i.e. \|a\|12 / \|a\|22 ≤ s, we provide faster algorithms for these problems in many parameter settings. For top eigenvector computation, we obtain a running time of O(nd + r(s + r s) / gap2) where gap > 0 is the relative gap between the top two eigenvectors of A A and r is the stable rank of A. This running time improves upon the previous best unaccelerated running time of O(nd + r d / gap2) as it is always the case that r ≤ d and s ≤ d. For regression, we obtain a running time of O(nd + (nL / μ) s nL / μ) where μ > 0 is the smallest eigenvalue of A A. This running time improves upon the previous best unaccelerated running time of O(nd + n L d / μ). This result expands the regimes where regression can be solved in nearly linear time from when L/μ = O(1) to when L / μ = O(d2/3 / (sn)1/3). Furthermore, we obtain similar improvements even when row norms and numerical sparsities are non-uniform and we show how to achieve even faster running times by accelerating using approximate proximal point [Frostig et. al. 2015] / catalyst [Lin et. al. 2015]. Our running times depend only on the size of the input and natural numerical measures of the matrix, i.e. eigenvalues and p norms, making progress on a key open problem regarding optimal running times for efficient large-scale learning.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.