Krylov Methods are (nearly) Optimal for Low-Rank Approximation

Shyam Narayanan

Krylov Methods are (nearly) Optimal for Low-Rank Approximation

Abstract

We consider the problem of rank-1 low-rank approximation (LRA) in the matrix-vector product model under various Schatten norms: \|u\|2=1 \|A (I - u u)\|Sp , where \|M\|Sp denotes the p norm of the singular values of M. Given >0, our goal is to output a unit vector v such that \|A(I - vv)\|Sp ≤ (1+) \|u\|2=1\|A(I - u u)\|Sp. Our main result shows that Krylov methods (nearly) achieve the information-theoretically optimal number of matrix-vector products for Spectral (p=∞), Frobenius (p=2) and Nuclear (p=1) LRA. In particular, for Spectral LRA, we show that any algorithm requires ((n)/1/2) matrix-vector products, exactly matching the upper bound obtained by Krylov methods [MM15, BCW22]. Our lower bound addresses Open Question 1 in [Woo14], providing evidence for the lack of progress on algorithms for Spectral LRA and resolves Open Question 1.2 in [BCW22]. Next, we show that for any fixed constant p, i.e. 1≤ p =O(1), there is an upper bound of O((1/)/1/3) matrix-vector products, implying that the complexity does not grow as a function of input size. This improves the O((n/)/1/3) bound recently obtained in [BCW22], and matches their (1/1/3) lower bound, to a (1/) factor.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Or compile a full topic from this idea

Discussion (0)

Sign in to join the discussion.

Loading comments…