Spectral Guarantees for Adversarial Streaming PCA
Abstract
In streaming PCA, we see a stream of vectors x1, …c, xn ∈ Rd and want to estimate the top eigenvector of their covariance matrix. This is easier if the spectral ratio R = λ1 / λ2 is large. We ask: how large does R need to be to solve streaming PCA in O(d) space? Existing algorithms require R = (d). We show: (1) For all mergeable summaries, R = (d) is necessary. (2) In the insertion-only model, a variant of Oja's algorithm gets o(1) error for R = O( n d). (3) No algorithm with o(d2) space gets o(1) error for R = O(1). Our analysis is the first application of Oja's algorithm to adversarial streams. It is also the first algorithm for adversarial streaming PCA that is designed for a spectral, rather than Frobenius, bound on the tail; and the bound it needs is exponentially better than is possible by adapting a Frobenius guarantee.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.