Spectral Guarantees for Adversarial Streaming PCA

Abstract

In streaming PCA, we see a stream of vectors x1, …c, xn ∈ Rd and want to estimate the top eigenvector of their covariance matrix. This is easier if the spectral ratio R = λ1 / λ2 is large. We ask: how large does R need to be to solve streaming PCA in O(d) space? Existing algorithms require R = (d). We show: (1) For all mergeable summaries, R = (d) is necessary. (2) In the insertion-only model, a variant of Oja's algorithm gets o(1) error for R = O( n d). (3) No algorithm with o(d2) space gets o(1) error for R = O(1). Our analysis is the first application of Oja's algorithm to adversarial streams. It is also the first algorithm for adversarial streaming PCA that is designed for a spectral, rather than Frobenius, bound on the tail; and the bound it needs is exponentially better than is possible by adapting a Frobenius guarantee.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…