Sketched SVD: Recovering Spectral Features from Compressive Measurements
Abstract
We consider a streaming data model in which n sensors observe individual streams of data, presented in a turnstile model. Our goal is to analyze the singular value decomposition (SVD) of the matrix of data defined implicitly by the stream of updates. Each column i of the data matrix is given by the stream of updates seen at sensor i. Our approach is to sketch each column of the matrix, forming a "sketch matrix" Y, and then to compute the SVD of the sketch matrix. We show that the singular values and right singular vectors of Y are close to those of X, with small relative error. We also believe that this bound is of independent interest in non-streaming and non-distributed data collection settings. Assuming that the data matrix X is of size Nxn, then with m linear measurements of each column of X, we obtain a smaller matrix Y with dimensions mxn. If m = O(k ε-2 (log(1/ε) + log(1/δ)), where k denotes the rank of X, then with probability at least 1-δ, the singular values σ'j of Y satisfy the following relative error result (1-ε)(1/2)<= σ'j/σj <= (1 + ε)(1/2) as compared to the singular values σj of the original matrix X. Furthermore, the right singular vectors v'j of Y satisfy ||vj-vj'||2 <= min(sqrt2, (ε1+ε)/(1-ε) maxi≠ j (2σiσj)/(minc∈[-1,1](|σ2i-σ2j(1+cε)|))) as compared to the right singular vectors vj of X. We apply this result to obtain a streaming graph algorithm to approximate the eigenvalues and eigenvectors of the graph Laplacian in the case where the graph has low rank (many connected components).
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.