Perturbation Analysis of Randomized SVD and its Applications to Statistics
Abstract
Randomized singular value decomposition (RSVD) is a class of computationally efficient algorithms for computing the truncated SVD of large data matrices. Given an m × n matrix M, the prototypical RSVD algorithm outputs an approximation of the k leading left singular vectors of M by computing the SVD of M ( M M)g G; here g ≥ 1 is an integer and G ∈ Rn × k is a random Gaussian sketching matrix with k ≥ k. In this paper we derive upper bounds for the 2 and 2,∞ distances between the exact left singular vectors U of M and its approximation Ug (obtained via RSVD), as well as entrywise error bounds when M is projected onto Ug Ug. These bounds depend on the singular values gap and number of power iterations g, and smaller gap requires larger values of g to guarantee the convergences of the 2 and 2,∞ distances. We apply our theoretical results to settings where M is an additive perturbation of some unobserved signal matrix M. In particular, we obtain the nearly-optimal convergence rate and asymptotic normality for RSVD on three inference problems, namely, subspace estimation and community detection in random graphs, noisy matrix completion, and PCA with missing data.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.