Perturbation Analysis of Randomized SVD and its Applications to Statistics

Abstract

Randomized singular value decomposition (RSVD) is a class of computationally efficient algorithms for computing the truncated SVD of large data matrices. Given an m × n matrix M, the prototypical RSVD algorithm outputs an approximation of the k leading left singular vectors of M by computing the SVD of M ( M M)g G; here g ≥ 1 is an integer and G ∈ Rn × k is a random Gaussian sketching matrix with k ≥ k. In this paper we derive upper bounds for the 2 and 2,∞ distances between the exact left singular vectors U of M and its approximation Ug (obtained via RSVD), as well as entrywise error bounds when M is projected onto Ug Ug. These bounds depend on the singular values gap and number of power iterations g, and smaller gap requires larger values of g to guarantee the convergences of the 2 and 2,∞ distances. We apply our theoretical results to settings where M is an additive perturbation of some unobserved signal matrix M. In particular, we obtain the nearly-optimal convergence rate and asymptotic normality for RSVD on three inference problems, namely, subspace estimation and community detection in random graphs, noisy matrix completion, and PCA with missing data.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…