Convergence of Gradient Descent with Small Initialization for Unregularized Matrix Completion

Abstract

We study the problem of symmetric matrix completion, where the goal is to reconstruct a positive semidefinite matrix X ∈ Rd× d of rank-r, parameterized by UU, from only a subset of its observed entries. We show that the vanilla gradient descent (GD) with small initialization provably converges to the ground truth X without requiring any explicit regularization. This convergence result holds true even in the over-parameterized scenario, where the true rank r is unknown and conservatively over-estimated by a search rank r' r. The existing results for this problem either require explicit regularization, a sufficiently accurate initial point, or exact knowledge of the true rank r. In the over-parameterized regime where r'≥ r, we show that, with (dr9) observations, GD with an initial point \|U0\| ≤ ε converges near-linearly to an ε-neighborhood of X. Consequently, smaller initial points result in increasingly accurate solutions. Surprisingly, neither the convergence rate nor the final accuracy depends on the over-parameterized search rank r', and they are only governed by the true rank r. In the exactly-parameterized regime where r'=r, we further enhance this result by proving that GD converges at a faster rate to achieve an arbitrarily small accuracy ε>0, provided the initial point satisfies \|U0\| = O(1/d). At the crux of our method lies a novel weakly-coupled leave-one-out analysis, which allows us to establish the global convergence of GD, extending beyond what was previously possible using the classical leave-one-out analysis.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…