Phase transition of the Sinkhorn-Knopp algorithm
Abstract
The matrix scaling problem, particularly the Sinkhorn-Knopp algorithm, has been studied for over 60 years. In practice, the algorithm often yields high-quality approximations within just a few iterations. Theoretically, however, the best-known upper bound places it in the class of pseudopolynomial-time approximation algorithms. Meanwhile, the lower-bound landscape remains largely unexplored. Two fundamental questions persist: what accounts for the algorithm's strong empirical performance, and can a tight bound on its iteration count be established? For an n× n matrix, its normalized version is obtained by dividing each entry by its largest entry. We say that a normalized matrix has a density γ if there exists a constant > 0 such that one row or column has exactly γ n entries with values at least , and every other row and column has at least γ n such entries. For the upper bound, we show that the Sinkhorn-Knopp algorithm produces a nearly doubly stochastic matrix in O( n - ) iterations and O(n2) time for all nonnegative square matrices whose normalized version has a density γ > 1/2. Such matrices cover both the algorithm's principal practical inputs and its typical theoretical regime, and the O(n2) runtime is optimal. For the lower bound, we establish a tight bound of (n1/2/) iterations for positive matrices under the 2-norm error measure. Moreover, for every γ < 1/2, there exists a matrix with density γ for which the algorithm requires (n1/2/) iterations. In summary, our results reveal a sharp phase transition in the Sinkhorn-Knopp algorithm at the density threshold γ = 1/2.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.