Acceleration by Stepsize Hedging II: Silver Stepsize Schedule for Smooth Convex Optimization
Abstract
We provide a concise, self-contained proof that the Silver Stepsize Schedule proposed in Part I directly applies to smooth (non-strongly) convex optimization. Specifically, we show that with these stepsizes, gradient descent computes an ε-minimizer in O(ε- 2) = O(ε-0.7864) iterations, where = 1+2 is the silver ratio. This is intermediate between the textbook unaccelerated rate O(ε-1) and the accelerated rate O(ε-1/2) due to Nesterov in 1983. The Silver Stepsize Schedule is a simple explicit fractal: the i-th stepsize is 1+v(i)-1 where v(i) is the 2-adic valuation of i. The design and analysis are conceptually identical to the strongly convex setting in Part I, but simplify remarkably in this specific setting.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.