A Riemannian AdaGrad-Norm Method
Abstract
We propose a manifold AdaGrad-Norm method (MAdaGrad), which extends the norm version of AdaGrad (AdaGrad-Norm) to Riemannian optimization. In contrast to line-search schemes, which may require several exponential map computations per iteration, MAdaGrad requires only one. Assuming the objective function f has Lipschitz continuous Riemannian gradient, we show that the method requires at most O(-2) iterations to compute a point x such that \|grad f(x)\|≤ . Under the additional assumptions that f is geodesically convex and the manifold has sectional curvature bounded from below, we show that the method takes at most O(-1) to find x such that f(x)-flow≤ε, where flow is the optimal value. Moreover, if f satisfies the Polyak--ojasiewicz condition globally on the manifold, we establish a complexity bound of O((-1)), provided that the norm of the initial Riemannian gradient is sufficiently large. For the manifold of symmetric positive definite matrices, we construct a family of nonconvex functions satisfying the PL condition. Numerical experiments illustrate the remarkable performance of MAdaGrad in comparison with Riemannian Steepest Descent equipped with Armijo line-search.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.