Nesterov's accelerated gradient for unbounded convex functions finds the minimum-norm point in the dual space

Abstract

We study the behavior of first-order methods applied to a lower-unbounded convex function f, i.e., ∈f f = -∞. Such a setting has received little attention since the trajectories of gradient descent and Nesterov's accelerated gradient method diverge. In this paper, we establish quantitative convergence results describing their speeds and directions of divergence, with implications for unboundedness judgment. A key idea is a relation to a norm-minimization problem in the dual space: minimize \|p\|2/2 over p ∈ domf, which can be naturally solved via mirror descent by taking the Legendre--Fenchel conjugate f as the distance-generating function. It then turns out that gradient descent for f coincides with mirror descent for this norm-minimization problem, and thus it simultaneously solves both problems at O(k-1). This result admits acceleration; Nesterov's accelerated gradient method, without any modifications, simultaneously solves the original minimization and the dual norm-minimization problems at O(k-2), providing a quantitative characterization of divergence in unbounded convex optimization.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…