Accelerating Nesterov's Method for Strongly Convex Functions with Lipschitz Gradient
Abstract
We modify Nesterov's constant step gradient method for strongly convex functions with Lipschitz continuous gradient described in Nesterov's book. Nesterov shows that f(xk) - f* ≤ L Πi=1k (1 - αk) \| x0 - x* \|22 with αk = for all k, where L is the Lipschitz gradient constant and is the reciprocal condition number of f(x). Hence the convergence rate is 1-. In this work, we try to accelerate Nesterov's method by adaptively searching for an αk > at each iteration. The proposed method evaluates the gradient function at most twice per iteration and has some extra Level 1 BLAS operations. Theoretically, in the worst case, it takes the same number of iterations as Nesterov's method does but doubles the gradient calls. However, in practice, the proposed method effectively accelerates the speed of convergence for many problems including a smoothed basis pursuit denoising problem.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.