Fast convex optimization via inertial dynamics with Hessian driven damping

Abstract

We first study the fast minimization properties of the trajectories of the second-order evolution equation x(t) + αt x(t) + β ∇2 (x(t))x (t) + ∇ (x(t)) = 0, where : H R is a smooth convex function acting on a real Hilbert space H, and α, β are positive parameters. This inertial system combines an isotropic viscous damping which vanishes asymptotically, and a geometrical Hessian driven damping, which makes it naturally related to Newton's and Levenberg-Marquardt methods. For α≥ 3, β >0, along any trajectory, fast convergence of the values (x(t))- H = O(t-2) is obtained, together with rapid convergence of the gradients ∇(x(t)) to zero. For α>3, just assuming that has minimizers, we show that any trajectory converges weakly to a minimizer of , and (x(t))- H = o(t-2). Strong convergence is established in various practical situations. For the strongly convex case, convergence can be arbitrarily fast depending on the choice of α. More precisely, we have (x(t))- H = O(t-23α). We extend the results to the case of a general proper lower-semicontinuous convex function : H → R \+∞ \. This is based on the fact that the inertial dynamic with Hessian driven damping can be written as a first-order system in time and space. By explicit-implicit time discretization, this opens a gate to new - possibly more rapid - inertial algorithms, expanding the field of FISTA methods for convex structured optimization problems.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…