Convergence rates of regularized quasi-Newton methods without strong convexity
Abstract
In this paper, we study convergence rates of the cubic regularized proximal quasi-Newton method () for solving non-smooth additive composite problems that satisfy the so-called Kurdyka- ojasiewicz (K ) property with respect to some desingularization function φ rather than strong convexity. After a number of iterations k0, Cubic SR1 PQN exhibits non-asymptotic explicit super-linear convergence rates for any k≥ k0. In particular, when φ(t)=ct1/2, Cubic SR1 PQN has a convergence rate of order (C(k-k0)1/2)(k-k0)/2, where k is the number of iterations and C>0 is a constant. For the special case, i.e. functions which satisfy ojasiewicz inequality, the rate becomes global and non-asymptotic. This work presents, for the first time, non-asymptotic explicit convergence rates of regularized (proximal) SR1 quasi-Newton methods applied to non-convex non-smooth problems with K\ property. Actually, the rates are novel even in the smooth non-convex case. Notably, we achieve this without employing line search or trust region strategies, without assuming the Dennis-Mor\'e condition, without any assumptions on quasi-Newton metrics and without assuming strong convexity. Furthermore, for convex problems, we focus on a more tractable gradient regularized quasi-Newton method (Grad SR1 PQN) which can achieve results similar to those obtained with cubic regularization. We also demonstrate, for the first time, the non-asymptotic super-linear convergence rate of Grad SR1 PQN for solving convex problems with the help of the ojasiewicz inequality instead of strong convexity.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.