A single-loop proximal-conditional-gradient penalty method
Abstract
We consider the problem of minimizing a convex separable objective (as a separable sum of two proper closed convex functions f and g) over a linear coupling constraint. We assume that f can be decomposed as the sum of a smooth part having H\"older continuous gradient (with exponent μ∈(0,1]) and a nonsmooth part that admits efficient proximal mapping computations, while g can be decomposed as the sum of a smooth part having H\"older continuous gradient (with exponent ∈(0,1]) and a nonsmooth part that admits efficient linear oracles. Motivated by the recent works [1,49], we propose a single-loop variant of the standard penalty method, which we call a single-loop proximal-conditional-gradient penalty method (proxCG pen1), for this problem. In each iteration of proxCG pen1, we successively perform one proximal-gradient step involving f and one conditional-gradient step involving g on the quadratic penalty function, followed by an update of the penalty parameter. We present explicit rules for updating the penalty parameter and the stepsize in the conditional-gradient step in each iteration. Under a standard constraint qualification and domain boundedness assumption, we show that the objective value deviations (from the optimal value) along the sequence generated decay in the order of t-\μ,,1/2\ with the associated feasibility violations decaying in the order of t-1/2. Moreover, if the nonsmooth parts are indicator functions and the extended objective is a KL function with exponent α∈[0,1), then the distances to the optimal solution set along the sequence generated by proxCG pen1 decay asymptotically at a rate of t-(1-α)\μ,,1/2\. Finally, we illustrate numerically the behavior of proxCG pen1 on solving low rank Hankel matrix completion problems.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.