Sharper Rates for Separable Minimax and Finite Sum Optimization via Primal-Dual Extragradient Methods
Abstract
We design accelerated algorithms with improved rates for several fundamental classes of optimization problems. Our algorithms all build upon techniques related to the analysis of primal-dual extragradient methods via relative Lipschitzness proposed recently by [CST21]. (1) Separable minimax optimization. We study separable minimax optimization problems x y f(x) - g(y) + h(x, y), where f and g have smoothness and strong convexity parameters (Lx, μx), (Ly, μy), and h is convex-concave with a (xx, xy, yy)-blockwise operator norm bounded Hessian. We provide an algorithm with gradient query complexity O(Lxμx + Lyμy + xxμx + xyμxμy + yyμy). Notably, for convex-concave minimax problems with bilinear coupling (e.g.\ quadratics), where xx = yy = 0, our rate matches a lower bound of [ZHZ19]. (2) Finite sum optimization. We study finite sum optimization problems x 1nΣi∈[n] fi(x), where each fi is Li-smooth and the overall problem is μ-strongly convex. We provide an algorithm with gradient query complexity O(n + Σi∈[n] Linμ ). Notably, when the smoothness bounds \Li\i∈[n] are non-uniform, our rate improves upon accelerated SVRG [LMH15, FGKS15] and Katyusha [All17] by up to a n factor. (3) Minimax finite sums. We generalize our algorithms for minimax and finite sum optimization to solve a natural family of minimax finite sum optimization problems at an accelerated rate, encapsulating both above results up to a logarithmic factor.