On the Complexity of Finite-Sum Smooth Optimization under the Polyak-ojasiewicz Condition

Abstract

This paper considers the optimization problem of the form x∈ Rd f( x) 1nΣi=1n fi( x), where f(·) satisfies the Polyak--ojasiewicz (PL) condition with parameter μ and \fi(·)\i=1n is L-mean-squared smooth. We show that any gradient method requires at least (n+n(1/ε)) incremental first-order oracle (IFO) calls to find an ε-suboptimal solution, where L/μ is the condition number of the problem. This result nearly matches upper bounds of IFO complexity for best-known first-order methods. We also study the problem of minimizing the PL function in the distributed setting such that the individuals f1(·),…,fn(·) are located on a connected network of n agents. We provide lower bounds of (/γ\,(1/ε)), ((+τ/γ\,)(1/ε)) and (n+n(1/ε)) for communication rounds, time cost and local first-order oracle calls respectively, where γ∈(0,1] is the spectral gap of the mixing matrix associated with the network and~τ>0 is the time cost of per communication round. Furthermore, we propose a decentralized first-order method that nearly matches above lower bounds in expectation.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…