On the convergence of decentralized gradient descent with diminishing stepsize, revisited

Abstract

Distributed optimization has received a lot of interest in recent years due to its wide applications in various fields. In this work, we revisit the convergence property of the decentralized gradient descent [A. Nedi\'c-A.Ozdaglar (2009)] on the whole space given by xi(t+1) = Σmj=1wijxj(t) - α(t) ∇ fi(xi(t)), where the stepsize is given as α (t) = a(t+w)p with 0< p≤ 1. Under the strongly convexity assumption on the total cost function f with local cost functions fi not necessarily being convex, we show that the sequence converges to the optimizer with rate O(t-p) when the values of a>0 and w>0 are suitably chosen.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…