Sharp convergence rates for Langevin dynamics in the nonconvex setting

Michael I. Jordan

Sharp convergence rates for Langevin dynamics in the nonconvex setting

Abstract

We study the problem of sampling from a distribution p*(x) (-U(x)), where the function U is L-smooth everywhere and m-strongly convex outside a ball of radius R, but potentially nonconvex inside this ball. We study both overdamped and underdamped Langevin MCMC and establish upper bounds on the number of steps required to obtain a sample from a distribution that is within ε of p* in 1-Wasserstein distance. For the first-order method (overdamped Langevin MCMC), the iteration complexity is O(ecLR2d/ε2), where d is the dimension of the underlying space. For the second-order method (underdamped Langevin MCMC), the iteration complexity is O(ecLR2d/ε) for an explicit positive constant c. Surprisingly, the iteration complexity for both these algorithms is only polynomial in the dimension d and the target accuracy ε. It is exponential, however, in the problem parameter LR2, which is a measure of non-log-concavity of the target distribution.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Or compile a full topic from this idea

Discussion (0)

Sign in to join the discussion.

Loading comments…