Sharp convergence rates for Langevin dynamics in the nonconvex setting

Abstract

We study the problem of sampling from a distribution p*(x) (-U(x)), where the function U is L-smooth everywhere and m-strongly convex outside a ball of radius R, but potentially nonconvex inside this ball. We study both overdamped and underdamped Langevin MCMC and establish upper bounds on the number of steps required to obtain a sample from a distribution that is within ε of p* in 1-Wasserstein distance. For the first-order method (overdamped Langevin MCMC), the iteration complexity is O(ecLR2d/ε2), where d is the dimension of the underlying space. For the second-order method (underdamped Langevin MCMC), the iteration complexity is O(ecLR2d/ε) for an explicit positive constant c. Surprisingly, the iteration complexity for both these algorithms is only polynomial in the dimension d and the target accuracy ε. It is exponential, however, in the problem parameter LR2, which is a measure of non-log-concavity of the target distribution.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…