Normalized Gradients for All
Abstract
In this short note, I show how to adapt to Hölder smoothness using normalized gradients in a black-box way. Moreover, the resulting bound depends on a novel notion of local Hölder smoothness. The main idea comes directly from Levy [2017].