Generalised Mixability, Constant Regret, and Bayesian Updating
Abstract
Mixability of a loss is known to characterise when constant regret bounds are achievable in games of prediction with expert advice through the use of Vovk's aggregating algorithm. We provide a new interpretation of mixability via convex analysis that highlights the role of the Kullback-Leibler divergence in its definition. This naturally generalises to what we call -mixability where the Bregman divergence D replaces the KL divergence. We prove that losses that are -mixable also enjoy constant regret bounds via a generalised aggregating algorithm that is similar to mirror descent.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.