Logarithmic-Regret Quantum Learning Algorithms for Zero-Sum Games
Abstract
We propose the first online quantum algorithm for solving zero-sum games with O(1) regret under the game setting. Moreover, our quantum algorithm computes an -approximate Nash equilibrium of an m × n matrix zero-sum game in quantum time O(m+n/2.5). Our algorithm uses standard quantum inputs and generates classical outputs with succinct descriptions, facilitating end-to-end applications. Technically, our online quantum algorithm "quantizes" classical algorithms based on the optimistic multiplicative weight update method. At the heart of our algorithm is a fast quantum multi-sampling procedure for the Gibbs sampling problem, which may be of independent interest.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.