Thresholds for sensitive optimality and Blackwell optimality in stochastic games
Abstract
We investigate refinements of the mean-payoff criterion in two-player zero-sum perfect-information stochastic games. A strategy is Blackwell optimal if it is optimal in the discounted game for all discount factors sufficiently close to 1. The notion of d-sensitive optimality interpolates between mean-payoff optimality (corresponding to the case d=-1) and Blackwell optimality (d=+∞). The Blackwell threshold α Bw ∈ [0,1[ is the discount factor above which all optimal strategies in the discounted game are guaranteed to be Blackwell optimal. The d-sensitive threshold α d ∈ [0,1[ is defined analogously. Bounding α Bw and α d are fundamental problems in algorithmic game theory, since these thresholds control the complexity for computing Blackwell and d-sensitive optimal strategies, by reduction to discounted games which can be solved in O((1-α)-1) iterations. We provide the first bounds on the d-sensitive threshold α d beyond the case d=-1, and we establish improved bounds for the Blackwell threshold α Bw. This is achieved by leveraging separation bounds on algebraic numbers, relying on Lagrange bounds and more advanced techniques based on Mahler measures and multiplicity theorems.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.