Algorithms for zero-sum stochastic games with the risk-sensitive average criterion
Abstract
This paper is an attempt to compute the value and saddle points of zero-sum risk-sensitive average stochastic games. For the average games with finite states and actions, we first introduce the so-called irreducibility coefficient and then establish its equivalence to the irreducibility condition. Using this equivalence,we develop an iteration algorithm to compute -approximations of the value (for any given >0) and show its convergence. Based on -approximations of the value and the irreducibility coefficient, we further propose another iteration algorithm, which is proved to obtain -saddle points in finite steps. Finally, a numerical example of energy management in smart grids is provided to illustrate our results.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.