Finite-Horizon Partially Observable Semi-Markov Games with Risk Probability Criteria

Abstract

This paper studies partially observable two-person zero-sum semi-Markov games under a probability criterion, in which the system state may not be completely observed. It focuses on the probability that the accumulated rewards of player 1 (i.e., the incurred costs of player 2) fall short of a specified target at the terminal stage, which represents the risk of player 1 and the capacity of player 2. We study the game model via the technology of augmenting state space with the joint conditional distribution of the current unobserved state and the remaining goal. Under a mild condition, we establish a comparison theorem and derive the Shapley equation for the probability criterion. As a consequence, we prove the existence and the uniqueness of the value function and the existence of a Nash equilibrium.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…