Time-inconsistent Risk-sensitive Equilibrium for Countable-stated Markov Decision Processes
Abstract
This paper is devoted to solving a time-inconsistent risk-sensitive control problem with parameter and its limit case (→0+) for countable-stated Markov decision processes (MDPs for short). Since the cost functional is time-inconsistent, it is impossible to find a global optimal strategy for both cases. Instead, for each case, we will prove the existence of time-inconstant equilibrium strategies which verify the so-called step-optimality. Moreover, we prove the convergence of -equilibriums and the corresponding value functions as →0+.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.