Markov Rewards Processes with Impulse Rewards and Absorbing States
Abstract
We study the expected accumulated reward for a discrete-time Markov reward model with absorbing states. The rewards are impulse rewards, where a reward ij is accumulated when transitioning from state i to state j. We derive an explicit, single-letter expression for the expected accumulated reward as a function of the number of time steps n and include in our analysis the limit in which n ∞.
0
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.