Markov Rewards Processes with Impulse Rewards and Absorbing States

Abstract

We study the expected accumulated reward for a discrete-time Markov reward model with absorbing states. The rewards are impulse rewards: a reward $\rho_{ij}$ is accumulated on each transition from state $i$ to state $j$. We derive an explicit, single-letter expression for the expected accumulated reward as a function of the number of time steps $n$, and our analysis includes the limit $n \to \infty$.
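
To make the quantity in the abstract concrete, the following minimal sketch computes the expected accumulated impulse reward by the standard backward recursion v^(n) = r + P v^(n-1). This is the textbook recursion, not the explicit expression derived in the paper. The three-state chain, the matrix names `P` and `rho`, and the function `expected_reward` are all hypothetical, chosen only for illustration.

```python
# Hypothetical 3-state chain: states 0 and 1 are transient, state 2 is absorbing.
P = [
    [0.5, 0.25, 0.25],
    [0.0, 0.5, 0.5],
    [0.0, 0.0, 1.0],
]

# Hypothetical impulse rewards: rho[i][j] is earned on a transition i -> j.
# Self-loops and the absorbing state earn nothing here.
rho = [
    [0.0, 1.0, 3.0],
    [0.0, 0.0, 2.0],
    [0.0, 0.0, 0.0],
]


def expected_reward(P, rho, start, n):
    """Expected reward accumulated over n transitions, starting in `start`."""
    size = len(P)
    # Expected one-step impulse reward from each state:
    # r[i] = sum_j P[i][j] * rho[i][j]
    r = [sum(P[i][j] * rho[i][j] for j in range(size)) for i in range(size)]
    # v[i] holds the expected reward over the steps processed so far.
    v = [0.0] * size
    for _ in range(n):
        v = [r[i] + sum(P[i][j] * v[j] for j in range(size)) for i in range(size)]
    return v[start]


if __name__ == "__main__":
    for n in (1, 2, 10, 200):
        print(n, expected_reward(P, rho, 0, n))
```

Because the absorbing state earns no reward in this example, the value increases with n and converges. For this chain the limit from state 0 is 3, which follows from solving v = r + P v by hand with v[2] = 0.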
