Likelihood Ratio Gradient Estimation for Steady-State Parameters

Abstract

We consider a discrete-time Markov chain on a general state-space X, whose transition probabilities are parameterized by a real-valued vector θ. Under the assumption that is geometrically ergodic with corresponding stationary distribution π(θ), we are interested in estimating the gradient ∇ α(θ) of the steady-state expectation α(θ) = π( θ) f. To this end, we first give sufficient conditions for the differentiability of α(θ) and for the calculation of its gradient via a sequence of finite horizon expectations. We then propose two different likelihood ratio estimators and analyze their limiting behavior.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…