Maximum likelihood estimation for nonembeddable Markov chains when the cycle length is shorter than the data observation interval

Abstract

Time-homogeneous Markov chains are often used as disease progression models in studies of cost-effectiveness and optimal decision-making. Maximum likelihood estimation of these models can be challenging when data are collected at a time interval longer than the model's transition cycle length. For example, it may be necessary to estimate a monthly transition model from data collected annually. The likelihood for a time-homogeneous Markov chain with transition matrix P and data observed at intervals of T cycles is a function of PT. The maximum likelihood estimate of PT is easily obtained from the data. The Tth root of this estimate would then be a maximum likelihood estimate for P. However, the Tth root of PT is not necessarily a valid transition matrix. Maximum likelihood estimation of P is a constrained optimization problem when a valid root is unavailable. The optimization problem is not convex. Local convergence is explored in several case studies through graphical representations of a grid search. The example cases use disease progression data from the literature as well as synthetic data. The global maximum likelihood estimate is increasingly difficult to locate as the number of cycles or the number of states increases. What seems like a straightforward estimation problem can be challenging even for relatively simple models. Researchers should consider alternatives to likelihood maximization or alternative models.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…