The complexity of interior point methods for solving discounted turn-based stochastic games

Abstract

We study the problem of solving discounted, two-player, turn-based stochastic games (2TBSGs). Jurdzinski and Savani showed that 2TBSGs with deterministic transitions can be reduced to solving P-matrix linear complementarity problems (LCPs). We show that the same reduction works for general 2TBSGs. This implies that a number of interior point methods for solving P-matrix LCPs can be used to solve 2TBSGs. We consider two such algorithms. First, we consider the unified interior point method of Kojima, Megiddo, Noma, and Yoshise, which runs in time $O((1+\kappa)n^{3.5}L)$, where $\kappa$ is a parameter that depends on the $n \times n$ matrix $M$ defining the LCP, and $L$ is the number of bits in the representation of $M$. Second, we consider the interior point potential reduction algorithm of Kojima, Megiddo, and Ye, which runs in time $O(\frac{-\delta}{\theta} n^4 \log \epsilon^{-1})$, where $\delta$ and $\theta$ are parameters that depend on $M$, and $\epsilon$ describes the quality of the solution. For 2TBSGs with $n$ states and discount factor $\gamma$ we prove that in the worst case $\kappa = \Theta(n/(1-\gamma)^2)$, $-\delta = \Theta(\sqrt{n}/(1-\gamma))$, and $1/\theta = \Theta(n/(1-\gamma)^2)$. The lower bounds for $\kappa$, $-\delta$, and $1/\theta$ are obtained using the same family of deterministic games.
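
To make the objects in the abstract concrete, the following is a minimal sketch of what a P-matrix LCP is, assuming the standard formulation: given $M$ and $q$, find $w, z \ge 0$ with $w = Mz + q$ and $w_i z_i = 0$ for every $i$. A matrix is a P-matrix when all of its principal minors are positive. The function names (`is_p_matrix`, `solve_lcp_bruteforce`) are hypothetical, and the solver simply enumerates supports in exponential time; it is not either of the interior point methods analysed in the paper, and it does not construct $M$ and $q$ from a game.

```python
from fractions import Fraction
from itertools import combinations


def det(a):
    """Determinant by Laplace expansion along the first row (tiny matrices only)."""
    if not a:
        return Fraction(1)
    total = Fraction(0)
    for j, entry in enumerate(a[0]):
        minor = [row[:j] + row[j + 1:] for row in a[1:]]
        total += (-1) ** j * Fraction(entry) * det(minor)
    return total


def is_p_matrix(m):
    """True iff every principal minor of the square matrix m is positive."""
    n = len(m)
    for size in range(1, n + 1):
        for idx in combinations(range(n), size):
            sub = [[m[i][j] for j in idx] for i in idx]
            if det(sub) <= 0:
                return False
    return True


def solve_linear(a, b):
    """Solve a x = b exactly by Gauss-Jordan elimination; None if singular."""
    n = len(a)
    aug = [[Fraction(v) for v in row] + [Fraction(b[i])] for i, row in enumerate(a)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            return None
        aug[col], aug[pivot] = aug[pivot], aug[col]
        aug[col] = [v / aug[col][col] for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[n] for row in aug]


def solve_lcp_bruteforce(m, q):
    """Find (w, z) with w = m z + q, w >= 0, z >= 0, w_i * z_i = 0 for all i.

    Tries every candidate support set of z, so it is exponential in n.
    """
    n = len(m)
    for size in range(n + 1):
        for support in combinations(range(n), size):
            # On the support we force w_i = 0, i.e. M_SS z_S = -q_S.
            sub = [[m[i][j] for j in support] for i in support]
            rhs = [-Fraction(q[i]) for i in support]
            z_s = solve_linear(sub, rhs) if support else []
            if z_s is None:
                continue
            z = [Fraction(0)] * n
            for i, v in zip(support, z_s):
                z[i] = v
            w = [sum(Fraction(m[i][j]) * z[j] for j in range(n)) + q[i]
                 for i in range(n)]
            if all(v >= 0 for v in z) and all(v >= 0 for v in w):
                return w, z
    return None


M = [[2, 1], [1, 2]]   # illustrative P-matrix, not derived from a game
q = [-5, -6]
print(is_p_matrix(M))
print(solve_lcp_bruteforce(M, q))
```

Exact `Fraction` arithmetic is used so that the sign checks on minors and on $w$, $z$ are not affected by rounding. For a P-matrix the LCP has a unique solution for every $q$, which is why returning the first feasible support is enough here.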
