Learning Sparse Polymatrix Games in Polynomial Time and Sample Complexity

Jean Honorio

Abstract

We consider the problem of learning sparse polymatrix games from observations of strategic interactions. We show that a polynomial-time method based on $\ell_{1,2}$-group regularized logistic regression recovers a game whose Nash equilibria are the $\epsilon$-Nash equilibria of the game from which the data was generated (the true game), using $\mathcal{O}(m^4 d^4 \log(pd))$ samples of strategy profiles, where $m$ is the maximum number of pure strategies of a player, $p$ is the number of players, and $d$ is the maximum degree of the game graph. Under slightly more stringent separability conditions on the payoff matrices of the true game, we show that our method learns a game with exactly the same Nash equilibria as the true game. We also show that $\Omega(d \log(pm))$ samples are necessary for any method to consistently recover a game with the same Nash equilibria as the true game from observations of strategic interactions. We verify our theoretical results through simulation experiments.
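
To make the role of the $\ell_{1,2}$ group penalty concrete, the following is a minimal sketch of group soft-thresholding, the standard proximal operator of a sum of per-group Euclidean norms. It is not taken from the paper and is not the authors' algorithm. The function names, the threshold `tau`, and the reading of each group as the block of parameters linking one player to one candidate neighbor are illustrative assumptions.

```python
import math

def group_soft_threshold(groups, tau):
    """Proximal operator of tau * sum_g ||w_g||_2 (an l_{1,2} group norm).

    `groups` is a list of parameter blocks (lists of floats). Here each block
    is assumed, for illustration only, to hold the parameters tying one player
    to one candidate neighbor.
    """
    result = []
    for g in groups:
        norm = math.sqrt(sum(x * x for x in g))
        # Shrink the whole block toward zero; blocks with norm <= tau vanish.
        scale = max(0.0, 1.0 - tau / norm) if norm > 0 else 0.0
        result.append([scale * x for x in g])
    return result

def l12_norm(groups):
    """Sum of the Euclidean norms of the blocks."""
    return sum(math.sqrt(sum(x * x for x in g)) for g in groups)

# Hypothetical example: one strong block and one weak block.
blocks = [[3.0, 4.0], [0.1, -0.2]]
shrunk = group_soft_threshold(blocks, tau=1.0)
```

Because the penalty acts on whole blocks rather than on individual entries, a weak block is set to zero as a unit. In this setting that would correspond to dropping a candidate neighbor from the estimated game graph, which is how a group penalty encourages a sparse graph.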
