Dynamical systems associated to the β-core in the repeated prisoner's dilemma
Abstract
We consider the repeated prisoner's dilemma (PD). We assume that players make their choices knowing only average payoffs from the previous stages. A player's strategy is a function from the convex hull S of the set of payoffs into the set \C,\,D\ (C means cooperation, D -- defection). S. Smale in smale presented an idea of good strategies in the repeated PD. If both players play good strategies then the average payoffs tends to the payoff corresponding to the profile (C,C) in PD. We adopt the Smale idea to define semi-cooperative strategies - players do not take as a referencing point the payoff corresponding to the profile (C,C), but they can take an arbitrary payoff belonging to the β-core of PD. We show that if both players choose the same point in the β-core then the strategy profile is an equilibrium in the repeated game. If the players choose different points in the β-core then the sequence of the average payoffs tends to a point in S. The obtained limit can be treated as a payoff in a new game. In this game the set of players' actions is the set of points in S that corresponds to the β-core payoffs.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.