Prisoners dilemma
Description
A two-actor game where each chooses cooperate or defect. The payoff structure is temptation > reward > punishment > sucker (T > R > P > S), with the property that defection is the dominant strategy: regardless of what the opponent does, defecting yields a higher payoff to you. So both rational actors defect, both receive the punishment payoff, when both could have received the higher reward payoff if they’d both cooperated. The tragedy is that the game’s structure produces the bad equilibrium even when both players individually prefer cooperation. The concept generalizes beyond literal prisoners. Two firms in price competition, two nations in arms-race, two athletes considering doping — anywhere the payoff structure satisfies T > R > P > S, the prisoners-dilemma fires. Repeated play partially dissolves the dilemma: when the game repeats indefinitely and players can condition on past behavior, cooperation can be sustained (Axelrod’s tit-for-tat is the canonical mechanism). Reputation, institutions, and credible commitment are the real-world mechanisms by which civilizations escape single-shot dilemmas. Distinct from tragedy-of-commons: prisoners-dilemma is the clean 2-person, 2-choice canonical case; tragedy-of-commons is the N-person continuous-resource generalization. Same dynamics, different cardinality.Triggers
User-initiated: User describes a coordination failure between actors with mixed incentives, or asks about cooperation/defection dynamics. Vocabulary cues: “prisoners dilemma,” “cooperate-or-defect,” “Nash equilibrium,” “arms race,” “tit-for-tat.” Agent-initiated: Agent notices a 2-player (or small-N) setup where each actor’s dominant strategy hurts the aggregate. Candidate inference: “is this a one-shot game; what’s the payoff matrix; what escape mechanism (repetition, reputation, commitment) is available?” Situation-shape signals: Two firms, two countries, two AI labs, two roommates. Mixed-motive games. Cooperation problems with monitoring costs. Whenever “if only we could trust each other to cooperate” is the lament.Exclusions
- Single payoff dominates — when one strategy dominates regardless of payoff structure (a pure coordination game with one equilibrium far above the other), the dilemma’s tension is absent.
- Communication and binding commitment available — when actors can credibly commit (contracts, escrow, third-party enforcement), the one-shot dilemma collapses; defection is no longer dominant because punishment is credible.
- Repeated game with sufficient shadow — when the future matters enough relative to one-shot gains, tit-for-tat and other reciprocity strategies sustain cooperation; the dilemma applies to the one-shot, not the iterated version.
- Aligned objectives — when actors share goals (parents raising shared children), the cooperation isn’t strategic; the concept mischaracterizes the relationship.
Structure
Relationships
- tragedy-of-commons — generalization to N-player continuous-resource case; structurally the same dynamics.
- mutualism — payoff-polarity opposite; same coordination setup, but mutualism’s payoffs reward cooperation rather than defection.
- doctrine — escape mechanisms (tit-for-tat, reputation, institutional enforcement) are doctrines that re-shape the game’s incentive structure.
- hoist-by-own-petard — each defector’s rationally-self-interested choice helps build the collective punishment they themselves receive.
- feedback-loop — repeated-play dynamics turn the single-shot dilemma into a feedback loop; cooperative reputation can develop, defection-spirals can also develop.
Examples
Arms races (Cold War; AI safety race) · economics
Arms races (Cold War; AI safety race) · economics
Doping in sports · human-physical-performance-and-recreation
Doping in sports · human-physical-performance-and-recreation
Advertising-spend escalation · economics
Advertising-spend escalation · economics
AI safety race dynamics · economics
AI safety race dynamics · economics
Axelrod, R. (1984). The Evolution of Cooperation — iterated tournament; tit-for-tat winner. · economics
Axelrod, R. (1984). The Evolution of Cooperation — iterated tournament; tit-for-tat winner. · economics
Deforestation by neighboring nations · economics
Deforestation by neighboring nations · economics
Flood, M., & Dresher, M. (1950, RAND Corporation) — original problem. · economics
Flood, M., & Dresher, M. (1950, RAND Corporation) — original problem. · economics
Merrill Flood & Melvin Dresher (1950, RAND); Albert Tucker named and framed the parable (1950); Axelrod, R. (1984) The Evolution of Cooperation — repeated-game analysis · economics
Merrill Flood & Melvin Dresher (1950, RAND); Albert Tucker named and framed the parable (1950); Axelrod, R. (1984) The Evolution of Cooperation — repeated-game analysis · economics
Nash, J. (1950). "Equilibrium points in n-person games." · economics
Nash, J. (1950). "Equilibrium points in n-person games." · economics
Oligopoly pricing · economics
Oligopoly pricing · economics
Open-source contribution · economics
Open-source contribution · economics
Schelling, T. (1960). The Strategy of Conflict — credibility, commitment, and escape mechanisms. · economics
Schelling, T. (1960). The Strategy of Conflict — credibility, commitment, and escape mechanisms. · economics
Tucker, A. W. (1950) — named the parable. · economics
Tucker, A. W. (1950) — named the parable. · economics
Vaccination free-riding · economics
Vaccination free-riding · economics