Models of Cooperation in Social Systems Example 1: Prisoner’s Dilemma Prisoner’s Dilemma: Player 2 Player 1 cooperate defect cooperatedefect 3, 30, 5.

Slides:



Advertisements
Similar presentations
1 G492, Big Ideas in Economics Lecture 1, January 14, 2010 This is not a complete lecture– just slides to supplement my lecture notes.
Advertisements

Tutorial 1 Ata Kaban School of Computer Science University of Birmingham.
Ultimatum Game Two players bargain (anonymously) to divide a fixed amount between them. P1 (proposer) offers a division of the “pie” P2 (responder) decides.
Nash’s Theorem Theorem (Nash, 1951): Every finite game (finite number of players, finite number of pure strategies) has at least one mixed-strategy Nash.
Mechanism Design without Money Lecture 1 Avinatan Hassidim.
Evolution and Repeated Games D. Fudenberg (Harvard) E. Maskin (IAS, Princeton)
Game Theory “Доверяй, Но Проверяй” (“Trust, but Verify”) - Russian Proverb (Ronald Reagan) Topic 5 Repeated Games.
Evolution of Cooperation The importance of being suspicious.
Infinitely Repeated Games. In an infinitely repeated game, the application of subgame perfection is different - after any possible history, the continuation.
6-1 LECTURE 6: MULTIAGENT INTERACTIONS An Introduction to MultiAgent Systems
Prospects (and perils) of modeling. “Idea” models Show that a proposed mechanism for a phenomenon is plausible Explore general mechanisms underlying behavior.
On the Genetic Evolution of a Perfect Tic-Tac-Toe Strategy
4 Why Should we Believe Politicians? Lupia and McCubbins – The Democratic Dilemma GV917.
Maynard Smith Revisited: Spatial Mobility and Limited Resources Shaping Population Dynamics and Evolutionary Stable Strategies Pedro Ribeiro de Andrade.
CITS4403 Computational Modelling Agent Based Models.
Fundamentals of Political Science Dr. Sujian Guo Professor of Political Science San Francisco State Unversity
EC – Tutorial / Case study Iterated Prisoner's Dilemma Ata Kaban University of Birmingham.
Slide 1 of 13 So... What’s Game Theory? Game theory refers to a branch of applied math that deals with the strategic interactions between various ‘agents’,
A camper awakens to the growl of a hungry bear and sees his friend putting on a pair of running shoes, “You can’t outrun a bear,” scoffs the camper. His.
Planning under Uncertainty
Objectives © Pearson Education, 2005 Oligopoly LUBS1940: Topic 7.
Minority Games A Complex Systems Project. Going to a concert… But which night to pick? Friday or Saturday? You want to go on the night with the least.
Tags and Image Scoring for Robust Cooperation By Nathan Griffiths Presented at AAMAS 2008.
A Memetic Framework for Describing and Simulating Spatial Prisoner’s Dilemma with Coalition Formation Sneak Review by Udara Weerakoon.
Are Markets Rational or Irrational? The El Farol Problem Inductive reasoning Bounded rationality W Brian Arthur.
6/2/2001 Cooperative Agent Systems: Artificial Agents Play the Ultimatum Game Steven O. Kimbrough Presented at FMEC 2001, Oslo Joint work with Fang Zhong.
Why How We Learn Matters Russell Golman Scott E Page.
Adam Smith Chapter 4 January 29-February 4, 2007.
Evolutionary Games The solution concepts that we have discussed in some detail include strategically dominant solutions equilibrium solutions Pareto optimal.
Profit Maximization, Supply, Market Structures, and Resource Allocation.
Robust Mechanisms for Information Elicitation Aviv Zohar & Jeffrey S. Rosenschein The Hebrew University.
Simulating the Evolution of Contest Escalation Winfried Just and Xiaolu Sun Department of Mathematics and Edison Biotechnology Institute Ohio University.
TOPIC 6 REPEATED GAMES The same players play the same game G period after period. Before playing in one period they perfectly observe the actions chosen.
Introduction to Game Theory and Behavior Networked Life CIS 112 Spring 2009 Prof. Michael Kearns.
DANSS Colloquium By Prof. Danny Dolev Presented by Rica Gonen
Evolutionary Games The solution concepts that we have discussed in some detail include strategically dominant solutions equilibrium solutions Pareto optimal.
Bottom-Up Coordination in the El Farol Game: an agent-based model Shu-Heng Chen, Umberto Gostoli.
Dynamic Models of Segregation
Learning in Multiagent systems
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
1 Near-Optimal Play in a Social Learning Game Ryan Carr, Eric Raboin, Austin Parker, and Dana Nau Department of Computer Science, University of Maryland.
Example Department of Computer Science University of Bologna Italy ( Decentralised, Evolving, Large-scale Information Systems (DELIS)
Rationality meets the tribe: Some models of cultural group selection David Hales, The Open University Hales, D., (2010) Rationality.
Presenter: Chih-Yuan Chou GA-BASED ALGORITHMS FOR FINDING EQUILIBRIUM 1.
Dynamic Games & The Extensive Form
Daniel Ariosa Ecole Polytechnique Fédérale de Lausanne (EPFL) Institut de Physique de la Matière Complexe CH-1015 Lausanne, Switzerland and Hugo Fort Instituto.
Modeling Reasoning in Strategic Situations Avi Pfeffer MURI Review Monday, December 17 th, 2007.
Evolving cooperation in one-time interactions with strangers Tags produce cooperation in the single round prisoner’s dilemma and it’s.
I THINK THAT YOU THINK THAT I THINK - GAME THEORY AND VIDEO GAMES Jonas Heide Smith Game Analysis ITU, “If I had my way, a solid.
Supply and Demand In a market economy (the one we will be studying from now on) the engine that drives it is supply and demand.
Section 2 – Ec1818 Jeremy Barofsky
Robert Axelrod’s Tournaments Robert Axelrod’s Tournaments, as reported in Axelrod, Robert. 1980a. “Effective Choice in the Prisoner’s Dilemma.” Journal.
Complexity in the Economy and Business IBM Almaden Institute April 12, 2007 W. Brian Arthur External Professor, Santa Fe Institute.
Game Theory by James Crissey Luis Mendez James Reid.
Ch 2. Economics may appear to be the study of complicated tables and charts, statistics and numbers, but, more specifically, it is the study of what constitutes.
Iterated Prisoner’s Dilemma Game in Evolutionary Computation Seung-Ryong Yang.
Smart Sleeping Policies for Wireless Sensor Networks Venu Veeravalli ECE Department & Coordinated Science Lab University of Illinois at Urbana-Champaign.
The Role of Altruistic Punishment in Promoting Cooperation
Evolving Strategies for the Prisoner’s Dilemma Jennifer Golbeck University of Maryland, College Park Department of Computer Science July 23, 2002.
The El Farol Bar Problem on Complex Networks Maziar Nekovee BT Research Mathematics of Networks, Oxford, 7/4/2006.
Econ 545, Spring 2016 Industrial Organization Anticompetitive actions: Cartels and collusion.
Consumption is the purpose of all economic activity Consumption Production LandCapital Labor Saving and investment Confidence in the future Respect for.
The Three Key Questions for Economic Systems. Three Economic Questions Every society must answer three economic questions… 1.WHAT? 2.HOW? 3.FOR WHOM?
Evolution of Cooperation in Mobile Ad Hoc Networks Jeff Hudack (working with some Italian guy)
Modeling Changes in Exploitative vs. Protective Behavior Joseph Blass Motivation and Questions Humans exploit others for selfish reasons Humans also protect.
Social Norm, Costly Punishment and the Evolution to Cooperation : Theory, Experiment and Simulation Tongkui Yu 1, 2, Shu-Heng Chen 2, Honggang Li 1* 1.
Indirect Reciprocity in the Selective Play Environment Nobuyuki Takahashi and Rie Mashima Department of Behavioral Science Hokkaido University 08/07/2003.
Game Theory and Cooperation
Evolving cooperation in one-time interactions with strangers
Presentation transcript:

Models of Cooperation in Social Systems

Example 1: Prisoner’s Dilemma Prisoner’s Dilemma: Player 2 Player 1 cooperate defect cooperatedefect 3, 30, 5 5, 01, 1

Dilemma: It’s better for any particular individual to defect. But in the long run, everyone is better off if everyone cooperates. So what are the conditions under which cooperation can be sustained? –This has been investigated by many people using agent- based models: Iterated games Norms and metanorms Spatial distribution of players

“Bob Axelrod’s work on the evolution of cooperation has done a great deal to rebuild the thinking world’s faith in the power of cooperation.”

Iterated games Axelrod’s tournament Axelrod’s genetic algorithm How to represent strategies?

Example 2: Spatial games and chaos M. Nowak and R. May (1992), “Evolutionary games and spatial chaos”, Nature. Experimented with “spatial” Prisoner’s Dilemma to investigate how cooperation can be sustained.

Spatial Prisoner’s dilemma model Players arrayed on a two-dimensional lattice, one player per site. Each player either always cooperates or always defects. Players have no memory. Each player plays with eight nearest neighbors. Score is sum of payoffs. Each site is then occupied by highest scoring player in its neighborhood. Updates are done synchronously.

NetLogo PDCA What are assumptions?

Interpretation Motivation for this work is “primarily biological”. “We believe that deterministically generated spatial structure within populations may often be crucial for the evolution of cooperation, whether it be among molecules, cells, or organisms.“ "That territoriality favours cooperation...is likely to remain valid for real-life communities“ (Karl Sigmund, Nature, 1992, in commentary on Nowak and May paper)

Some prospects for minimal models:

–Often no other way to make progress! –“All models are wrong. Some are useful.” –All the ones I talked about here were useful and led to new insights, new ways of thinking about complex systems, better models, and better understanding of how to build useful models.

Some perils for minimal models:

–Hidden unrealistic assumptions –Sensitivity to parameters –Unreplicability due to missing information in publications –Misinterpretation and over-reliance on results –Lack of theoretical insight

Norms “Norm”: Behavior prescribed by social setting. Individuals are punished for not acting according to norm. Examples: –Norm against cutting people off in traffic. –Norm for covering your nose and mouth when you sneeze –Norm for governments not to raise import taxes without sufficient cause

Axelrod’s (1986) Norms model Norms game: N-player game. 1.Individual i decides whether to cooperate (C) or Defect (D). If D: –Individual gets "temptation" payoff T=3 –Inflicts on the N  1 others a "hurt" payoff (H=-1). If C, no payoff to self or others.

2.If D, there is a (randomly chosen) probability S of being seen by each of the other agents. Each other agent decides whether to punish the defector or not. Punishers incur “enforcement" cost E=  2 every time they punish a defector with punishment P=  9.

Strategy of an agent given by: –Boldness (propensity to defect) –Vengefulness (propensity to punish defectors). B, V  [0,1]. S is the probability that a defector will be observed defecting. An agent defects if B > S (probability of being observed). An agent punishes observed defectors with probability V.

A population of 20 agents play the Norms game repeatedly. One round: Every agent has exactly one opportunity to defect, and also the opportunity to observe (and possibly punish) any defection by other players that has taken place in the round. One generation: Four rounds At beginning of each generation, agents’ payoffs set to zero. At end of generation, new generation is created via selection based on payoff and mutation (i.e., genetic algorithm).

Axelrod’s results (Norms) Axelrod performed five independent runs of 100 generations each. Intuitive expectation for results: Vengefulness will evolve to counter boldness.

NetLogo Norms Model

Actual results (Norms model) “This raises the question of just what it takes to get a norm established.” -- Axelrod, 1986 What might help get norm established? Experiment

Metanorms model On each round, if there is defection and non- punishment when it is seen, each agent gets opportunity to meta-punish non-punishers (excluding self and defector). Meta-punishment payoff to non-punisher:  9 Meta-enforcement payoff to enforcer:  2 every time it meta-punishes

Results (Metanorms model) (From Axelrod, 1986)

Results (Metanorms model) “[T]he metanorms game can prevent defections if the initial conditions are favorable enough.“ (i.e., sufficient level of vengefulness) (From Axelrod, 1986)

Conclusion: “Metanorms can promote and sustain cooperation in a population.”

Re-implementation by Galan & Izquierdo (2005) Galan & Izquierdo showed: 1.Long term behavior of model significantly different from short term. 2.Behavior dependent on specific values of parameters (e.g., payoff values, mutation rate) 3.Behavior dependent on details of genetic algorithm (e.g., selection method)

Re-Implementation results (From Galan & Izquierdo, 2005) Norms model: Norm collapse occurs in long-term on all runs. (Norm collapse: average B  6/7, average V  1/7 Norm establishment: average B  2/7, average V  5/7) Proportion of runs out of 1,000

(From Galan & Izquierdo, 2005) Metanorms model: Norm is established early, but collapses in long term on most runs.

(From Galan & Izquierdo, 2005)

Conclusions from Galan & Izquierdo Not intended to be a specific critique of Axelrod’s work. Analysis in G&I paper required a huge amount of computational power, which was not available in Instead, re-implementation exercise shows that we need to: –Replicate models (Axelrod: "Replication is one of the hallmarks of cumulative science.“) –Explore parameter space

Conclusions from Galan & Izquierdo –Run stochastic simulations for long times, many replications –Complement simulation with analytical work whenever possible –Be aware of the scope of our computer models and of the conclusions obtained with them.

El Farol Problem

Brian Arthur’s “El Farol Problem” ntry/120921/Photo_Video_ _medium.jpg?0 Brian Arthur

Self-Organization and Cooperation in Economics

Traditional economics Assumptions:

Self-Organization and Cooperation in Economics Traditional economics Assumptions: Perfectly rational self-interested agents

Self-Organization and Cooperation in Economics Traditional economics Assumptions: Perfectly rational self-interested agents Each has complete knowledge of others’ strategies

Self-Organization and Cooperation in Economics Traditional economics Assumptions: Perfectly rational self-interested agents Each has complete knowledge of others’ strategies Each can do deductive reasoning

Self-Organization and Cooperation in Economics Traditional economics Assumptions: Perfectly rational self-interested agents Each has complete knowledge of others’ strategies Each can do deductive reasoning Result: “Efficiency”: Best possible situation for all

Self-Organization and Cooperation in Economics Traditional economics Assumptions: Perfectly rational self-interested agents Each has complete knowledge of others’ strategies Each can do deductive reasoning Result: “Efficiency”: Best possible situation for all Adam Smith’s “Invisible Hand”

"Every individual necessarily labours to render the annual revenue of the society as great as he can. He generally neither intends to promote the public interest, nor knows how much he is promoting it... He intends only his own gain, and he is in this, as in many other cases, led by an invisible hand to promote an end which was no part of his intention. Nor is it always the worse for society that it was no part of his intention. By pursuing his own interest he frequently promotes that of the society more effectually than when he really intends to promote it. I have never known much good done by those who affected to trade for the public good.” — Adam Smith. An Inquiry into the Nature and Causes of the Wealth of Nations, Adam Smith, 1723−1790

“Complexity” economics:

Self-interested agents with bounded rationality

“Complexity” economics: Self-interested agents with bounded rationality Limited knowledge of others’ strategies

“Complexity” economics: Self-interested agents with bounded rationality Limited knowledge of others’ strategies Each does primarily inductive reasoning

“Complexity” economics: Self-interested agents with bounded rationality Limited knowledge of others’ strategies Each does primarily inductive reasoning Agents adapt over time to ever-changing environment

“Complexity” economics: Self-interested agents with bounded rationality Limited knowledge of others’ strategies Each does primarily inductive reasoning Agents adapt over time to ever-changing environment Traditional economics: Can make predictions with analytic (mathematical) models, assuming “equilibrium” dynamics

“Complexity” economics: Self-interested agents with bounded rationality Limited knowledge of others’ strategies Each does primarily inductive reasoning Agents adapt over time to ever-changing environment Traditional economics: Can make predictions with analytic (mathematical) models, assuming “equilibrium” dynamics Complexity economics: Analytic models often not possible; equilibria are never reached; often need agent-based models with ability to adapt

Brian Arthur’s “El Farol Problem” ntry/120921/Photo_Video_ _medium.jpg?0 Brian Arthur

Brian Arthur’s “El Farol Problem” ntry/120921/Photo_Video_ _medium.jpg?0 Brian Arthur

El Farol Model ntry/120921/Photo_Video_ _medium.jpg?0

El Farol Model ntry/120921/Photo_Video_ _medium.jpg?0 Live Irish music on Thursday nights

El Farol Model ntry/120921/Photo_Video_ _medium.jpg?0 Live Irish music on Thursday nights 60 people will fit comfortably

El Farol Model ntry/120921/Photo_Video_ _medium.jpg?0 Live Irish music on Thursday nights 60 people will fit comfortably 100 people want to go, but only if 60 or less go.

El Farol Model ntry/120921/Photo_Video_ _medium.jpg?0 Live Irish music on Thursday nights 60 people will fit comfortably 100 people want to go, but only if 60 or less go. No prior communication among people.

El Farol Model Live Irish music on Thursday nights 60 people will fit comfortably 100 people want to go, but only if 60 or less go. No prior communication among people. Only information each person has is the number of people that attended on each of the last M Thursdays (everyone uses same value of M). ntry/120921/Photo_Video_ _medium.jpg?0

El Farol Model ntry/120921/Photo_Video_ _medium.jpg?0 Live Irish music on Thursday nights 60 people will fit comfortably 100 people want to go, but only if 60 or less go. No prior communication among people. Only information each person has is the number of people that attended on each of the last M Thursdays (everyone uses same value of M). E.g., if M = 3, you might have this information: three weeks ago: 35, two weeks ago: 76, one week ago: 20

El Farol Model ntry/120921/Photo_Video_ _medium.jpg?0 Live Irish music on Thursday nights 60 people will fit comfortably 100 people want to go, but only if 60 or less go. No prior communication among people. Only information each person has is the number of people that attended on each of the last M Thursdays (everyone uses same value of M). E.g., if M = 3, you might have this information: three weeks ago: 35, two weeks ago: 76, one week ago: 20 Decision for each person: Should I go this Thursday?

El Farol Model ntry/120921/Photo_Video_ _medium.jpg?0 Live Irish music on Thursday nights 60 people will fit comfortably 100 people want to go, but only if 60 or less go. No prior communication among people. Only information each person has is the number of people that attended on each of the last M Thursdays (everyone uses same value of M). E.g., if M = 3, you might have this information: three weeks ago: 35, two weeks ago: 76, one week ago: 20 Decision for each person: Should I go this Thursday? How can all the people “cooperate” without communicating and without rational deductive reasoning?

How a person decides whether to go or not (NetLogo Model)

Each person has some number N of strategies, each of which use the information from past Thursdays to predict attendance this Thursday. (Each person has a possibly different set of strategies.)

How a person decides whether to go or not (NetLogo Model) Each person has some number N of strategies, each of which use the information from past Thursdays to predict attendance this Thursday. (Each person has a possibly different set of strategies.) For example, if N = 3, your strategies might be: –Strategy 1: Predict attendance will be the same as last week –Strategy 2: Predict attendance will be 100 – last week –Strategy 3: Predict attendance will be 0.2 * last week * two weeks ago

How a person decides whether to go or not (NetLogo Model) Each person has some number N of strategies, each of which use the information from past Thursdays to predict attendance this Thursday. (Each person has a possibly different set of strategies.) For example, if N = 3, your strategies might be: –Strategy 1: Predict attendance will be the same as last week –Strategy 2: Predict attendance will be 100 – last week –Strategy 3: Predict attendance will be 0.2 * last week * two weeks ago One of these strategies is the “current best”. This is the one that did the best job of predicting attendance on the previous Thursday.

How a person decides whether to go or not (NetLogo Model) Each person has some number N of strategies, each of which use the information from past Thursdays to predict attendance this Thursday. (Each person has a possibly different set of strategies.) For example, if N = 3, your strategies might be: –Strategy 1: Predict attendance will be the same as last week –Strategy 2: Predict attendance will be 100 – last week –Strategy 3: Predict attendance will be 0.2 * last week * two weeks ago One of these strategies is the “current best”. This is the one that did the best job of predicting attendance on the previous Thursday. You use your “current best” strategy to predict attendance for this Thursday. If it predicts more than 60 people will show up (60 = “overcrowding threshold”), you decide not to go; otherwise you go.

How a person decides whether to go or not (NetLogo Model) Each person has some number N of strategies, each of which use the information from past Thursdays to predict attendance this Thursday. (Each person has a possibly different set of strategies.) For example, if N = 3, your strategies might be: –Strategy 1: Predict attendance will be the same as last week –Strategy 2: Predict attendance will be 100 – last week –Strategy 3: Predict attendance will be 0.2 * last week * two weeks ago One of these strategies is the “current best”. This is the one that did the best job of predicting attendance on the previous Thursday. You use your “current best” strategy to predict attendance for this Thursday. If it predicts more than 60 people will show up (60 = “overcrowding threshold”), you decide not to go; otherwise you go. All other people do the same thing simultaneously and independently, with no communication.

The next video gives the “nitty gritty details” of this model; if you don’t want that much detail, or don’t feel comfortable with math, you can skip it and go to the video after the next one.

The nitty gritty details (optional)

Let N be the number of strategies each person has and let M be the number of weeks for which the attendance number is known. Let t be the current time (week). The previous weeks are thus t −1, t − 2, etc. Let A(t) be the attendance at time t.

The nitty gritty details (optional) Let N be the number of strategies each person has and let M be the number of weeks for which the attendance number is known. Let t be the current time (week). The previous weeks are thus t −1, t − 2, etc. Let A(t) be the attendance at time t. Each strategy has the following form:

The nitty gritty details (optional) Let N be the number of strategies each person has and let M be the number of weeks for which the attendance number is known. Let t be the current time (week). The previous weeks are thus t −1, t − 2, etc. Let A(t) be the attendance at time t. Each strategy has the following form: Each person has N such strategies (where the set of strategies can be different from person to person). One of these strategies is determined to be the “current best”, and is denoted Prediction*(t). Each person makes a decision as follows: If Prediction*(t) > overcrowding-threshold, don’t go; otherwise, go.

The nitty gritty details, continued

Initialization: Each person’s N strategies are initialized with random

The nitty gritty details, continued Initialization: Each person’s N strategies are initialized with random Initial history: The attendance history (previous M time steps) is initialized at random, with values between 0 and 99. (This is so predictions can be made on the first M time steps.)

The nitty gritty details, continued Initialization: Each person’s N strategies are initialized with random Initial history: The attendance history (previous M time steps) is initialized at random, with values between 0 and 99. (This is so predictions can be made on the first M time steps.) Best current strategy: At each time step t, after each person makes their decision and learns the current attendance A(t), each person determines their best current strategy as the one that would have been the best predictor if it had been used on this round. This is the strategy that will be used by the person on the next round.

“El Farol” Model

Assumes “bounded rationality”, limited knowledge

“El Farol” Model Assumes “bounded rationality”, limited knowledge Includes adaptation (inductive learning from experience)

“El Farol” Model Assumes “bounded rationality”, limited knowledge Includes adaptation (inductive learning from experience) Question: Does self-organized “efficiency” (best situation for all) emerge under these conditions?

notes Show NetLogo model

Conclusion The El Farol model demonstrates that self-organized cooperation and “efficiency” are possible without perfect rationality, complete knowledge, and deductive reasoning!