Evolving cooperation in one-time interactions with strangers

Slides:



Advertisements
Similar presentations
Heterogeneous Payoffs and Social Diversity in the Spatial Prisoner’s Dilemma game Dept Computer Science and Software Engineering Golriz Rezaei Dr. Michael.
Advertisements

Evolving Cooperation in the N-player Prisoner's Dilemma: A Social Network Model Dept Computer Science and Software Engineering Golriz Rezaei Michael Kirley.
Evolution of Cooperation The importance of being suspicious.
Solving Collective Commons Problems: Future Scenarios for P2P Finance David Hales, University of Szeged, Hungary Diversity in Macro.
Infinitely Repeated Games. In an infinitely repeated game, the application of subgame perfection is different - after any possible history, the continuation.
6-1 LECTURE 6: MULTIAGENT INTERACTIONS An Introduction to MultiAgent Systems
Maynard Smith Revisited: Spatial Mobility and Limited Resources Shaping Population Dynamics and Evolutionary Stable Strategies Pedro Ribeiro de Andrade.
Cooperation in Anonymous Dynamic Social Networks Brendan Lucier University of Toronto Brian Rogers Northwestern University Nicole Immorlica Northwestern.
Institutions and the Evolution of Collective Action Mark Lubell UC Davis.
A Memetic Framework for Describing and Simulating Spatial Prisoner’s Dilemma with Coalition Formation Sneak Review by Udara Weerakoon.
Evolutionary Games The solution concepts that we have discussed in some detail include strategically dominant solutions equilibrium solutions Pareto optimal.
Evolutionary Games The solution concepts that we have discussed in some detail include strategically dominant solutions equilibrium solutions Pareto optimal.
Project funded by the Future and Emerging Technologies arm of the IST Programme Recent directions in DELIS / Overview of on-going work David Hales
Cooperation through the endogenous evolution of social structure David Hales & Shade Shutters The Open University & Arizona State University
5. Alternative Approaches. Strategic Bahavior in Business and Econ 1. Introduction 2. Individual Decision Making 3. Basic Topics in Game Theory 4. The.
Learning in Multiagent systems
Linking multi-agent simulation to experiments in economy Re-implementing John Duffy’s model of speculative learning agents.
Example Department of Computer Science University of Bologna Italy ( Decentralised, Evolving, Large-scale Information Systems (DELIS)
Rationality meets the tribe: Some models of cultural group selection David Hales, The Open University Hales, D., (2010) Rationality.
Engineering with Sociological Metaphors: Examples and Prospects University of Bologna This work is partially supported by the European.
SLAC and SLACER: Simple copy & rewire algorithms for trust and cooperation in P2P David Hales, Stefano Arteconi, Ozalp Babaoglu University of Bologna,
Project funded by the Future and Emerging Technologies arm of the IST Programme Socially Inspired Approaches to Evolving Cooperation David Hales
Can Tags Build Working Systems? From MABS to ESOA Attempting to apply results gained from Multi-Agent- Based Social Simulation (MABSS)
P2P Interaction in Socially Intelligent ICT David Hales Delft University of Technology (Currently visiting University of Szeged, Hungary)
Putting “tags” to work Attempting to apply results gained from agent- based social simulation (ABSS) to MAS. Dr David Hales
David Hales (University of Bologna) University of Bologna, Italywww.davidhales.com WARNING! Superficial sociological interpretation followed by simplistic.
Emergent Group-Like Selection in a Peer-to-Peer Network ECCS Conference Paris, Nov. 16 th, 2005 David Hales University of Bologna, Italy
Evolving cooperation in one-time interactions with strangers Tags produce cooperation in the single round prisoner’s dilemma and it’s.
Evolving Social Rationality for MAS using “Tags” Trying to “make things work” by applying results gained from Agent-Based Social Simulation.
Project funded by the Future and Emerging Technologies arm of the IST Programme From Selfish Nodes to Cooperative Networks – Emergent Link-based Incentives.
Project funded by the Future and Emerging Technologies arm of the IST Programme Understanding “tag” systems by comparing “tag” models David Hales
The Evolution of Specialisation in Groups – Tags (again!) David Hales Centre for Policy Modelling, Manchester Metropolitan University, UK.
Stabilization of Tag-Mediated Interaction by Sexual Reproduction in an Evolutionary Agent System F. Alkemade D.D.B. van Braget J.A. La Poutr é Copyright.
Game Theory by James Crissey Luis Mendez James Reid.
Socially Inspired Computing Engineering with Social Metaphors.
Evolving P2P overlay networks with Tags, SLAC and SLACER for Cooperation and possibly other things… Saarbrücken SP6 workshop July 19-20th 2005 Presented.
Project funded by the Future and Emerging Technologies arm of the IST Programme Altruism “for free” using Tags David Hales Department.
Simple Rewire Protocols for Cooperation in Dynamic Networks David Hales, Stefano Arteconi, Ozalp Babaoglu University of Bologna, Italy Bio-Inspired Workshop,
Evolving Specialisation, Altruism & Group-Level Optimisation Using Tags – The emergence of a group identity? David Hales Centre for Policy Modelling, Manchester.
Evolving Specialisation, Altruism & Group-Level Optimisation Using Tags David Hales Centre for Policy Modelling, Manchester Metropolitan University, UK.
Project funded by the Future and Emerging Technologies arm of the IST Programme Change your tags fast! - A necessary condition for cooperation? David Hales.
Where to look and why? : Utility and heuristics as explanations of cooperation in one-shot PD games Shigehito Tanida & Toshio Yamagishi Hokkaido University.
Evolution of Cooperation in Mobile Ad Hoc Networks Jeff Hudack (working with some Italian guy)
Emergent Group Selection: Tags, Networks and Society David Hales, The Open University ASU, Thursday, November 29th For more details.
Indirect Reciprocity in the Selective Play Environment Nobuyuki Takahashi and Rie Mashima Department of Behavioral Science Hokkaido University 08/07/2003.
Q 2.1 Nash Equilibrium Ben
Introduction to Genetic Algorithms
Game Theory and Cooperation
Game Theory and Cooperation
Replicator Dynamics.
Dynamic Games of Complete Information
Vincent Conitzer CPS Repeated games Vincent Conitzer
Revisiting “The Evolution of Reciprocity in Sizable Groups”
Evolution for Cooperation
Computer-Mediated Communication
Towards Realistic Models for Evolution of Cooperation
Socially Inspired Approaches to Evolving Cooperation
LECTURE 6: MULTIAGENT INTERACTIONS
Group Selection Design Pattern
The outbreak of cooperation among success-driven individuals under noisy conditions Success-driven migration and imitation as a driver for cooperative.
Self-Organising, Open and Cooperative P2P Societies – From Tags to Networks David Hales Department of Computer Science University of.
Evolution for Cooperation
Multiagent Systems Repeated Games © Manfred Huber 2018.
Altruism “for free” using Tags
Ethical Rules, Games, and Evolution
Evolution of human cooperation without reciprocity
EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF INDUSTRIAL ENGINEERING IENG314 OPERATIONS RESEARCH II SAMIR SAMEER ABUYOUSSEF
When fairness bends rationality: Ernst Fehr meets John Nash
Phase transitions to cooperation in the prisoner‘s dilemma
Vincent Conitzer CPS Repeated games Vincent Conitzer
Presentation transcript:

Evolving cooperation in one-time interactions with strangers David Hales http://www.davidhales.com dave@davidhales.com Evolving cooperation in one-time interactions with strangers Tags produce cooperation in the single round prisoner’s dilemma www.davidhales.com

Why study cooperation? Many hard to explain cooperative interactions in human societies Production of large-scale open artificial agent based systems More generally, how low level entities can come to form internally cooperative higher level entities www.davidhales.com

Assumptions Agents are greedy (change behaviour to maximise utility) Agents are stupid (bounded rationality) Agents are envious (observe if others are getting more utility than themselves) Agents are imitators (copy behaviour of those they envy) www.davidhales.com

The Prisoner’s Dilemma Given: T > R > P > S and 2R > T + S C D Player 1 Player 2 R T R S S P P T www.davidhales.com

Payoff values Temptation T > 1 (say, 1.5) Reward R = 1 Punishment (P) and Sucker (S) set to small values (say, 0.0001 and 0.0002) Hence T > R > P > S and 2R > T + S www.davidhales.com

A one bit agent An agent represented by a single bit A value of “1” indicates the agent will cooperate in a game interaction A value of “0” indicates the agent will defect in a game interaction The value is not visible to other agents www.davidhales.com

An evolutionary algorithm Initialise all agents with randomly selected strategies LOOP some number of generations LOOP for each agent (a) in the population Select a game partner (b) at random from the population Agent (a) and (b) invoke their strategies receiving the appropriate payoff END LOOP Reproduce agents in proportion to their average payoff with some small probability of mutation (M) www.davidhales.com

The obvious result Agents quickly become all defectors A defector always does at least as well as his opponent and sometimes better This is the “Nash Equilibrium” for the single round PD game The evolutionary algorithm therefore evolves the “rational” strategy www.davidhales.com

How can cooperation evolve? Repeated interaction when agents remember the last strategy played by opponent Interaction restricted to spatial neighbours Agents observe the interactions of others before playing themselves (reputation) However, these require agents with the ability to identify individuals or have strict spatial structures imposed on interaction www.davidhales.com

An agent with “tags” Take the “one bit agent” and add extra bits “tags” which have no effect on the strategy played but are observable by other agents 1 1 Tag bits observable Strategy bit not observable www.davidhales.com

Bias interaction by tag Change the evolutionary algorithm so agents bias their interaction towards those sharing the same tag bit pattern When an agent selects a game partner it is allowed some number (F) of refusals if the tags of the partner do not match After F refusals game interaction is forced on the next selected agent During reproduction mutation is applied to both strategy bit and tag bits with same probability www.davidhales.com

Parameter values and measures Population size (N) = 100 Length of tag (L) = [2..64] bits Refusals allowed (F) = 1000 Mutation rate (M) = 0.001 PD payoffs T = [1..2], R =1, P > S = small Execute algorithm for 100,000 generations Measure cooperation as proportion of total game interactions which are mutually cooperative www.davidhales.com

Results Cooperation increases: as T decreases as L increases Each bar an average of 5 runs to 100,000 generations with different initial random number seeds cooperation T T = temptation payoff L = length of tag in bits L

What’s happening? We can consider agents holding identical tags to be sharing the corner of a hyper-cube Interaction is limited to agents sharing a corner (identical tag bits) Therefore cooperative “groups” are emerging in these corners www.davidhales.com

A hypercube for 4 bit tags To visualise the process in time we produce a graph in which each horizontal line represents a single unique corner of the hypercube (set of unique tag bits) We colour each line to indicate if it is occupied by all cooperative, all defective, mixed or no agents

Visualising the process

Visualising the process

What’s happening? Defectors only do better than cooperators if they are in a mixed group (have cooperators to exploit) But by exploiting those cooperators they turn the group into all defectors quickly Agents in an “all defective group” do worse than agents in an “all cooperative group” So long as an all cooperative group exists the agents within it will out perform an all defective group, thus reproducing the group – mutation of tag bits spreads the cooperative group to neigbouring corners of the hypercube www.davidhales.com

Cooperation from total defection If we start the run such that all strategy bits are set to defection, does cooperation evolve? Yes, from observation of the runs, cooperation emerges as soon as two agents sharing tag bits cooperate We can produce a crude analytical model predicting how long before cooperation evolves www.davidhales.com

Cooperation from total defection L=32, m=0.001 Number of agents (n) Generations before cooperation

Some conclusions A very simple mechanism can produce cooperation between strangers in the single round PD game Culturally, the tags can be interpreted as “social cues” or “cultural markers” which identify some kind of cultural group The “groups” exist in an abstract “tag space” not real physical space The easy movement between groups (via mutation and imitation) but strict game interaction within groups is the key to producing high cooperation www.davidhales.com

Future work Links between this model and sociobiological (patch based) analytical models Links between this model and models of the evolution of language Extending the model to examine the conditions under which “refusals” evolve (here they are simply assumed as part of the model) www.davidhales.com

Links with other work at SimSoc V Dittrich, Kron & Banzhaf – action clusters with no preferences Hiroshi Deguchi – Rep. Dynamics, preferences, analytical results – very difficult (for me!) but ultimate aim Hagselmann – systematic parameter space exploration Jager & Janssen – heterogeneous decision mechanisms – not just really dumb or really cleaver www.davidhales.com