Tutorial 1 Ata Kaban School of Computer Science University of Birmingham.

Slides:

Advertisements

Similar presentations

Advertisements

PowerPoint Slides by Robert F. BrookerCopyright (c) 2001 by Harcourt, Inc. All rights reserved. Strategic Behavior Game Theory –Players –Strategies –Payoff.

RL - Worksheet -worked exercise- Ata Kaban School of Computer Science University of Birmingham.

Genetic Algorithms Genetic Programming Ata Kaban School of Computer Science University of Birmingham 2003.

Vincent Conitzer CPS Repeated games Vincent Conitzer

Infinitely Repeated Games

Worksheet I. Exercise Solutions Ata Kaban School of Computer Science University of Birmingham.

Evolution and Repeated Games D. Fudenberg (Harvard) E. Maskin (IAS, Princeton)

1 Evolution & Economics No Evolutionary Stability in Repeated Games Played by Finite Automata K. Binmore & L. Samuelson J.E.T Automata.

6-1 LECTURE 6: MULTIAGENT INTERACTIONS An Introduction to MultiAgent Systems

An Introduction to... Evolutionary Game Theory

Game Theory Advertising Example 1. Game Theory What is the optimal strategy for Firm A if Firm B chooses to advertise? 2.

Infinitely Repeated Games Econ 171. Finitely Repeated Game Take any game play it, then play it again, for a specified number of times. The game that is.

Game Theory. Games Oligopolist Play ▫Each oligopolist realizes both that its profit depends on what its competitor does and that its competitor’s profit.

Automata-based adaptive behavior for economic modeling using game theory Rawan Ghnemat, Khalaf Khatatneh, Saleh Oqeili Al-Balqa’ Applied University, Al-Salt,

By: James Parsons Game theory Mathematics Game Theory.

On Quantum Walks and Iterated Quantum Games G. Abal, R. Donangelo, H. Fort Universidad de la República, Montevideo, Uruguay UFRJ, RJ, Brazil.

Prisoner’s dilemma TEMPTATION>REWARD>PUNISHMENT>SUCKER.

Prisoner’s Dilemma. The scenario In the Prisoner’s Dilemma, you and Lucifer are picked up by the police and interrogated in separate cells without the.

5/16/20151 VII. Cooperation & Competition The Iterated Prisoner’s Dilemma.

EC – Tutorial / Case study Iterated Prisoner's Dilemma Ata Kaban University of Birmingham.

The Prisoner’s Dilemma -Both arrested during botched bank robbery. -Kept in separate cells – NO COMMUNICATION. -Offered separate deals if they confess.

Institutions and the Evolution of Collective Action Mark Lubell UC Davis.

Evolving New Strategies The Evolution of Strategies in the Iterated Prisoner’s Dilemma 01 / 25.

Cooperation Reciprocators, Cheaters, and Everyone Else.

Story time! Robert Axelrod. Contest #1 Call for entries to game theorists All entrants told of preliminary experiments 15 strategies = 14 entries + 1.

Review: Game theory Dominant strategy Nash equilibrium

Cs301fs07a5 The Iterated Prisoner’s Dilemma. Overview Two criminals get caught and choose to cooperate or defect to alter their sentence Two criminals.

Nash Equilibrium Econ 171. Suggested Viewing A Student’s Suggestion: Video game theory lecture Open Yale Economics Ben Pollack’s Game Theory Lectures.

APEC 8205: Applied Game Theory Fall 2007

PRISONER’S DILEMMA By Ajul Shah, Hiten Morar, Pooja Hindocha, Amish Parekh & Daniel Castellino.

1 (Intro to) Evolutionary Computation Lecture 1: Overview of EC Ata Kaban School of Computer Science.

On Bounded Rationality and Computational Complexity Christos Papadimitriou and Mihallis Yannakakis.

Reinforcement Learning (1)

Anonymizing Web Services Through a Club Mechanism With Economic Incentives Mamata Jenamani Leszek Lilien Bharat Bhargava Department of Computer Sciences.

Peter B. Henderson Butler University

Changing Perspective… Common themes throughout past papers Repeated simple games with small number of actions Mostly theoretical papers Known available.

A Game-Theoretic Approach to Strategic Behavior. Chapter Outline ©2015 McGraw-Hill Education. All Rights Reserved. 2 The Prisoner’s Dilemma: An Introduction.

Punishment and Forgiveness in Repeated Games. A review of present values.

Co-evolution time, changing environments agents. Static search space  solutions are determined by the optimization process only, by doing variation and.

Lecture 8: 24/5/1435 Genetic Algorithms Lecturer/ Kawther Abas 363CS – Artificial Intelligence.

Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.

Presenter: Chih-Yuan Chou GA-BASED ALGORITHMS FOR FINDING EQUILIBRIUM 1.

ART – Artificial Reasoning Toolkit Evolving a complex system Marco Lamieri

Finite Iterated Prisoner’s Dilemma Revisited: Belief Change and End Game Effect Jiawei Li (Michael) & Graham Kendall University of Nottingham.

1/67 Institute of Information Science, Academia Sinica Research Assistant: Lin, Hsin-Nan 林信男.

Section 2 – Ec1818 Jeremy Barofsky

1. Genetic Algorithms: An Overview  Objectives - Studying basic principle of GA - Understanding applications in prisoner’s dilemma & sorting network.

Robert Axelrod’s Tournaments Robert Axelrod’s Tournaments, as reported in Axelrod, Robert. 1980a. “Effective Choice in the Prisoner’s Dilemma.” Journal.

Game Theory by James Crissey Luis Mendez James Reid.

1. Genetic Algorithms: An Overview 4 학습목표 GA 의 기본원리를 파악하고, Prisoner’s dilemma 와 sorting network 에의 응용 및 이론적 배경을 이해한 다.

UUCF Summer RE 2011 Brain Glitches Session 4: Prisoner’s Dilemma.

Iterated Prisoner’s Dilemma Game in Evolutionary Computation Seung-Ryong Yang.

Evolving Strategies for the Prisoner’s Dilemma Jennifer Golbeck University of Maryland, College Park Department of Computer Science July 23, 2002.

Game Theory Dr. Andrew L. H. Parkes “Economics for Business (2)” 卜安吉.

Taylor. Na, Amy. Hong, Brian. Sang, Luck Yoon. What is the Game Theory? intelligent rational decision-makers Game theory is "the study of mathematical.

Advanced Subjects in GT Outline of the tutorials Static Games of Complete Information Introduction to games Normal-form (strategic-form) representation.

An Evolutionary Algorithm for Neural Network Learning using Direct Encoding Paul Batchis Department of Computer Science Rutgers University.

GENETIC ALGORITHM By Siti Rohajawati. Definition Genetic algorithms are sets of computational procedures that conceptually follow steps inspired by the.

PRISONER’S DILEMMA BERK EROL

USING MICROBIAL GENETIC ALGORITHM TO SOLVE CARD SPLITTING PROBLEM.

Evolving New Strategies

Project BEST Game Theory.

tit-for-tat algorithm

Introduction to Game Theory

BEC 30325: MANAGERIAL ECONOMICS

LECTURE 6: MULTIAGENT INTERACTIONS

1. Genetic Algorithms: An Overview

Introduction to RePast and Tutorial I

Presentation transcript:

Tutorial 1 Ata Kaban School of Computer Science University of Birmingham

Iterated Prisoner's Dilemma Invented by Merrill Flood & Melvin Dresher in 1950s Studied in game theory, economics, political science The story –Alice and Bob arrested, no communication between them –They are offered a deal: If any of them confesses & testifies against the other then gets suspended sentence while the other gets 5 years in prison If both confess & testify against the other, they both get 4 years If none of them confesses then they both get 2 years –What is the best strategy for maximising one’s own payoff?

Abstract formulation through a payoff matrix Player A Player B CooperateDefect Cooperate3,30,5 Defect5,01,1

2 tournaments – participants have sent strategies Human strategies played against each other Winner: TIT FOR TAT –Cooperates as long the other player does not defect –Defects on defection until the other player begins to cooperate again Can GA evolve a better strategy?

Individuals = strategies How to encode a strategy by a string? Let memory depth of previous moves=1 Fix a canonical order of cases: AB –Case 1:CC –Case 2:CD –Case 3:DC –Case 4:DD e.g. strategy encoding (for A): ‘CDCD’

Now let memory dept of previous moves=3 –How many cases? ……… Case 1: …….. Case 2: …….. … –How many letters are needed to encode a strategy as a string? …………… –How many strategies there are? …………. Is that a large number?

Experiment 1 –40 runs with different random initialisations –50 generations each –Population of 20 –Fitness=avg score over all games played –A fixed environment of 8 human-designed strategies Results –Found better strategy than those 8 strategies in the environment! –Even though – how many strategies were only tested in a run out of all possible strategies? …………… –What does this result mean? …………….

Experiment 2 –changing environment: the evolving strategies played against each other. Results – Found strategies similar in essence with the winner human-designed strategy Idealised model of evolution & co-evolution

When to use Evolutionary Algorithms? The space to be searched is large –What if it is not? The space to be searched is known not to be perfectly smooth and unimodal –Otherwise? The search space is not well understood –If it is? Quickly finding a sufficiently good solution is enough Noisy data –Think about it, why?