Myopic and non-myopic agent optimization in game theory, economics, biology and artificial intelligence Michael J Gagen Institute of Molecular Bioscience.

Slides:



Advertisements
Similar presentations
An Introduction to Game Theory Part V: Extensive Games with Perfect Information Bernhard Nebel.
Advertisements

The Basics of Game Theory
Chapter 17: Making Complex Decisions April 1, 2004.
Evolving Cooperation in the N-player Prisoner's Dilemma: A Social Network Model Dept Computer Science and Software Engineering Golriz Rezaei Michael Kirley.
Tutorial 1 Ata Kaban School of Computer Science University of Birmingham.
Introduction to Game Theory Networked Life CSE 112 Spring 2005 Prof. Michael Kearns.
Nash’s Theorem Theorem (Nash, 1951): Every finite game (finite number of players, finite number of pure strategies) has at least one mixed-strategy Nash.
Continuation Methods for Structured Games Ben Blum Christian Shelton Daphne Koller Stanford University.
This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
Mixed Strategies CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte.
Game Theoretical Insights in Strategic Patrolling: Model and Analysis Nicola Gatti – DEI, Politecnico di Milano, Piazza Leonardo.
A short history of equilibrium John Nash and Game Theory.
An Introduction to... Evolutionary Game Theory
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Zero-Sum Games (follow-up)
Game Theory Eduardo Costa. Contents What is game theory? Representation of games Types of games Applications of game theory Interesting Examples.
EKONOMSKA ANALIZA PRAVA. Game Theory Outline of the lecture: I. What is game theory? II. Elements of a game III. Normal (matrix) and Extensive (tree)
Games What is ‘Game Theory’? There are several tools and techniques used by applied modelers to generate testable hypotheses Modeling techniques widely.
5/16/20151 Game Theory Game theory was developed by John Von Neumann and Oscar Morgenstern in Economists! One of the fundamental principles of.
GAME THEORY By Ben Cutting & Rohit Venkat. Game Theory: General Definition  Mathematical decision making tool  Used to analyze a competitive situation.
Eponine Lupo.  Game Theory is a mathematical theory that deals with models of conflict and cooperation.  It is a precise and logical description of.
G A M E T H E O R Y A N D I N C E N T I V E S S ystems Analysis Laboratory Osborne’s quota rule makes the joint optimum an equilibrium OPEC oil cartel.
Slide 1 of 13 So... What’s Game Theory? Game theory refers to a branch of applied math that deals with the strategic interactions between various ‘agents’,
Chapter 9: Economics of Strategy: Game theory
Algoritmi per Sistemi Distribuiti Strategici
Lecture 1 - Introduction 1.  Introduction to Game Theory  Basic Game Theory Examples  Strategic Games  More Game Theory Examples  Equilibrium  Mixed.
Game Theory: Key Concepts Zero Sum Games Zero Sum Games Non – Zero Sum Games Non – Zero Sum Games Strategic Form Games  Lay out strategies Strategic Form.
Games as Systems Administrative Stuff Exercise today Meet at Erik Stemme
Advanced Microeconomics Instructors: Wojtek Dorabialski & Olga Kiuila Lectures: Mon. & Wed. 9:45 – 11:20 room 201 Office hours: Mon. & Wed. 9:15 – 9:45.
Artificial Intelligence for Games and Puzzles1 Games in the real world Many real-world situations and problems.
Job Market Signaling (Spence model)
Intelligence without Reason
Introduction to Game Theory and Behavior Networked Life CIS 112 Spring 2009 Prof. Michael Kearns.
Introduction to Game Theory Yale Braunstein Spring 2007.
Artificial Intelligence for Games and Puzzles1 Games in the real world Many real-world situations and.
On Bounded Rationality and Computational Complexity Christos Papadimitriou and Mihallis Yannakakis.
Game Theory April 9, Prisoner’s Dilemma  One-shot, simultaneous game  Nash Equilibrium (individually rational strategies) is not Pareto Optimal.
Game Theory, Strategic Decision Making, and Behavioral Economics 11 Game Theory, Strategic Decision Making, and Behavioral Economics All men can see the.
5. Alternative Approaches. Strategic Bahavior in Business and Econ 1. Introduction 2. Individual Decision Making 3. Basic Topics in Game Theory 4. The.
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
Presenter: Chih-Yuan Chou GA-BASED ALGORITHMS FOR FINDING EQUILIBRIUM 1.
Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
Game theory & Linear Programming Steve Gu Mar 28, 2008.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Lecture 2: two-person non.
Notes in Game Theory1 Game Theory Overview & Applications Galina Albert Schwartz Department of Finance University of Michigan Business School.
2008/01/30Lecture 11 Game Theory. 2008/01/30Lecture 12 What is Game Theory? Game theory is a field of Mathematics, analyzing strategically inter-dependent.
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
1. Genetic Algorithms: An Overview  Objectives - Studying basic principle of GA - Understanding applications in prisoner’s dilemma & sorting network.
Dynamic games, Stackelburg Cournot and Bertrand
Introduction to Game Theory Presented by 蘇柏穎 2004/12/9 2004/12/9.
Extensive Form (Dynamic) Games With Perfect Information (Theory)
Econ 805 Advanced Micro Theory 1 Dan Quint Fall 2009 Lecture 1 A Quick Review of Game Theory and, in particular, Bayesian Games.
By: Donté Howell Game Theory in Sports. What is Game Theory? It is a tool used to analyze strategic behavior and trying to maximize his/her payoff of.
Advanced Subjects in GT Outline of the tutorials Static Games of Complete Information Introduction to games Normal-form (strategic-form) representation.
1 Fahiem Bacchus, University of Toronto CSC384: Intro to Artificial Intelligence Game Tree Search I ● Readings ■ Chapter 6.1, 6.2, 6.3, 6.6 ● Announcements:
Now that we have set of pure strategies for each player, we need to find the payoffs to put the game in strategic form. Random payoffs. The actual outcome.
Game Theory By Ben Cutting & Rohit Venkat.
Game Theory M.Pajhouh Niya M.Ghotbi
Game Theory and Cooperation
PRISONER’S DILEMMA BERK EROL
Non-additive Security Games
Project BEST Game Theory.
Introduction to Game Theory
Vincent Conitzer CPS Repeated games Vincent Conitzer
Multiagent Systems Repeated Games © Manfred Huber 2018.
Vincent Conitzer Repeated games Vincent Conitzer
Game Theory Lesson 15 Section 65.
M9302 Mathematical Models in Economics
Vincent Conitzer CPS Repeated games Vincent Conitzer
Presentation transcript:

Myopic and non-myopic agent optimization in game theory, economics, biology and artificial intelligence Michael J Gagen Institute of Molecular Bioscience University of Queensland Kae Nemoto Quantum Information Science National Institute of Informatics, Japan

Overview: Functional Optimization in Strategic Economics (and AI) Mathematics / Physics (minimize action)  Formalized by von Neumann and Morgenstern, Theory of Games and Economic Behavior (1944)

Overview: Functional Optimization in Strategic Economics (and AI) Strategic Economics (maximize expected payoff)  Formalized by von Neumann and Morgenstern, Theory of Games and Economic Behavior (1944) Functionals: Fully general Not necessarily continuous Not necessarily differentiable Nb: Implicit Assumption of Continuity !!

von Neumann’s “myopic” assumption Overview: Functional Optimization in Strategic Economics (and AI) Strategic Economics (maximize expected payoff) Evidence: von Neumann & Nash used fixed point theorems in probability simplex equivalent to a convex subset of a real vector space von Neumann and Morgenstern, Theory of Games and Economic Behavior (1944) J. F. Nash, Equilibrium points in n-person games. PNAS, 36(1):48–49 (1950)

Non-myopic Optimization Correlations  Constraints and forbidden regions Overview: Functional Optimization in Strategic Economics (and AI) No communications between players

∞ correlations &  ∞ different trees constraint sets Non-myopic Optimization Overview: Functional Optimization in Strategic Economics (and AI) Myopic  “The” Game Tree lists All play options X Myopic  One Constraint = One Tree “Myopic” Economics (= Physics)

Myopic = Missing Information! Correlation = Information Chess: “Chunking” or pattern recognition in human chess play Experts: · Performance in speed chess doesn’t degrade much · Rapidly direct attention to good moves · Assess less than 100 board positions per move · Eye movements fixate only on important positions · Re-produce game positions after brief exposure better than novices, but random positions only as well as novices Learning Strategy = Learning information to help win game Nemoto: “It is not what they are doing, its what they are thinking!” What Information?

Optimization and Correlations are Non-Commuting! Complex Systems Theory Emergence of Complexity via correlated signals  higher order structure

Optimization and Correlations are Non-Commuting! Life Sciences (Evolutionary Optimization) Selfish Gene Theory Mayr: Incompatibility between biology and physics Rosen: “Correlated” Components in biology, rather than “uncorrelated” parts Mattick: Biology informs information science 6 Gbit DNA program more complex than any human program, implicating RNA as correlating signals allowing multi-tasking and developmental control of complex organisms. Mattick: RNA signals in molecular networks Prokaryotic gene mRNA protein Eukaryotic gene mRNA & eRNA protein networking functions Hidden layer

Optimization and Correlations are Non-Commuting! Economics Selfish independent agents: “homo economicus” Challenges: Japanese Development Economics, Toyota “Just-In-Time” Production System

Optimization and Correlations are Non-Commuting! o = F(i) = F(t,d) = F t (d)  {F(x,y,z), …,F(x,x,z),…} Functional Programming, Dataflow computation, re-write architectures, … o i 1 Player Evolving / Learning Machines (neural and molecular networks) endogenously exploit correlations to alter own decision tree, dynamics and optima

Discrepancies: Myopic Agent Optimization and Observation Heuristic statistics Iterated Prisoner’s DilemmaIterated Ultimatum Game Chain Store Paradox (Incumbent never fights new market entrants)

Sum-Over-Histories or Path Integral formulation Myopic Agent Optimization Normal FormStrategic Form PxPx PyPy von Neumann and Morgenstern (1944): All possible information = All possible move combinations for all histories and all futures ?? 

Optimization Sum over all stages Probability of each path Payoff from each stage for each path Sum over all paths to n th stage Myopic Agent Optimization

Myopic agents (  probability distributions) · uncorrelated · no additional constraints Backwards Induction & Minimax x1x1 y1y1 1-p p 0 ≤ p ≤ 1/2

Non-Myopic Agent Optimization Fully general, notationally emphasized by: Optimization Sum over all correlation strategies Constraint set of each strategy Payoff for each path Sum over all paths given strategy Probability of each strategy Conditioned path probability

Non-Myopic Agent Optimization in the Iterated Prisoner’s Dilemma In 1950 Melvin Dresher and Merrill Flood devised a game later called the Prisoner’s Dilemma Two prisoners are in separate cells and must decide to cooperate or defect Cooperation Defect CKR: Common Knowledge of Rationality Payoff Matrix P y P x C D C (2, 2) (0, 3) D (3,0) (1,1)

Non-Myopic Agent Optimization in the Iterated Prisoner’s Dilemma Myopic agent assumption max 

Non-Myopic Agent Optimization in the Iterated Prisoner’s Dilemma Myopic agents: N max constraints = 0 > 0  P Nx,H N-1 (1) = 1 =1 > 0  P N-1,x,H N-2 (1) = 1 Simultaneous solution  Backwards Induction  myopic agents always defect

Non-Myopic Agent Optimization in the Iterated Prisoner’s Dilemma Correlated Constraints: (deriving Tit For Tat) < 0  P 1x (1) = 0, so P x cooperates < 0  P 1y (1) = 0, so P y cooperates 2 max constraints

Non-Myopic Agent Optimization in the Iterated Prisoner’s Dilemma Families of correlation constraints: k, j index Change of notation: “dot N” = N, “dot dot N” = 2N, “dot dot N -2 ” = 2N-2, etc Optimize via game theory techniques Many constrained equilibria involving cooperation Cooperation is rational in IPD

Further Reading and Contacts Michael J Gagen URL: See: Cooperative equilibria in the finite iterated prisoner's dilemma, K. Nemoto and M. J. Gagen, EconPapers:wpawuwpga/ ( Kae Nemoto URL: