Use or disclosure of the information contained herein is subject to specific written CIRA approval 1 PURSUIT – EVASION GAMES GAME THEORY AND ANALYSIS OF.

Slides:



Advertisements
Similar presentations
The Maximum Principle: Continuous Time Main purpose: to introduce the maximum principle as a necessary condition that must be satisfied by any optimal.
Advertisements

An Efficient Dynamic Auction for Heterogeneous Commodities (Lawrence M.Ausubel - september 2000) Authors: Oren Rigbi Damian Goren.
6.896: Topics in Algorithmic Game Theory Lecture 20 Yang Cai.
Boyce/DiPrima 9th ed, Ch 2.8: The Existence and Uniqueness Theorem Elementary Differential Equations and Boundary Value Problems, 9th edition, by William.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
Unità di Perugia e di Roma “Tor Vergata” "Uncertain production systems: optimal feedback control of the single site and extension to the multi-site case"
Congestion Games with Player- Specific Payoff Functions Igal Milchtaich, Department of Mathematics, The Hebrew University of Jerusalem, 1993 Presentation.
Game theory.
Models for Control and Verification Ian Mitchell Department of Computer Science The University of British Columbia research supported by National Science.
How Bad is Selfish Routing? By Tim Roughgarden Eva Tardos Presented by Alex Kogan.
ECO290E: Game Theory Lecture 9 Subgame Perfect Equilibrium.
Properties of State Variables
Part 3: The Minimax Theorem
Equilibrium Concepts in Two Player Games Kevin Byrnes Department of Applied Mathematics & Statistics.
EC941 - Game Theory Prof. Francesco Squintani Lecture 8 1.
Christos alatzidis constantina galbogini.  The Complexity of Computing a Nash Equilibrium  Constantinos Daskalakis  Paul W. Goldberg  Christos H.
A camper awakens to the growl of a hungry bear and sees his friend putting on a pair of running shoes, “You can’t outrun a bear,” scoffs the camper. His.
Differential Game Theory Notes by Alberto Bressan.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Review Midterm3/19 3/12.
INSTITUTO DE SISTEMAS E ROBÓTICA Minimax Value Iteration Applied to Robotic Soccer Gonçalo Neto Institute for Systems and Robotics Instituto Superior Técnico.
An Introduction to Game Theory Part III: Strictly Competitive Games Bernhard Nebel.
Nash Q-Learning for General-Sum Stochastic Games Hu & Wellman March 6 th, 2006 CS286r Presented by Ilan Lobel.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
An Introduction to Black-Box Complexity
Finite Mathematics & Its Applications, 10/e by Goldstein/Schneider/SiegelCopyright © 2010 Pearson Education, Inc. 1 of 68 Chapter 9 The Theory of Games.

Chapter 6 APPLICATIONS TO PRODUCTION AND INVENTORY Hwang, Fan, Erickson (1967) Hendricks et al (1971) Bensoussan et al (1974) Thompson-Sethi (1980) Stochastic.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Chapter 3 The Maximum Principle: Mixed Inequality Constraints Mixed inequality constraints: Inequality constraints involving control and possibly state.
Boyce/DiPrima 10th ed, Ch 10.5: Separation of Variables; Heat Conduction in a Rod Elementary Differential Equations and Boundary Value Problems, 10th.
CHAPTER 7 NON-LINEAR CONDUCTION PROBLEMS
A Projection Framework for Near- Potential Polynomial Games Nikolai Matni Control and Dynamical Systems, California.
MAKING COMPLEX DEClSlONS
Chapter 9 Games with Imperfect Information Bayesian Games.
AUTOMATIC CONTROL THEORY II Slovak University of Technology Faculty of Material Science and Technology in Trnava.
1 Performance Evaluation of Computer Networks: Part II Objectives r Simulation Modeling r Classification of Simulation Modeling r Discrete-Event Simulation.
3.1. Strategic Behavior Matilde Machado.
Corporate Banking and Investment Risk tolerance and optimal portfolio choice Marek Musiela, BNP Paribas, London.
Game Theory in Wireless and Communication Networks: Theory, Models, and Applications Lecture 3 Differential Game Zhu Han, Dusit Niyato, Walid Saad, Tamer.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 4.1.Dynamic Games of Incomplete Information Lecture
Markov Decision Processes1 Definitions; Stationary policies; Value improvement algorithm, Policy improvement algorithm, and linear programming for discounted.
Games with Imperfect Information Bayesian Games. Complete versus Incomplete Information So far we have assumed that players hold the correct belief about.
Dynamic Games & The Extensive Form
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Lecture 2: two-person non.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Introduction Giovanni Neglia.
Lecture 7 and 8 The efficient and optimal use of natural resources.
Dynamic Programming Discrete time frame Multi-stage decision problem Solves backwards.
1. 2 Some details on the Simplex Method approach 2x2 games 2xn and mx2 games Recall: First try pure strategies. If there are no saddle points use mixed.
1. 2 You should know by now… u The security level of a strategy for a player is the minimum payoff regardless of what strategy his opponent uses. u A.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 22.
Bayesian Brain: Probabilistic Approaches to Neural Coding Chapter 12: Optimal Control Theory Kenju Doya, Shin Ishii, Alexandre Pouget, and Rajesh P.N.Rao.
Boundary-Value Problems in Rectangular Coordinates
EC941 - Game Theory Prof. Francesco Squintani Lecture 6 1.
5.1.Static Games of Incomplete Information
Chapter 4 The Maximum Principle: General Inequality Constraints.
Space Complexity. Reminder: P, NP classes P is the class of problems that can be solved with algorithms that runs in polynomial time NP is the class of.
Introduction and Preliminaries D Nagesh Kumar, IISc Water Resources Planning and Management: M4L1 Dynamic Programming and Applications.
2.5 The Fundamental Theorem of Game Theory For any 2-person zero-sum game there exists a pair (x*,y*) in S  T such that min {x*V. j : j=1,...,n} =
ECO290E: Game Theory Lecture 10 Examples of Dynamic Games.
UNIVERSITA’ DEGLI STUDI NAPOLI FEDERICO II DOTTORATO IN INGEGNERIA DEI MATERIALI E DELLE STRUTTURE Brunella Corrado Filomena Gioiella Bernadette Lombardi.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.1.Dynamic Games of Complete and Perfect Information Lecture
Ch 9.6: Liapunov’s Second Method In Section 9.3 we showed how the stability of a critical point of an almost linear system can usually be determined from.
Stackleberg-Nash Equilibrium Presentation: Belov Nikolay.
Authors: Oren Rigbi Damian Goren
Optimal Control and Reachability with Competing Inputs
Basic Concepts, Necessary and Sufficient conditions
Stability Analysis of Linear Systems
Chapter 5: Morse functions and function-induced persistence
1st semester a.y. 2018/2019 – November 22, 2018
Presentation transcript:

Use or disclosure of the information contained herein is subject to specific written CIRA approval 1 PURSUIT – EVASION GAMES GAME THEORY AND ANALYSIS OF COMPETITIVE DYNAMICS FOR INDUSTRIALSYSTEMS PURSUIT - EVASION GAMES E. De Lellis Dottorato in Ingegneria Aerospaziale, XXIV ciclo L. Garbarino Dottorato in Ingegneria Aerospaziale, XXVI ciclo A. Vitale Dottorato in Ingegneria Aerospaziale, XXV ciclo

Use or disclosure of the information contained herein is subject to specific written CIRA approval 2 PURSUIT – EVASION GAMES  Differential Games  Pursuit – Evasion Games  Feedback Nash Equilibrium Definition  Dynamic Programming and Hamilton Jacobi Bellman Equation  Finding Feedback Nash Equilibrium for PE Games  Simulation Example Outline

Use or disclosure of the information contained herein is subject to specific written CIRA approval 3 PURSUIT – EVASION GAMES Differential Games: Problem Formulation (1/2) u i is the control implemented by the i-th player, f is continuous in t, u i, and is continuously differentiate in x The i-th player aims at maximizing his own payoff ψ (terminal payoff) is continuous in T and is continuously differentiate in x(T) L (running cost) is continuous in t, u i, and is continuously differentiate in x Differential games constitute a class of decision problems wherein the evolution of the state is described by a differential equation and the players act throughout a time interval. Let x S subset of R N describe the state of the system, evolving in time according to the ODE:

Use or disclosure of the information contained herein is subject to specific written CIRA approval 4 PURSUIT – EVASION GAMES Differential Games: Problem Formulation (2/2) Differential games could be continuous time or discrete time finite time or infinite time if finite time, the duration could be fixed pre-specified or the end point in both state and time could be variable, for example: The saddle-point, Nash and Stackelberg equilibrium solution concepts are still valid for dynamic games

Use or disclosure of the information contained herein is subject to specific written CIRA approval 5 PURSUIT – EVASION GAMES Differential Games: Available Information The strategy adopted by a player depends on the information available to him at each time We assume that each player has perfect knowledge of The function f determining the evolution of the system, the initial state x 0 and the sets U i of control values available to the two players The payoff functions J i The instantaneous time t [0, T] (i.e. both players have a clock). Moreover we distinguish the following cases: Open Loop strategies Feedback (or Markovian) strategies Hierarchical Play Delayed Information

Use or disclosure of the information contained herein is subject to specific written CIRA approval 6 PURSUIT – EVASION GAMES Pursuit Evasion Games Pursuit-Evasion games are two-person deterministic zero-sum differential games defined by a dynamics and a target The target is a subset of R N. In the pursuit-evasion game, the first player tries to maintain the state of the system as long as possible outside of the target while the second player aims at reaching C as soon as possible (capturability) Sufficiency conditions for the for existence of feedback saddle-point equilibrium strategy are provided by a natural two-person extension of the Hamilton-Jacobi-Bellman equation (called the "Isaacs equation“) However, other strategies could be possible

Use or disclosure of the information contained herein is subject to specific written CIRA approval 7 PURSUIT – EVASION GAMES Zero-Sum Games A two players game where the payoffs are is called zero-sum game The goal of the first player is to maximize this payoff, while the second player wishes to minimize it For a zero-sum game the Nash equilibrium coincides with saddle point :

Use or disclosure of the information contained herein is subject to specific written CIRA approval 8 PURSUIT – EVASION GAMES Feedback Nash Equilibrium for Differential Games A set of control functions (t, x) → (u * 1 (t, x),… u * k (t, x)) is a Nash equilibrium for the differential game within the class of feedback strategies if for each i the control function (t, x) → u * i (t, x) provides an optimal feedback in connection with the optimal control problem for i-th player : for the system with dynamics

Use or disclosure of the information contained herein is subject to specific written CIRA approval 9 PURSUIT – EVASION GAMES Dynamic Programming (1/2) Consider the optimization the following problem maximize: given where U is a compact domain and the function f is continuous w.r.t. all variables and continuously differentiable w.r.t. x. Moreover there exists a constant C such that

Use or disclosure of the information contained herein is subject to specific written CIRA approval 10 PURSUIT – EVASION GAMES Dynamic Programming (2/2) Let introduce the value function Principle of Dynamic Programming: for any initial data x 0 R and 0 ≤ t 0 < t 1 < T, one has

Use or disclosure of the information contained herein is subject to specific written CIRA approval 11 PURSUIT – EVASION GAMES Hamilton Jacobi Bellman Equation It is not easy to compute V. Moreover, the continuous differentiability assumption imposed on V is rather restrictive. Nevertheless, if such a function exists, then the HJB equation provides a means of obtaining the optimal control strategy Theorem If a continuously differentiable function V can be found that satisfies the HJB equation subject to the boundary condition V(T, x) = ψ(T, x), then it generates the optimal strategy through the static (pointwise) maximization problem defined by the RHS of the Hamiltonian According the dynamic programming approach, under the assumption of continuous differentiability of V, the value function for the given optimization problem satisfies the Hamilton Jacobi Bellman equation

Use or disclosure of the information contained herein is subject to specific written CIRA approval 12 PURSUIT – EVASION GAMES NE for Zero-Sum Games For a two-person zero-sum differential game the HJB equation is called Isaacs equation and can be rewrite as follows: A pair of strategies (u 1, u 2 ) provides a feedback saddle-point solution if there exists a function V : [0,T] x R N —> R satisfying the Isaacs PDE Interchangeability of the min and max operations in the Isaacs equation is often referred to as the Isaacs condition. It holds is both f and L are separable in u 1 and u 2

Use or disclosure of the information contained herein is subject to specific written CIRA approval 13 PURSUIT – EVASION GAMES Sufficient Condition for Zero-Sum NE Strategy Theorem: If : I.a continuously differentiable function V(t, x) exists that satisfies the Isaacs equation, II. V(T, x) = ψ(T, x) on the boundary of the target set, defined by l(t, x) = 0, III.either u 1 *(t, x) or u 2 *(t, x), as derived from Isaac equation, generates trajectories that terminate in finite time (whatever u 1 * or u 2 *, is), then V(t, x) is the value function and the pair u 1 *(t, x) or u 2 *(t, x) constitutes a saddle point This theorem is valid for pursuit-evasion games for which the duration of the game is not fixed, but is compute through where Λ is a closed subset, called the target set, and its surface is characterized by the scalar function l(t,x) = 0

Use or disclosure of the information contained herein is subject to specific written CIRA approval 14 PURSUIT – EVASION GAMES Necessary Condition for Zero-Sum NE Strategy (1/2) Theorem: Given a two-person zero-sum differential game, suppose that the pair (u 1 *, u 2 *) provides a saddle-point solution in feedback strategies, with x*(t) denoting the corresponding state trajectory. Furthermore, let its open-loop representation {u i (t) = u i (t,x*(t)), i = 1,2} also provide a saddle-point solution (in open-loop policies). Then there exists a costate function p(.): [0,T] —> R N such that the following relations are satisfied: where

Use or disclosure of the information contained herein is subject to specific written CIRA approval 15 PURSUIT – EVASION GAMES Simulation Example

Use or disclosure of the information contained herein is subject to specific written CIRA approval 16 PURSUIT – EVASION GAMES Simulation Example: Homicidal Chauffeur Game Coordinates Transformation: New Refs Pursuit position as origin and X-axis along Pursuit Direction

Use or disclosure of the information contained herein is subject to specific written CIRA approval 17 PURSUIT – EVASION GAMES Assuption: Instantaneous Evasor rotation Target Set Usable Part of the boundary

Use or disclosure of the information contained herein is subject to specific written CIRA approval 18 PURSUIT – EVASION GAMES The Barrier: those initial points that allow solution of the problem and Isaac Solution

Use or disclosure of the information contained herein is subject to specific written CIRA approval 19 PURSUIT – EVASION GAMES Final Condition Solution

Use or disclosure of the information contained herein is subject to specific written CIRA approval 20 PURSUIT – EVASION GAMES Solution