Stochastic Games

Presentation transcript:

Stochastic ω-regular Games
Krishnendu Chatterjee*, Luca de Alfaro**, Thomas A. Henzinger*, Marcin Jurdzinski***, Rupak Majumdar****
* EECS, Berkeley, ** CE, UCSC, *** University of Warwick, **** CS, UCLA
November 18, 2004

Stochastic Games
- Games played on graphs with stochastic transitions
- Markov decision processes: games against nature
- Turn-based games: games against adversary
- Turn-based stochastic games: games against nature and adversary
- Concurrent games: simultaneous games

Objectives
- ω-regular: generalization of classical regular languages to infinite strings
- Specify properties like reachability, safety, fairness, liveness
- Canonical representations of such objectives:
  - Rabin objectives
  - Streett objectives
  - Rabin-chain objectives
- Simpler objectives: reachability, safety

Computational issues
- Maximal value with which players can win

Example of game graphs
[Figure: a turn-based stochastic game with Pl. 1, Pl. 2, and Pl. random vertices, and a concurrent game with edges labeled "ac,bd", "bc", and "ad". Player 1 actions: a, b. Player 2 actions: c, d.]

Results: complexity
- Complexity of turn-based stochastic games [CdAH04a]:
  - NP-complete for Rabin objectives
  - coNP-complete for Streett objectives
  - NP ∩ coNP for Rabin-chain objectives
  - Previous best known results: 3EXPTIME
- Existence of simple optimal strategies, hence simple controllers for stochastic reactive systems
- Complexity of concurrent games [CdAH04b]:
  - NP ∩ coNP for Rabin-chain objectives
  - Previous best known: 3EXPTIME
- Strategy classification:
  - Complete the precise requirements of optimal strategies in terms of memory and randomization
  - Characterize several interesting properties of optimal strategies

[Figure: prior results arranged by game complexity and objective complexity. Emerson-Jutla'88: turn-based games with Rabin objectives. Condon'92: turn-based stochastic games with reachability objectives. Also shown: turn-based stochastic games with Rabin objectives.]
[Figure: complexity improvement. deAlfaroMajumdar01: 3EXPTIME algorithm. Complexity: NP ∩ coNP.]

Results: nonzero-sum games
- Notion of nonzero-sum games: objectives are not complementary
- Concept of rationality in nonzero-sum games: Nash equilibrium
- Existence of Nash equilibrium in stochastic games: open problem
- Results in nonzero-sum stochastic games [CMJ04]:
  - Concurrent games:
    - Existence of ε-Nash equilibrium for reachability objectives, for all ε > 0
    - Complexity of computing equilibrium values: NP
  - Turn-based stochastic games:
    - Existence of ε-Nash equilibrium for all Borel objectives, for all ε > 0
    - Existence of Nash equilibrium for ω-regular objectives
  - Existence of Nash equilibrium for Borel objectives for turn-based games
- Refined notion of equilibria [CHJ04]:
  - Turn-based nonzero-sum games with adversarial external criteria
  - Relevant from a verification perspective
  - Existence of a unique equilibrium payoff profile, whereas there can be several Nash equilibrium payoff profiles
  - Computability for ω-regular objectives
  - In the same complexity class as zero-sum games

Future directions of research
- Relation of the refined notion of equilibria and assume-guarantee verification
- Notion of bounded rationality in concurrent games
- Identify the class of objectives for which simple optimal strategies exist in turn-based stochastic games
- Study the complexity of verifying several other classes of objectives relevant to the verification of quantitative properties
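
The "maximal value with which players can win" can be made concrete for the simplest objective on the slides, reachability in a turn-based stochastic game. The sketch below is only an illustration and not the method of the cited results: it approximates the value by plain value iteration from below, which is a standard technique but is not the NP ∩ coNP procedure the slides refer to. The function name, the vertex labels, the `owner`/`succ` encoding, and the example game are all hypothetical.

```python
def reachability_values(owner, succ, target, max_iterations=10_000, tol=1e-12):
    """Approximate, per vertex, the maximal probability with which the
    maximizing player can force a visit to `target`.

    owner[v] is "max" (Player 1), "min" (Player 2), or "rand" (random vertex).
    succ[v] is a list of successors for "max"/"min" vertices and a list of
    (probability, successor) pairs for "rand" vertices.
    This encoding is an assumption made for the sketch.
    """
    # Start from the lower bound: 1 on target vertices, 0 everywhere else.
    value = {v: (1.0 if v in target else 0.0) for v in owner}
    for _ in range(max_iterations):
        new = {}
        for v in owner:
            if v in target:
                new[v] = 1.0
            elif owner[v] == "max":
                new[v] = max(value[u] for u in succ[v])
            elif owner[v] == "min":
                new[v] = min(value[u] for u in succ[v])
            else:  # random vertex: expected value over successors
                new[v] = sum(p * value[u] for p, u in succ[v])
        delta = max(abs(new[v] - value[v]) for v in owner)
        value = new
        if delta < tol:
            break
    return value


# Hypothetical example game: Player 1 moves at "s", Player 2 at "b",
# and "a", "c" are random vertices.
owner = {"s": "max", "a": "rand", "b": "min", "c": "rand",
         "goal": "rand", "sink": "rand"}
succ = {
    "s": ["a", "b"],
    "a": [(0.5, "goal"), (0.5, "sink")],
    "b": ["c", "goal"],
    "c": [(0.3, "goal"), (0.7, "sink")],
    "goal": [(1.0, "goal")],
    "sink": [(1.0, "sink")],
}
example_values = reachability_values(owner, succ, target={"goal"})
```

In this example Player 2 steers "b" toward the less favorable random vertex "c", so Player 1 does better by moving from "s" to "a". Value iteration is used here only because it is short to state; it yields an approximation and says nothing about the complexity bounds listed in the results.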