Uri Zwick Tel Aviv University

Slides:



Advertisements
Similar presentations
The Primal-Dual Method: Steiner Forest TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A AA A A A AA A A.
Advertisements

Weighted Matching-Algorithms, Hamiltonian Cycles and TSP
Uri Zwick Tel Aviv University Simple Stochastic Games Mean Payoff Games Parity Games.
C&O 355 Lecture 6 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A.
GAME THEORY.
C&O 355 Lecture 23 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A.
C&O 355 Mathematical Programming Fall 2010 Lecture 22 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A.
COMP 553: Algorithmic Game Theory Fall 2014 Yang Cai Lecture 21.
Energy and Mean-Payoff Parity Markov Decision Processes Laurent Doyen LSV, ENS Cachan & CNRS Krishnendu Chatterjee IST Austria MFCS 2011.
Mini-course on algorithmic aspects of stochastic games and related models Marcin Jurdziński (University of Warwick) Peter Bro Miltersen (Aarhus University)
Simple Stochastic Games and Propositional Proof Systems Toniann Pitassi Joint work with Lei Huang University of Toronto.
C&O 355 Mathematical Programming Fall 2010 Lecture 20 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
The Theory of NP-Completeness
C&O 355 Mathematical Programming Fall 2010 Lecture 21 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
Krishnendu Chatterjee1 Partial-information Games with Reachability Objectives Krishnendu Chatterjee Formal Methods for Robotics and Automation July 15,
Randomness for Free Laurent Doyen LSV, ENS Cachan & CNRS joint work with Krishnendu Chatterjee, Hugo Gimbert, Tom Henzinger.
Concurrent Reachability Games Peter Bro Miltersen Aarhus University 1CTW 2009.
1 Maximum matching in graphs with an excluded minor Raphael Yuster University of Haifa Uri Zwick Tel Aviv University TexPoint fonts used in EMF. Read the.
Uri Zwick – Tel Aviv Univ. Randomized pivoting rules for the simplex algorithm Lower bounds TexPoint fonts used in EMF. Read the TexPoint manual before.
On the randomized simplex algorithm in abstract cubes Jiři Matoušek Charles University Prague Tibor Szabó ETH Zürich.
Uri Zwick – Tel Aviv Univ. Randomized pivoting rules for the simplex algorithm Upper bounds TexPoint fonts used in EMF. Read the TexPoint manual before.
Complexity 16-1 Complexity Andrei Bulatov Non-Approximability.
Design and Analysis of Algorithms
Lecture 20: April 12 Introduction to Randomized Algorithms and the Probabilistic Method.
Stochastic Games Krishnendu Chatterjee CS 294 Game Theory.
C&O 355 Mathematical Programming Fall 2010 Lecture 17 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
MAKING COMPLEX DEClSlONS
C&O 355 Lecture 2 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A.
The Theory of NP-Completeness 1. Nondeterministic algorithms A nondeterminstic algorithm consists of phase 1: guessing phase 2: checking If the checking.
1 The Theory of NP-Completeness 2012/11/6 P: the class of problems which can be solved by a deterministic polynomial algorithm. NP : the class of decision.
חישוביות וסיבוכיות Computability and Complexity Lecture 7 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A AAAA.
Energy Parity Games Laurent Doyen LSV, ENS Cachan & CNRS Krishnendu Chatterjee IST Austria.
C&O 355 Mathematical Programming Fall 2010 Lecture 19 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
1 Joe Meehean.  Log: binary search in sorted array  Linear: traverse a tree  Log-Linear: insert into a heap  Quadratic (N 2 ): your sort from P1 
CSE 326: Data Structures NP Completeness Ben Lerner Summer 2007.
Uri Zwick Tel Aviv University Simple Stochastic Games Mean Payoff Games Parity Games TexPoint fonts used in EMF. Read the TexPoint manual before you delete.
Week 10Complexity of Algorithms1 Hard Computational Problems Some computational problems are hard Despite a numerous attempts we do not know any efficient.
CSCI 3160 Design and Analysis of Algorithms Tutorial 10 Chengyu Lin.
Ásbjörn H Kristbjörnsson1 The complexity of Finding Nash Equilibria Ásbjörn H Kristbjörnsson Algorithms, Logic and Complexity.
Institute for Applied Information Processing and Communications (IAIK) 1 TU Graz/Computer Science/IAIK Graz, 2009 AK Design and Verification Presentation.
NPC.
The Theory of NP-Completeness 1. Nondeterministic algorithms A nondeterminstic algorithm consists of phase 1: guessing phase 2: checking If the checking.
C&O 355 Lecture 19 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A.
Theory of Computational Complexity Probability and Computing Lee Minseon Iwama and Ito lab M1 1.
Krishnendu ChatterjeeFormal Methods Class1 MARKOV CHAINS.
The Theory of NP-Completeness
Game Theory Just last week:
The Multiple Dimensions of Mean-Payoff Games
Markov Decision Processes
Stochastic and non-Stochastic Games – a survey
Computability and Complexity
Convergence, Targeted Optimality, and Safety in Multiagent Learning
Uri Zwick Tel Aviv University
Analysis of Algorithms
Secular session of 2nd FILOFOCS April 10, 2013
Uri Zwick – Tel Aviv Univ.
Alternating tree Automata and Parity games
Thomas Dueholm Hansen – Aarhus Univ. Uri Zwick – Tel Aviv Univ.
Oliver Friedmann – Univ. of Munich Thomas Dueholm Hansen – Aarhus Univ
Discounted Deterministic Markov Decision Processes
Uri Zwick – Tel Aviv Univ.
Analysis of Algorithms
CS 188: Artificial Intelligence Fall 2007
Memoryless Determinacy of Parity Games
The Theory of NP-Completeness
The Selection Problem.
Markov Decision Processes
Collaboration in Repeated Games
Markov Decision Processes
Presentation transcript:

Uri Zwick Tel Aviv University Deterministic subexponential algorithm for Parity Games Uri Zwick Tel Aviv University China Theory Week 2007 2007理论计算机科学明日 之星中国论坛 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAA

Simple Stochastic Games Mean Payoff Games Parity Games

A simple Simple Stochastic Game R R

Simple Stochastic game (SSGs) Reachability version [Condon (1992)] MAX RAND min MAX-sink min-sink Two Players: MAX and min Objective: MAX/min the probability of getting to the MAX-sink

Simple Stochastic game (SSGs) [Condon (1992)] Terminating binary games The outdegrees of all non-sinks are 2 All probabilities are ½. The game terminates with prob. 1

Simple Stochastic games (SSGs) Basic properties : Every vertex in the game has a value v Both players have positional optimal strategies Positional strategy for MAX: choice of an outgoing edge from each MAX vertex Decision version: Is value v

“Solving” terminating binary SSGs The values vi of the vertices of a game are the unique solution of the following equations: The values are rational numbers requiring only a linear number of bits Corollary: Decision version in NP  co-NP

Simple Stochastic game (SSGs) Payoff version [Shapley (1953)] MAX RAND min Limiting average version Discounted version

Markov Decision Processes (MDPs) MAX RAND min Theorem: [Derman (1970)] Values and optimal strategies of a MDP can be found by solving an LP

NP  co-NP – Another proof Deciding whether the value of a game is at least (at most) v is in NP  co-NP To show that value  v , guess an optimal strategy  for MAX Find an optimal counter-strategy  for min by solving the resulting MDP. Is the problem in P ?

Mean Payoff Games (MPGs) [Ehrenfeucht, Mycielski (1979)] MAX RAND min Non-terminating version Discounted version COCOON 1995 Reachability SSGs MPGs (PZ’96) Pseudo-polynomial algorithm (PZ’96)

Mean Payoff Games (MPGs) [Ehrenfeucht, Mycielski (1979)] Again, both players have optimal positional strategies. Value – average of the cycle

Selecting the second largest element with only four storage locations [PZ’96]

Non-emptyness of -tree automata modal -calculus model checking Parity Games (PGs) ODD 8 Priorities EVEN 3 EVEN wins if largest priority seen infinitely often is even Equivalent to many interesting problems in automata and verification: Non-emptyness of -tree automata modal -calculus model checking

Parity Games (PGs) A simple example 2 3 2 1 4 1

Mean Payoff Games (MPGs) Parity Games (PGs) Mean Payoff Games (MPGs) [Stirling (1993)] [Puri (1995)] ODD 8 EVEN 3 Replace priority k by payoff (n)k Move payoffs to outgoing edges

A randomized subexponential algorithm for simple stochastic games

Simple Stochastic games (SSGs) Switches … A switch is a change of strategy at a single vertex A switch is profitable for MAX if it increases the value of the game (sum of values of all vertices) A strategy is optimal iff no switch is profitable

A randomized subexponential algorithm for binary SSGs [Ludwig (1995)] [Kalai (1992)] [Matousek-Sharir-Welzl (1992)] Start with an arbitrary strategy  for MAX Choose a random vertex iVMAX Find the optimal strategy ’ for MAX in the game in which the only outgoing edge of i is (i,(i)) If switching ’ at i is not profitable, then ’ is optimal Otherwise, let  (’)i and repeat

Would never be switched ! A randomized subexponential algorithm for binary SSGs [Ludwig (1995)] [Kalai (1992)] [Matousek-Sharir-Welzl (1992)] MAX vertices All correct ! Would never be switched ! There is a hidden order of MAX vertices under which the optimal strategy returned by the first recursive call correctly fixes the strategy of MAX at vertices 1,2,…,i

Positions 1,..,i where switched and would never be switched again. The hidden order Let vi be the value of the optimal strategy for MAX that agrees with σ on i Order the vertices such that Positions 1,..,i where switched and would never be switched again.

A deterministic subexponential algorithm for parity games

Exponential algorithm for PGs [McNaughton (1993)] [Zielonka (1998)] Vertices of highest priority (even) First recursive call Vertices from which EVEN can force the game to enter A Lemma: (i) (ii)

Exponential algorithm for PGs [McNaughton (1993)] [Zielonka (1998)] Second recursive call In the worst case, both recursive calls are on games of size n1

Idea: Look for small dominions! Deterministic subexponential alg for PGs Jurdzinski, Paterson, Z (2006) Idea: Look for small dominions! Second recursive call Dominions of size s can be found in O(ns) time Dominion Dominion: A (small) set from which one of the players can win without the play ever leaving this set

Open problems Polynomial algorithms? Faster subexponential algorithms for parity games? Deterministic subexponential algorithms for MPGs and SSGs? Faster pseudo-polynomial algorithms for MPGs?