An Algorithm for Probabilistic Alternating Simulation University of New South Wales University of Luxembourg Authors: Chenyi Zhang Jun Pang Presentation:

Slides:



Advertisements
Similar presentations
Introduction The concept of transform appears often in the literature of image processing and data compression. Indeed a suitable discrete representation.
Advertisements

Lecture 3 Universal TM. Code of a DTM Consider a one-tape DTM M = (Q, Σ, Γ, δ, s). It can be encoded as follows: First, encode each state, each direction,
Theory of Computing Lecture 23 MAS 714 Hartmut Klauck.
Circuit and Communication Complexity. Karchmer – Wigderson Games Given The communication game G f : Alice getss.t. f(x)=1 Bob getss.t. f(y)=0 Goal: Find.
On Complexity, Sampling, and -Nets and -Samples. Range Spaces A range space is a pair, where is a ground set, it’s elements called points and is a family.
Lecture 24 MAS 714 Hartmut Klauck
Game-theoretic simulation checking tool Peter Bulychev, Vladimir Zakharov, Igor Konnov Moscow State University.
Game-theoretic approach to the simulation checking problem Peter Bulychev Vladimir Zakharov Lomonosov Moscow State University.
Transportation Problem (TP) and Assignment Problem (AP)
COMP 553: Algorithmic Game Theory Fall 2014 Yang Cai Lecture 21.
1 Parallel Parentheses Matching Plus Some Applications.
Gibbs sampler - simple properties It’s not hard to show that this MC chain is aperiodic. Often is reversible distribution. If in addition the chain is.
6.896: Topics in Algorithmic Game Theory Lecture 11 Constantinos Daskalakis.
Determinization of Büchi Automata
Coalition Formation and Price of Anarchy in Cournot Oligopolies Joint work with: Nicole Immorlica (Northwestern University) Georgios Piliouras (Georgia.
Tic Tac Toe Game playing strategies
Algorithms and Networks
TIC-TAC-TOE FELIX CHEN CLUSTER 5: Computers in Biophysics and Robotics.
Microsoft® Small Basic Advanced Games Estimated time to complete this lesson: 1 hour.
PLANNING THE TIC TAC TOE GAME BY NEEL DAVE. TIC TAC TOE INSTRUCTIONS Tic Tac Toe Instructions The basic concept of Tic Tac Toe 1.This is a game for two.
Tutorial 6 of CSCI2110 Bipartite Matching Tutor: Zhou Hong ( 周宏 )
Integer Tic Tac Toe Let’s Begin Rules: 1.Erase all x’s and o’s from the previous game before you begin. 2.Decide which player will be x’s and which will.
Augmenting path algorithm Two theorems to recall: Theorem (Berge). A matching M in a graph G is a maximum matching in G iff G has no M-augmenting.
Parallel Scheduling of Complex DAGs under Uncertainty Grzegorz Malewicz.
New Toads and Frogs Results By Jeff Erickson Presented by Nate Swanson.
Representing a Game Board In a game, we represent the action taking place using an array – In a very simple game, we use individual variables to represent.
Advanced Topics in Algorithms and Data Structures Page 1 Parallel merging through partitioning The partitioning strategy consists of: Breaking up the given.
1 Introduction to Computability Theory Lecture12: Reductions Prof. Amos Israeli.
Limitations of VCG-Based Mechanisms Shahar Dobzinski Joint work with Noam Nisan.
Discrete Mathematics Lecture 4 Harper Langston New York University.
Parallel Merging Advanced Algorithms & Data Structures Lecture Theme 15 Prof. Dr. Th. Ottmann Summer Semester 2006.
Upper Bounds on the Time and Space Complexity of Optimizing Additively Separable Functions Matthew J. Streeter Carnegie Mellon University Pittsburgh, PA.
Aho-Corasick String Matching An Efficient String Matching.
Finite Mathematics & Its Applications, 10/e by Goldstein/Schneider/SiegelCopyright © 2010 Pearson Education, Inc. 1 of 68 Chapter 9 The Theory of Games.
Study Group Randomized Algorithms Jun 7, 2003 Jun 14, 2003.
11 -1 Chapter 11 Randomized Algorithms Randomized algorithms In a randomized algorithm (probabilistic algorithm), we make some random choices.
Transparency No. 8-1 Formal Language and Automata Theory Chapter 8 DFA state minimization (lecture 13, 14)
Computational aspects of stability in weighted voting games Edith Elkind (NTU, Singapore) Based on joint work with Leslie Ann Goldberg, Paul W. Goldberg,
Great Theoretical Ideas in Computer Science.
Theory of Computing Lecture 19 MAS 714 Hartmut Klauck.
Polynomial Factorization Olga Sergeeva Ferien-Akademie 2004, September 19 – October 1.
Great Theoretical Ideas in Computer Science.
Chapter 10: Iterative Improvement Simplex Method The Design and Analysis of Algorithms.
CSC 213 – Large Scale Programming Lecture 3: Object-Oriented Analysis.
Presenter: Jen Hua Chi Adviser: Yeong Sung Lin Network Games with Many Attackers and Defenders.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 11.
1 2. Independence and Bernoulli Trials Independence: Events A and B are independent if It is easy to show that A, B independent implies are all independent.
Markov Decision Processes1 Definitions; Stationary policies; Value improvement algorithm, Policy improvement algorithm, and linear programming for discounted.
11 -1 Chapter 11 Randomized Algorithms Randomized Algorithms In a randomized algorithm (probabilistic algorithm), we make some random choices.
Rules “rules, play, culture”. COSC 4126 rules Rules of Tic-Tac-Toe 1.Play occurs on a 3 by 3 grid of 9 squares. 2.Two players take turns marking empty.
3.2 Matchings and Factors: Algorithms and Applications This copyrighted material is taken from Introduction to Graph Theory, 2 nd Ed., by Doug West; and.
Flows in Planar Graphs Hadi Mahzarnia. Outline O Introduction O Planar single commodity flow O Multicommodity flows for C 1 O Feasibility O Algorithm.
Part 3 Linear Programming
CSP: Algorithms and Dichotomy Conjecture Andrei A. Bulatov Simon Fraser University.
Matching Algorithms and Networks. Algorithms and Networks: Matching2 This lecture Matching: problem statement and applications Bipartite matching Matching.
Exercise 1 Consider a language with the following tokens and token classes: ID ::= letter (letter|digit)* LT ::= " " shiftL ::= " >" dot ::= "." LP ::=
Introduction to State Space Search
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 8.
XO Who gets the square? XO XO XO XO XO XO XO XO Let’s start the game Click here to learn how to play. Click on a character.
TU/e Algorithms (2IL15) – Lecture 12 1 Linear Programming.
Krishnendu ChatterjeeFormal Methods Class1 MARKOV CHAINS.
Algorithms and Networks
Chapter 6 : Game Search 게임 탐색 (Adversarial Search)
Alternating tree Automata and Parity games
Memoryless Determinacy of Parity Games
Witzzle Pro The Rules of the Game.
Haskell Tips You can turn any function that takes two inputs into an infix operator: mod 7 3 is the same as 7 `mod` 3 takeWhile returns all initial.
3.IV. Change of Basis 3.IV.1. Changing Representations of Vectors
2019 SAIMC Puzzle Challenge General Regulations
Presentation transcript:

An Algorithm for Probabilistic Alternating Simulation University of New South Wales University of Luxembourg Authors: Chenyi Zhang Jun Pang Presentation: Tim Muller

Goal Introduce games and simulations thereon. – Relevant simulation is PA- I -simulation. Introduce General Coarsest Partition Problem. Link between GCPP and PA- I -simulation. Solve the GCPP with polynomial algorithm. – Obtain largest probabilistic alternating simulation relation.

Example of a game Probabilistic simultaneous tic-tac-toe: – Similar to tic-tac-toe: Board consists of nine squares. At most one player can mark a square. Three in a row wins (vertical, horizontal or diagonal). – Turns are simultaneous. – A player picks a favorite and an alternative square: If the favorites differ, both players mark their favorite. If the favorites are the same, each player has a 50% chance of marking their favorite, the other player marks his alternative. – If there is only one square left, and there is no winner, then the game is over, and it’s a draw.

(5,7) (5,1) (3,8) (4,8) (6,9) (6,1) (5,7) (5,1) Example run Player I Player II

Probabilistic game structure A probabilistic game structure consists of: – A finite set of states S, with an initial state s 0. – A finite set of joint actions Act = Act I ×Act II, Act I is the set of actions for player I. – A labeling function L :S → 2 Prop – A transition function δ : S×Act → D (S)

Example (ctd.) A state is a board with equal number of O’s and X’s, and squares marked at most once. – Initial state is the empty board. An action for one player is any pair (n,m), with 1 ≤ n, m ≤ 9 and n ≠ m. Predicate P 1 holds iff there is a line of three O’s, and P 2 holds iff there is a line of three X’s. The pair s, ((n,m), (n′,m′)) maps to U(s) if n, m, n′ or m′ is marked in s or if P 1 or P 2 holds. Otherwise: – The pair s, ((n,m), (n′,m′)) maps to U (s[O on n; X on n′]) if n ≠ n′. – The pair s, ((n,m), (n,m′)) maps to U (s[O on n; X on m′], s[O on m; X on n]).

Mixed actions A mixed action of player I is a distribution over Act I, denoted with π  Π I. δ(s,(π 1, π 2 ))(t) = ∑ a 1  Act I, a 2  Act II : π 1 (a 1 )  π 2 (a 2 )  δ(s,(a 1,a 2 ))(t)

Simulation If A simulates B, then every move by B can be matched by A. – Probabilistic: From R to R – Alternating: If A I -simulates B, then every move of I in B can be matched by a move of I in A, such that for every counter- move of II in A there is a counter-move of II in B. s1s1 s2s2 t1t1 t2t2 t3t3 ½ ½ ⅓ ⅓ ⅓ ⅓ ⅓ ⅙ ⅙

Probabilistic alternating I-simulation A PA- I -simulation is a relation   S×S satisfying: if s t, then: – L(s) = L(t) – For all π 1  Π I, there exists π 1 ′  Π I, such that for all π 2 ′  Π II, there exists π 2  Π II, such that δ(s,(π 1,π 2 )) δ(t,(π 2 ′, π 2 ′)) If s t and t s, then s and t are PA- I -simulation equivalent.

Examples of PA-I-simulations 7,22,71,66,1 PA-I-simulation equivalent!

Examples of PA-I-simulations PA- I -simulation equivalent! ?,?4,8 Tie All ties, wins and losses are PA- I -simulation equivalent.

General Coarsest Partition Problem We want to partition a set of states into blocks, where all states in a block are PA- I - simulation equivalent. We want to define a partial order, , such that if P  Q, then every state in Q simulates every state in P. We want to find the coarsest such partition and partial order (partition pair).

From GCPP to PA-I-simulation We write [s] Σ for the block containing s. Define a relation (Σ,  )  S×S as s (Σ,  ) t iff [s] Σ  [t] Σ If (Σ,  ) is the coarsest partition pair, then s (Σ,  ) t is the largest PA-I-simulation. – There is a unique coarsest partition pair.

Sketch of approach Start with a partition pair that is too coarse. – If s t then [s] Σ  [t] Σ Keep refining blocks in the partition pair, until nothing changes. – Make sure not to refine too much! Return the partition pair.

Main procedure

Sketch of refining blocks Input is a partition pair (Σ,  ) and a block B E. Start with a partition pair (Σ B,  B ) that is to fine. Keep merging blocks until (Σ B,  B ) stops changing: – For any pair of blocks: If a state s in A can simulate a state t in B under (Σ,  ), and vice versa, then A and B should be the same block. If a state s in A can simulate a state t in B under (Σ,  ), but not vice versa, then A  B should hold (and vice versa). Return (Σ B,  B )

Refinement procedure

Correctness Define an operator ρ on partition pairs, by letting ρ((Σ,  )) be the coarsest partition pair (Σ′,  ′), finer than (Σ,  ) that is stable on (Σ,  ). – If (Σ 1,  1 ) is finer than (Σ 2,  2 ), then (Σ 1,  1 ) is stable on (Σ 1,  1 ), when for all P, Q  Σ 1 with P  1 Q and s  P and t  Q, s (Σ,  ) t. Running Split on each block in Σ, for (Σ,  ) is equal to ρ((Σ,  )). Theorem 1: If (Σ,  ) is the fixed point of ρ((Σ 0, Id)) then (Σ,  ) is the largest PA- I -simulation. 2 2

Polynomial run-time CanSim can be reduced to |Act I | applications of CanFollow. CanFollow can be reduced to a linear program (Lemma 9). The foreach in split is trivially polynomial. For the while in split, note that there are at most O(number of blocks) iterations. The same argument applies to the main GCPP algorithm.

Example (ctd.) State space reduction: Winning strategy: Block

Conclusion Probabilistic game structures are a relevant (interesting) class of structures. PA-I-simulations offer reductions that help answer (some) questions about a PGS. There is a polynomial algorithm that finds the strongest PA-I-simulation.

Questions?