Presentation is loading. Please wait.

Presentation is loading. Please wait.

FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY 15-453.

Similar presentations


Presentation on theme: "FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY 15-453."— Presentation transcript:

1 FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY 15-453

2 CONTEXT-FREE GRAMMARS AND PUSH-DOWN AUTOMATA

3 Σ = {0, 1}, L = { 0 n 1 n | n ≥ 0 } Σ = {a, b, c, …, z}, L = { w | w = w R } Σ = { (, ) }, L = { balanced strings of parens } NONE OF THESE ARE REGULAR (), ()(), (()()) are in L, (, ()), ())(() are not in L

4 PUSHDOWN AUTOMATA (PDA) FINITE STATE CONTROL STACK (Last in, first out) INPUT

5 ε,ε → $ 0,ε → 0 1,0 → ε ε,$ → ε inputpoppush 0011 STACK$ 0011011 $0 11 $ 0 1 Non-deterministic

6 ε,ε → $ 0,ε → 0 1,0 → ε ε,$ → ε inputpoppush 001 STACK$$0$ 0 1 01001 PDA that recognizes L = { 0 n 1 n | n ≥ 0 }

7 Definition: A (non-deterministic) PDA is a tuple P = (Q, Σ, Γ, , q 0, F), where: Q is a finite set of states Γ is the stack alphabet q 0  Q is the start state F  Q is the set of accept states Σ is the input alphabet  : Q  Σ ε  Γ ε → 2 Q  Γ ε 2 Q is the set of subsets of Q and Σ ε = Σ  {ε}

8 Let w  Σ* and suppose w can be written as w 1... w n where w i  Σ ε (recall Σ ε = Σ  {ε}) Then P accepts w if there are r 0, r 1,..., r n  Q and s 0, s 1,..., s n  Γ* (sequence of stacks) such that 1.r 0 = q 0 and s 0 = ε ( P starts in q 0 with empty stack) 2.For i = 0,..., n-1: (r i+1, b)   (r i, w i+1, a), where s i =at and s i+1 = bt for some a, b  Γ ε and t  Γ* (P moves correctly according to state, stack and symbol read) 3. r n  F (P is in an accept state at the end of its input)

9 ε,ε → $ 0,ε → 0 1,0 → ε ε,$ → ε q0q0 q1q1 q2q2 q3q3 Q = {q 0, q 1, q 2, q 3 }Γ =Σ =  : Q  Σ ε  Γ ε → 2 Q  Γ ε {0,1}{$,0,1}  (q 1,1,0) = { (q 2,ε) }  (q 2,1,1) = 

10 EVEN-LENGTH PALINDROMES Σ = {a, b, c, …, z} ε,ε → $ ε,ε → εε,ε → ε , → ε, → ε ε,$ → ε q0q0 q1q1 q2q2 q3q3 ,ε → ,ε → 

11 Build a PDA to recognize L = { a i b j c k | i, j, k ≥ 0 and (i = j or i = k) } ε,ε → $ b,a → ε ε,$ → ε q0q0 q5q5 q1q1 q3q3 a,ε → a q2q2 q4q4 ε,ε → ε q6q6 ε,$ → ε b,ε → ε c,a → ε c,ε → ε choose i=j choose i=k

12 A → 0A1 A → B B → # CONTEXT-FREE GRAMMARS A variables terminals production rules start variable  0A1  (yields)  00A11  00B11  00#11 Non-deterministic A  * 00#11 (derives) Derivation We say: 00#11 is generated by the Grammar

13 CONTEXT-FREE GRAMMARS A → 0A1 A → B B → # A → 0A1 | B B → #

14 CONTEXT-FREE GRAMMARS A context-free grammar (CFG) is a tuple G = (V, Σ, R, S), where: V is a finite set of variables R is set of production rules of the form A → W, where A  V and W  (V  Σ)* S  V is the start variable Σ is a finite set of terminals (disjoint from V) L(G) = {w  Σ* | S  * w} Strings Generated by G

15 CONTEXT-FREE LANGUAGES A context-free grammar (CFG) is a tuple G = (V, Σ, R, S), where: V is a finite set of variables R is set of production rules of the form A → W, where A  V and W  (V  Σ)* S  V is the start variable Σ is a finite set of terminals (disjoint from V) G = { {S}, {0,1}, R, S }R = { S → 0S1, S → ε } L(G) =

16 CONTEXT-FREE LANGUAGES A context-free grammar (CFG) is a tuple G = (V, Σ, R, S), where: V is a finite set of variables R is set of production rules of the form A → W, where A  V and W  (V  Σ)* S  V is the start variable Σ is a finite set of terminals (disjoint from V) G = { {S}, {0,1}, R, S }R = { S → 0S1, S → ε } L(G) = { 0 n 1 n | n ≥ 0 } Strings Generated by G

17 WRITE A CFG FOR EVEN-LENGTH PALINDROMES S →  S  for all   Σ S → ε

18 WRITE A CFG FOR THE EMPTY SET G = { {S}, Σ, , S }

19 PARSE TREES A  0A1  00A11  00B11  00#11 A B 01# A A 01

20 → + → x → ( ) → a Build a parse tree for a + a x a ax+a a a+xa a

21 Definition: a string is derived ambiguously in a context-free grammar if it has more than one parse tree Definition: a grammar is ambiguous if it generates some string ambiguously See G 4 for unambiguous standard arithmetic precedence

22 Σ = {0, 1}, L = { 0 n 1 n | n ≥ 0 } WHAT ABOUT? But L is CONTEXT FREE NOT REGULAR Σ = {0, 1}, L 1 = { 0 n 1 n 0 m | m,n ≥ 0 } Σ = {0, 1}, L 2 = { 0 n 1 m 0 n | m,n ≥ 0 } Σ = {0, 1}, L 3 = { 0 m 1 n 0 n | m=n ≥ 0 } A → 0A1 A → ε

23 THE PUMPING LEMMA FOR CFGs Let L be a context-free language Then there is a P such that if w  L and |w| ≥ P 1. |vy| > 0 then can write w = uvxyz, where: 3. For every i ≥ 0, uv i xy i z  L 2. |vxy| ≤ P

24 Idea of Proof: If w is long enough, then any parse tree for w must have a path that contains a variable more than once T R R uvxzy T uz R R v y R R vxy

25 If the height of a parse tree is h, the length of the string generated by that tree is at most: Formal Proof: Let b be the maximum number of symbols on the right-hand side of any rule bhbh Let |V| be the number of variables in G Define P = b |V|+2 Let w be a string of length at least P Let T be a parse tree for w with a minimum number of nodes. T must have height at least |V|+2

26 Let T be a parse tree for w with a minimum number of nodes. T must have height at least |V|+2 The longest path in T must have ≥ |V|+1 variables Select R to be the variable that repeats among the lowest |V|+1 variables (in the path) T R R uvxzy T uz R R v y R R vxy 1. |vy| > 0 2. |vxy| ≤ P

27 WHAT ABOUT? Σ = {0, 1}, L 1 = { 0 n 1 n 0 m | m, n ≥ 0 } Σ = {0, 1}, L 2 = { 0 n 1 m 0 n | m, n ≥ 0 } Σ = {0, 1}, L 3 = { 0 m 1 n 0 n | m=n ≥ 0 }

28 THE PUMPING LEMMA Let L be a context-free language Then there exists P such that if w  L and |w| ≥ P 1. |vy| > 0 then w = uvxyz, where: 3. uv i xy i z  L for any i ≥ 0 2. |vxy| ≤ P

29 Idea: If w is long enough, then any parse tree for w must have a path that contains a variable more than once T R R uvxzy T uz R R v y R R vxy

30 If the height of a parse tree is h, the length of the string generated is at most: Formal Proof: Let b be the maximum number of symbols on the right-hand side of a rule bhbh Let |V| be the number of variables in G Define P = b |v|+1 Let w be a string of length at least P (= b |v|+1 ) Let T be the parse tree for w with the smallest number of nodes. T must have height at least |V|+1

31

32

33 A Language L is generated by a CFG  L is recognized by a PDA EQUIVALENCE OF CFGs and PDAs

34 Read the rest of Chapter 2 for next time


Download ppt "FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY 15-453."

Similar presentations


Ads by Google