CS 154, Lecture 3: DFANFA, Regular Expressions.

Slides:

Advertisements

Similar presentations

8/27/2009 Sofya Raskhodnikova Intro to Theory of Computation L ECTURE 2 Theory of Computation Finite Automata Operations on languages Nondeterminism L2.1.

Advertisements

January 7, 2015CS21 Lecture 21 CS21 Decidability and Tractability Lecture 2 January 7, 2015.

FORMAL LANGUAGES, AUTOMATA, AND COMPUTABILITY

1 Introduction to Computability Theory Lecture3: Regular Expressions Prof. Amos Israeli.

Finite Automata Great Theoretical Ideas In Computer Science Anupam Gupta Danny Sleator CS Fall 2010 Lecture 20Oct 28, 2010Carnegie Mellon University.

1 Introduction to Computability Theory Lecture4: Regular Expressions Prof. Amos Israeli.

1 Introduction to Computability Theory Lecture3: Regular Expressions Prof. Amos Israeli.

Lecture 3UofH - COSC Dr. Verma 1 COSC 3340: Introduction to Theory of Computation University of Houston Dr. Verma Lecture 3.

1 FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY (For next time: Read Chapter 1.3 of the book)

Costas Busch - RPI1 Single Final State for NFAs. Costas Busch - RPI2 Any NFA can be converted to an equivalent NFA with a single final state.

Courtesy Costas Busch - RPI1 Non Deterministic Automata.

CS 310 – Fall 2006 Pacific University CS310 Regular Expressions Sections:1.3 page 63 September 18, 2006 September 20, 2006.

Lecture 3 Goals: Formal definition of NFA, acceptance of a string by an NFA, computation tree associated with a string. Algorithm to convert an NFA to.

Fall 2004COMP 3351 Single Final State for NFA. Fall 2004COMP 3352 Any NFA can be converted to an equivalent NFA with a single final state.

1 Single Final State for NFAs and DFAs. 2 Observation Any Finite Automaton (NFA or DFA) can be converted to an equivalent NFA with a single final state.

CS5371 Theory of Computation Lecture 6: Automata Theory IV (Regular Expression = NFA = DFA)

Lecture 3 Goals: Formal definition of NFA, acceptance of a string by an NFA, computation tree associated with a string. Algorithm to convert an NFA to.

Fall 2006Costas Busch - RPI1 Non-Deterministic Finite Automata.

CS5371 Theory of Computation Lecture 4: Automata Theory II (DFA = NFA, Regular Language)

1 Regular Languages Finite Automata eg. Supermarket automatic door: exit or entrance.

Costas Busch - LSU1 Non-Deterministic Finite Automata.

FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY

1 A Single Final State for Finite Accepters. 2 Observation Any Finite Accepter (NFA or DFA) can be converted to an equivalent NFA with a single final.

INHERENT LIMITATIONS OF COMPUTER PROGRAMS CSci 4011.

Regular Languages A language is regular over  if it can be built from ;, {  }, and { a } for every a 2 , using operators union ( [ ), concatenation.

Theory of Computation, Feodor F. Dragan, Kent State University 1 Regular expressions: definition An algebraic equivalent to finite automata. We can build.

Introduction to CS Theory Lecture 3 – Regular Languages Piotr Faliszewski

CHAPTER 1 Regular Languages

January 9, 2015CS21 Lecture 31 CS21 Decidability and Tractability Lecture 3 January 9, 2015.

INHERENT LIMITATIONS OF COMPUTER PROGAMS CSci 4011.

CSCI 2670 Introduction to Theory of Computing September 13.

Lecture Notes 

Finite Automata Great Theoretical Ideas In Computer Science Victor Adamchik Danny Sleator CS Spring 2010 Lecture 20Mar 30, 2010Carnegie Mellon.

INHERENT LIMITATIONS OF COMPUTER PROGRAMS CSci 4011.

 2004 SDU Lecture4 Regular Expressions.  2004 SDU 2 Regular expressions A third way to view regular languages. Say that R is a regular expression if.

Regular Languages Chapter 1 Giorgi Japaridze Theory of Computability.

Complexity and Computability Theory I Lecture #5 Rina Zviel-Girshin Leah Epstein Winter

CSE 105 theory of computation

Lexical analysis Finite Automata

Non Deterministic Automata

Properties of Regular Languages

PROPERTIES OF REGULAR LANGUAGES

CSE 105 theory of computation

Regular Expressions Sections:1.3 page 63 September 17, 2008

FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY

Single Final State for NFA

Intro to Theory of Computation

Non-Deterministic Finite Automata

COSC 3340: Introduction to Theory of Computation

4. Properties of Regular Languages

Non-Deterministic Finite Automata

Non Deterministic Automata

CSE 2001: Introduction to Theory of Computation Fall 2009

4b Lexical analysis Finite Automata

CSCI 2670 Introduction to Theory of Computing

FORMAL LANGUAGES, AUTOMATA, AND COMPUTABILITY

CSCI 2670 Introduction to Theory of Computing

Great Theoretical Ideas in Computer Science

Closure Properties of Regular Languages

CS21 Decidability and Tractability

CS21 Decidability and Tractability

CSE 105 theory of computation

Chapter 1 Regular Language

CSCI 2670 Introduction to Theory of Computing

CHAPTER 1 Regular Languages

NFAs accept the Regular Languages

Intro to Theory of Computation

CSE 105 theory of computation

Presentation transcript:

CS 154, Lecture 3: DFANFA, Regular Expressions

Homework 1 is coming out …

Deterministic Finite Automata Computation with finite memory

Non-Deterministic Finite Automata Computation with finite memory and “verified guessing”

Set Q = 2Q From NFAs to DFAs Input: NFA N = (Q, Σ, , Q0, F) Output: DFA M = (Q, Σ, , q0, F) To learn if an NFA accepts, we could do the computation in parallel, maintaining the set of all possible states that can be reached Idea: Set Q = 2Q

From NFAs to DFAs: Subset Construction Input: NFA N = (Q, Σ, , Q0, F) Output: DFA M = (Q, Σ, , q0, F) Q = 2Q  : Q  Σ → Q * (R,) =  ε( (r,) ) rR q0 = ε(Q0) F = { R  Q | f  R for some f  F } * For S  Q, the ε-closure of S is ε(S) = {q | q reachable from some s  S by taking 0 or more ε transitions}

Example of the ε-closure ε({q0}) = {q0 , q1, q2} ε({q1}) = {q1, q2} ε({q2}) = {q2}

N Given: NFA N = ( {1,2,3}, {a,b},  , {1}, {1} ) Construct: Equivalent DFA M = (2{1,2,3}, {a,b}, , {1,3}, …) M N a a b 1 {1,3} {3}  a b b a b b a,b ε a , b {2} 2 3 a {2,3} b a a ε({1}) = {1,3} {1}, {1,2} ? {1,2,3}

Reverse Theorem for Regular Languages Theorem: The reverse of a regular language is also a regular language If a language can be recognized by a DFA that reads strings from right to left, then there is an “normal” DFA that accepts the same language Proof? Given a DFA for a language L, “reverse” its arrows and flip its start and accept states, getting an NFA. Convert that NFA back to a DFA!

Remember this on homework/exams! Using NFAs in place of DFAs can make proofs about regular languages much easier! Remember this on homework/exams!

Union Theorem using NFAs?

Regular Languages are closed under concatenation Concatenation: A  B = { vw | v  A and w  B } Given DFAs M1 and M2, connect the accept states of M1 to the start states of M2 ε 0,1 1 0,1 1 0,1 1 ε L(N) = L(M1)  L(M2)

Regular Languages are closed under star A* = { s1 … sk | k ≥ 0 and each si  A } Let M be a DFA, and let L = L(M) We can construct an NFA N that recognizes L* ε 0,1 1 ε ε

Formally, the construction is: Input: DFA M = (Q, Σ, , q1, F) Output: NFA N = (Q, Σ, , q0, F) Q = Q  {q0} F = F  {q0} (q,a) if q  Q and a ≠ ε {q1} if q  F and a = ε (q,a) = {q1} if q = q0 and a = ε  if q = q0 and a ≠ ε  else

Regular Languages are Closed Under Star How would we prove that this NFA construction works? Want to show: L(N) = L* L(N)  L* L(N)  L*

1. L(N)  L* Assume w = w1…wk is in L* where w1,…,wk  L We show N accepts w by induction on k Base Cases:  k = 0 (w = ε)  k = 1 (w  L) Inductive Step: Assume N accepts all strings v = v1…vk  L*, vi  L Let u = u1…ukuk+1  L* , uj L Since N accepts u1…uk (by induction) and M accepts uk+1, N also accepts u (by construction)

Assume w is accepted by N; we want to show w  L* 2. L(N)  L* Assume w is accepted by N; we want to show w  L* If w = ε, then w  L* ε I.H. N accepts u and takes at most k ε-transitions  u  L* uL(N), so By I.H. u uL* Let w be accepted by N with k+1 ε-transitions. Write w as w=uv, where v is the substring read after the last ε-transition ε v vL w = uv L* accept

Closure Properties for Regular Languages  Union: A  B = { w | w  A or w  B }  Intersection: A  B = { w | w  A and w  B }  Complement: A = { w  Σ* | w  A }  Reverse: AR = { w1 …wk | wk …w1  A, wi  Σ} Concatenation: A  B = { vw | v  A and w  B }  Star: A* = { s1 … sk | k ≥ 0 and each si  A }  Theorem: if A and B are regular then so are: A  B, A  B, A, AR , A  B, and A*

Computation as simple, logical description Regular Expressions Computation as simple, logical description A totally different way of thinking about computation: What is the complexity of describing the strings in the language?

Inductive Definition of Regexp Let Σ be an alphabet. We define the regular expressions over Σ inductively: For all  ∊ Σ,  is a regexp ε is a regexp  is a regexp If R1 and R2 are both regexps, then (R1R2), (R1 + R2), and (R1)* are regexps

* then  then + Precedence Order: Example: R1*R2 + R3 = ( ( R1* ) · R2

Definition: Regexps Represent Languages The regexp  ∊ Σ represents the language {} The regexp ε represents {ε} The regexp  represents  If R1 and R2 are regular expressions representing L1 and L2 then: (R1R2) represents L1  L2 (R1 + R2) represents L1  L2 (R1)* represents L1*

Regexps Represent Languages For every regexp R, define L(R) to be the language that R represents A string w ∊ Σ* is accepted by R (or, w matches R) if w ∊ L(R) Examples: 0, 010, and 01010 match (01)*0 110101110100100 matches (0+1)*0

{ w | w has exactly a single 1 } Assume Σ = {0,1} { w | w has exactly a single 1 } 0*10* { w | w contains 001 } (0+1)*001(0+1)*

the regexp * represent? Assume Σ = {0,1} What language does the regexp * represent? {ε}

{ w | w has length ≥ 3 and its 3rd symbol is 0 } Assume Σ = {0,1} { w | w has length ≥ 3 and its 3rd symbol is 0 } (0+1)(0+1)0(0+1)*

{ w | every odd position in w is a 1 } Assume Σ = {0,1} { w | every odd position in w is a 1 } (1(0 + 1))*(1 + ε)

1 + 0 + ε + 0(0+1)*0 + 1(0+1)*1 Assume Σ = {0,1} { w | w has equal number of occurrences of 01 and 10} = { w | w = 1, w = 0, or w = ε, or w starts with a 0 and ends with a 0, or w starts with a 1 and ends with a 1 } Claim: A string w has equal occurrences of 01 and 10  w starts and ends with the same bit. 1 + 0 + ε + 0(0+1)*0 + 1(0+1)*1

L can be represented by some regexp DFAs ≡ NFAs ≡ Regular Expressions! L can be represented by some regexp  L is regular

L can be represented by some regexp  L is regular

L can be represented by some regexp  L is regular Base Cases (R has length 1): Given any regexp R, we will construct an NFA N s.t. N accepts exactly the strings accepted by R Proof by induction on the length of the regexp R R =  R = ε R = 

Induction Step: Suppose every regexp of length < k represents some regular language. Consider a regexp R of length k > 1 Three possibilities for R: R = R1 + R2 R = R1 R2 R = (R1)*

Consider a regexp R of length k > 1 Induction Step: Suppose every regexp of length < k represents some regular language. Consider a regexp R of length k > 1 Three possibilities for R: R = R1 + R2 By induction, R1 and R2 represent some regular languages, L1 and L2 R = R1 R2 But L(R) = L(R1 + R2) = L1  L2 so L(R) is regular, by the union theorem! R = (R1)*

Consider a regexp R of length k > 1 Induction Step: Suppose every regexp of length < k represents some regular language. Consider a regexp R of length k > 1 Three possibilities for R: R = R1 + R2 By induction, R1 and R2 represent some regular languages, L1 and L2 R = R1 R2 But L(R) = L(R1·R2) = L1· L2 so L(R) is regular by the concatenation theorem R = (R1)*

Consider a regexp R of length k > 1 Induction Step: Suppose every regexp of length < k represents some regular language. Consider a regexp R of length k > 1 Three possibilities for R: R = R1 + R2 By induction, R1 and R2 represent some regular languages, L1 and L2 R = R1 R2 But L(R) = L(R1*) = L1* so L(R) is regular, by the star theorem R = (R1)*

Therefore: If L is represented by a regexp, Induction Step: Suppose every regexp of length < k represents some regular language. Consider a regexp R of length k > 1 Three possibilities for R: R = R1 + R2 By induction, R1 and R2 represent some regular languages, L1 and L2 R = R1 R2 But L(R) = L(R1*) = L1* so L(R) is regular, by the star theorem R = (R1)* Therefore: If L is represented by a regexp, then L is regular

Give an NFA that accepts the language represented by (1(0 + 1))* ε 1 0,1 ε Regular expression: ( )* 1 (0+1)

Generalized NFAs (GNFA) L can be represented by a regexp  L is a regular language  Idea: Transform an NFA for L into a regular expression by removing states and re-labeling the arcs with regular expressions Rather than reading in just letters from the string on a step, we can read in entire substrings

Generalized NFA (GNFA) Is aaabcbcba accepted or rejected? Is bba accepted or rejected? Is bcba accepted or rejected? This GNFA recognizes L(a*b(cb)*a)

Add unique start and accept states NFA Add unique start and accept states

NFA While the machine has more than 2 states: Pick an internal state, rip it out and re-label the arrows with regexps, to account for paths through the missing state 1 01*0

G NFA While the machine has more than 2 states: In general: q2 q3 q1 R(q1,q3) R(q1,q2)R(q2,q2)*R(q2,q3) + R(q1,q3) R(q1,q2) R(q2,q2) R(q2,q3) q1 q2 q3

R(q0,q3) = (a*b)(a+b)* represents L(N) ε q2 a,b ε (a*b)(a+b)* a*b q0 q3 R(q0,q3) = (a*b)(a+b)* represents L(N)

DFAs NFAs DEFINITION Regular Languages Regular Expressions

Parting thoughts: Regular Languages can be defined by their closure properties NFA=DFA, does it mean that non-determinism is free for Finite Automata? Questions?