RegExp = (DFA,NFA,NFAe) Sipser pages 66-76.

Slides:



Advertisements
Similar presentations
4b Lexical analysis Finite Automata
Advertisements

Regular Expressions and DFAs COP 3402 (Summer 2014)
1 CD5560 FABER Formal Languages, Automata and Models of Computation Lecture 2 Mälardalen University 2005.
1 1 CDT314 FABER Formal Languages, Automata and Models of Computation Lecture 3 School of Innovation, Design and Engineering Mälardalen University 2012.
FORMAL LANGUAGES, AUTOMATA, AND COMPUTABILITY
1 Introduction to Computability Theory Lecture3: Regular Expressions Prof. Amos Israeli.
1 Introduction to Computability Theory Lecture4: Regular Expressions Prof. Amos Israeli.
1 Introduction to Computability Theory Lecture3: Regular Expressions Prof. Amos Israeli.
1 FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY (For next time: Read Chapter 1.3 of the book)
Courtesy Costas Busch - RPI1 Non Deterministic Automata.
1 Regular Expressions. 2 Regular expressions describe regular languages Example: describes the language.
Lecture 3: Closure Properties & Regular Expressions Jim Hook Tim Sheard Portland State University.
CSC 3130: Automata theory and formal languages Andrej Bogdanov The Chinese University of Hong Kong Regular.
1 Single Final State for NFAs and DFAs. 2 Observation Any Finite Automaton (NFA or DFA) can be converted to an equivalent NFA with a single final state.
CS5371 Theory of Computation Lecture 6: Automata Theory IV (Regular Expression = NFA = DFA)
79 Regular Expression Regular expressions over an alphabet  are defined recursively as follows. (1) Ø, which denotes the empty set, is a regular expression.
Fall 2006Costas Busch - RPI1 Non-Deterministic Finite Automata.
1 Non-Deterministic Automata Regular Expressions.
Introduction to Finite Automata Adapted from the slides of Stanford CS154.
Fall 2004COMP 3351 Another NFA Example. Fall 2004COMP 3352 Language accepted (redundant state)
Costas Busch - LSU1 Non-Deterministic Finite Automata.
1 A Single Final State for Finite Accepters. 2 Observation Any Finite Accepter (NFA or DFA) can be converted to an equivalent NFA with a single final.
Regular Expressions. Notation to specify a language –Declarative –Sort of like a programming language. Fundamental in some languages like perl and applications.
Theory of Computation, Feodor F. Dragan, Kent State University 1 Regular expressions: definition An algebraic equivalent to finite automata. We can build.
1 Regular Expressions. 2 Regular expressions describe regular languages Example: describes the language.
Prof. Busch - LSU1 NFAs accept the Regular Languages.
1 CD5560 FABER Formal Languages, Automata and Models of Computation Lecture 3 Mälardalen University 2010.
Regular Expressions and Languages A regular expression is a notation to represent languages, i.e. a set of strings, where the set is either finite or contains.
CHAPTER 1 Regular Languages
Lecture # 12. Nondeterministic Finite Automaton (NFA) Definition: An NFA is a TG with a unique start state and a property of having single letter as label.
Chapter 3 Regular Expressions, Nondeterminism, and Kleene’s Theorem Copyright © 2011 The McGraw-Hill Companies, Inc. Permission required for reproduction.
Regular Language Algorithms. Algorithms We have seen many algorithms for manipulating computational systems for the regular languages. Each treats these.
CFL Big Picture. Context Free Languages Conclusion We have studied the class of context free languages (CFL) We saw two different ways to express a CFL.
1 CD5560 FABER Formal Languages, Automata and Models of Computation Lecture 3 Mälardalen University 2007.
Regular Expressions Section 1.3 (also 1.1, 1.2) CSC 4170 Theory of Computation.
Chapter 5 Finite Automata Finite State Automata n Capable of recognizing numerous symbol patterns, the class of regular languages n Suitable for.
1 Language Recognition (11.4) Longin Jan Latecki Temple University Based on slides by Costas Busch from the courseCostas Busch
CSCI 4325 / 6339 Theory of Computation Zhixiang Chen.
 2004 SDU Lecture4 Regular Expressions.  2004 SDU 2 Regular expressions A third way to view regular languages. Say that R is a regular expression if.
Regular Languages Chapter 1 Giorgi Japaridze Theory of Computability.
Complexity and Computability Theory I Lecture #5 Rina Zviel-Girshin Leah Epstein Winter
1 CD5560 FABER Formal Languages, Automata and Models of Computation Lecture 3 Mälardalen University 2006.
What do we know? DFA = NFA =  -NFA We have seen algorithms to transform DFA to NFA (trival) NFA to  NFA (trivial) NFA to DFA (subset construction)
Nondeterminism Subset Construction ε -Transitions.
Kleene’s Theorem and NFA
Lexical analysis Finite Automata
Non Deterministic Automata
Regular Expressions.
Single Final State for NFA
Chapter 2 FINITE AUTOMATA.
Language Recognition (12.4)
REGULAR LANGUAGES AND REGULAR GRAMMARS
Non-Deterministic Finite Automata
CS 154, Lecture 3: DFANFA, Regular Expressions.
4. Properties of Regular Languages
Non-Deterministic Finite Automata
Minimal DFA Among the many DFAs accepting the same regular language L, there is exactly one (up to renaming of states) which has the smallest possible.
Non Deterministic Automata
Introduction to Finite Automata
Regular Expressions.
4b Lexical analysis Finite Automata
CSCI 2670 Introduction to Theory of Computing
4b Lexical analysis Finite Automata
Language Recognition (12.4)
Regular Language Equivalence and DFA Minimization
NFA’s With ε-Transitions
Accepting Strings.
Lecture 5 Scanning.
CFL Big Picture.
Presentation transcript:

RegExp = (DFA,NFA,NFAe) Sipser pages 66-76

What do we know? DFA = NFA = ε-NFA We have seen algorithms to transform DFA to NFA (trival) NFA to ε-NFA (trivial) NFA to DFA (subset construction) Ε-NFA to NFA (ε-removal) The last piece of the puzzle is to show that regular expressions are equivalent to automata.

Outline We have two results to prove:   Theorem 1. For every regular expression E, there exists an ε-NFA A such that L(E)=L(A). Theorem 2. For every DFA A, there exists a regular expression E such that L(A)=L(E).

Process Since we've already seen the ''equivalence'' of various forms of finite automata, the picture is about to become complete: the same class of languages (REGULAR LANGUAGES, of course) is defined by Any of the 3 types of automata (DFA, NFA, ε-NFA) Regular expressions Proofs of both theorems are constructive; we'll learn algorithms to construct an ε-NFA from a regular expression, and a regular expression from a DFA.

Four Algorithms RegExp -> e-NFA via delta-extension Sipser pages 67-69 RegExp -> DFA via GenNFA DFA -> RegExp via state elimnation Sipser pages 69-76 DFA -> RegExp via GenNFA

Alg 1 - RE to e-NFA via delta-extension Sipser pages 67-69 Decompose a RegExp into its parts Small parts make simple DFAs Combine smaller parts by merging the delta functions of both parts, and by extending the merger (if necessary) with new transitions, and or new states. data RegExp a = Lambda | Empty | One a | Union (RegExp a) (RegExp a) | Cat (RegExp a) (RegExp a) | Star (RegExp a)

RE as Trees Every RE has a tree like structure. We process the tree bottom-up, strating from the leaves. "(ab+aab)*"

Small (leaf) Cases Suppose E is a given regular expression. The construction of the automaton A such that L(RE)=L(A) is defined by induction on the structure of RE.   Base Case: There are three base cases: E=ε, E=Æ, and E=a, where aÎS. Here are the corresponding automata. a E=ε E=Æ E=a

Induction Step There are three cases to consider Case 1: E=FG. Suppose (Ind. Hyp.) we have automata B and C such that L(F)=L(B) and L(G)=L(C). Let A be the automaton obtained by taking states and transitions of B and C together, Then add L-transitions from all final states of B to the start state of C (delta-extension). Then declare the start state (of the new automata) to be the start state of B. The final states (of the new automata) to be the final states of C.

B C BC ε We need to check that L(A)=L(B)L(C); it will follow then that L(A)=L(E)

Case 2: E=F+G Again, take F,G such that L(F)=L(B) and L(G)=L(C). Get A by adding a new start state and e-transitions from the new state to the start states of B and C. Check that L(A)=L(B) È L(C)! ε A B C

Case 3: E = F* Take B such that L(B)=L(F)and define A as in the picture. Check that L(A)=L(B)*.   B ε B

Exercise Exercise. Construct an ε-NFA accepting the language of the regular expression (ab+aab)*  

Definition - Generalized NFA Generalize DFA so that every transition is a regular expression rather than a letter of the alphabet. An GenNFA is a quintuple A=(Q,S,s,F,d), where the first four components are as in a DFA, and the transition function labels arcs between states with regular expressions. d: Q  Q  ¾® RegExp(S)

Alg 2 - RegExp to DFA via GenNFA (not covered in Sipser) Given a RegExp: ab+ac+ad+ae+af Construct a simple GenNFA with two states with one arc between the two states labeled with the regular expression. The source of the arc is the start state The target of the arc is the final state

Extend the GenNFA

(a+b)*cd*

(a+L+)c*

Alg 3 - From DFA to RE by state elimination (Sipser pages 69-76) The last construction was in fact algorithmic in both directions. We can improve the algorithm as follows. With an input NFA A, it will produce a regular expression E such that L(E)=L(A). First we standardize the automaton A so that: there is only one final state no arcs go out of the final state no arcs come into the initial state Do this by using ε-transitions ε ε ε ε

Assume the initial and final states are n-1 and n Assume the initial and final states are n-1 and n. We eliminate the states 1,2,…,n-2 one by one, producing intermediate automata whose arcs are labeled by regular expressions (i.e GenNFA). We end up with an automaton that has two states n-1 and n, and only one arc (from n-1 to n) whose label is the required regular expression E.

Algorithm For k = 1 to n-2 (* eliminate each node, k in turn *) For i = k+1 to n-1 (* i flows into k *) For j = k+1 to n (* k flows into j *) case paths i to k to j of Each case removes k, and adds a new label from i to j. i k j F E i j EF i k j E F G i j EG*F i k j E F H i j EF + H i k j E F G H i j EG*F + H

Example e 2 4 3 1 5 a b a b there is only one final state no arcs go out of the final state no arcs come into the initial state Renumber so final = n and start = n-1

Eliminate k=1 i k j i j i k j i j 2-1-2 2-(aa)-2 2-1-3 2-(b+ab)-3 5 4 e 2-1-2 2-(aa)-2 2-1-3 2-(b+ab)-3 2-1-4 Æ 2-1-5 Æ 3-1-2 3-(ba+a)-2 3-1-3 3-(bb)-3 3-1-4 Æ 3-1-5 Æ 4-1-2 4-(a)-2 4-1-3 4-(b)-3 4-1-4 Æ 4-1-5 Æ i k j F E i j EF i k j E F H i j EF + H 3 2 ba+a b+ab 5 4 a e b aa bb

Eliminate k=2 i k j i j 3-2-3 3-( (ba+a)(aa)*(b+ab) +bb )-3 3-2-4 Æ 5 4 a e b aa bb 3-2-3 3-( (ba+a)(aa)*(b+ab) +bb )-3 3-2-4 Æ 3-2-5 3-( (ba+a)(aa)*e + e )-5 4-2-3 4-(a(aa)*(b+ab) + b)-3 4-2-4 Æ 4-2-5 4-(a(aa)*e)-5 i k j E F G H i j EG*F + H 3 5 4 (ba+a)(aa)*e + e a(aa)*(b+ab) + b (ba+a)(aa)*(b+ab) + bb a(aa)*e

K=3 a(aa)*e a(aa)*(b+ab) + b (ba+a)(aa)*e + e (ba+a)(aa)*(b+ab) + bb 5 4 (ba+a)(aa)*e + e a(aa)*(b+ab) + b (ba+a)(aa)*(b+ab) + bb a(aa)*e 4-3-4 Æ 4-3-5 4-((a(aa)*(b+ab) + b) ((ba+a)(aa)*(b+ab) + bb)* ((ba+a)(aa)*e + e) + a(aa)*e)-5

Another example

Alg 4 - From DFA to RE via path labels (not covered in Sipser) Suppose A is a given DFA. Our goal is to find a regular expression E such that L(E)=L(A).   Assume the state set of A is {1,2,…,n}$. Let Lij be the language consisting of labels of all paths from i to j. Note that L(A) = ÈqÎ F Lsq s is the start state, F is the set of final states, Lsq is the labels of all paths from s to q We'll be done if we can find regular expressions Eij such that Lij =L(Eij).

Exercise: Compute LBC = {“0”, ”1”} LBD = _________________________ A B D 0,1 C 1 LBC = {“0”, ”1”} LBD = _________________________ LAB = _________________________ LAD = _________________________

DFA NFA GenNFA eNFA RegExp Lift delta fun Subset Construction data DFA q s = DFA { states :: [q], symbols :: [s], delta :: q -> s -> q, start :: q, final :: [q]} Lift delta fun DFA Subset Construction data NFA q s = NFA { states :: [q], symbols :: [s], delta :: q -> s -> [q], start :: q, final :: [q]} NFA data GNFA q s = GNFA { states :: [q], symbols :: [s], delta :: q -> q -> RegExp s, start :: q, final :: q } Delta fun lifting GenNFA e-removal data RegExp a = Epsilon | Empty | One a | Union (RegExp a) (RegExp a) | Cat (RegExp a) (RegExp a) | Star (RegExp a) State Elimination data NFAe q s = NFAe { states :: [q], symbols :: [s], delta :: q -> Maybe s -> [q], start :: q, final :: [q]} eNFA RegExp Via GenNFA by RegExp decompostion

Decidability of the Regular Languages Since we can produce an equivalent DFA for any regular expression or e-NFA, we will know how to recognize whether any two given representations of regular languages are equivalent. Transform both into into DFAs: X and Y Minimize X and Y to Xmin and Ymin Test if Xmin and Ymin are isomorphic. (problem 1.52, page 91 Sipser)