Modeling Computation Rosen, ch. 12.

Slides:



Advertisements
Similar presentations
Natural Language Processing - Formal Language - (formal) Language (formal) Grammar.
Advertisements

C O N T E X T - F R E E LANGUAGES ( use a grammar to describe a language) 1.
Grammars, Languages and Parse Trees. Language Let V be an alphabet or vocabulary V* is set of all strings over V A language L is a subset of V*, i.e.,
COGN1001: Introduction to Cognitive Science Topics in Computer Science Formal Languages and Models of Computation Qiang HUO Department of Computer.
Chapter Chapter Summary Languages and Grammars Finite-State Machines with Output Finite-State Machines with No Output Language Recognition Turing.
CS5371 Theory of Computation
Foundations of (Theoretical) Computer Science Chapter 2 Lecture Notes (Section 2.1: Context-Free Grammars) David Martin With some.
Languages, grammars, and regular expressions
Normal forms for Context-Free Grammars
Chapter 3: Formal Translation Models
COP4020 Programming Languages
Syntactic Pattern Recognition Statistical PR:Find a feature vector x Train a system using a set of labeled patterns Classify unknown patterns Ignores relational.
Finite State Machines Data Structures and Algorithms for Information Processing 1.
Languages and Grammars MSU CSE 260. Outline Introduction: E xample Phrase-Structure Grammars: Terminology, Definition, Derivation, Language of a Grammar,
Language Translation Principles Part 1: Language Specification.
1 INFO 2950 Prof. Carla Gomes Module Modeling Computation: Languages and Grammars Rosen, Chapter 12.1.
Formal Grammars Denning, Sections 3.3 to 3.6. Formal Grammar, Defined A formal grammar G is a four-tuple G = (N,T,P,  ), where N is a finite nonempty.
Languages & Strings String Operations Language Definitions.
Lecture 16 Oct 18 Context-Free Languages (CFL) - basic definitions Examples.
Introduction Syntax: form of a sentence (is it valid) Semantics: meaning of a sentence Valid: the frog writes neatly Invalid: swims quickly mathematics.
111 Computability, etc. Recap. Present homework. Grammar and machine equivalences. Turing history Homework: Equivalence assignments. Presentation proposals,
::ICS 804:: Theory of Computation - Ibrahim Otieno SCI/ICT Building Rm. G15.
CS/IT 138 THEORY OF COMPUTATION Chapter 1 Introduction to the Theory of Computation.
A sentence (S) is composed of a noun phrase (NP) and a verb phrase (VP). A noun phrase may be composed of a determiner (D/DET) and a noun (N). A noun phrase.
Languages, Grammars, and Regular Expressions Chuck Cusack Based partly on Chapter 11 of “Discrete Mathematics and its Applications,” 5 th edition, by Kenneth.
Grammars CPSC 5135.
Languages & Grammars. Grammars  A set of rules which govern the structure of a language Fritz Fritz The dog The dog ate ate left left.
1 Computability Five lectures. Slides available from my web page There is some formality, but it is gentle,
Introduction to Language Theory
Copyright © Curt Hill Languages and Grammars This is not English Class. But there is a resemblance.
CMSC 330: Organization of Programming Languages Context-Free Grammars.
Parsing Introduction Syntactic Analysis I. Parsing Introduction 2 The Role of the Parser The Syntactic Analyzer, or Parser, is the heart of the front.
1 A well-parenthesized string is a string with the same number of (‘s as )’s which has the property that every prefix of the string has at least as many.
CS 208: Computing Theory Assoc. Prof. Dr. Brahim Hnich Faculty of Computer Sciences Izmir University of Economics.
1Computer Sciences Department. Book: INTRODUCTION TO THE THEORY OF COMPUTATION, SECOND EDITION, by: MICHAEL SIPSER Reference 3Computer Sciences Department.
September1999 CMSC 203 / 0201 Fall 2002 Week #14 – 25/27 November 2002 Prof. Marie desJardins clip art courtesy of
Programming Languages and Design Lecture 2 Syntax Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
Lecture 16: Modeling Computation Languages and Grammars Finite-State Machines with Output Finite-State Machines with No Output.
Formal Languages and Grammars
Discrete Structures ICS252 Chapter 5 Lecture 2. Languages and Grammars prepared By sabiha begum.
Mathematical Foundations of Computer Science Chapter 3: Regular Languages and Regular Grammars.
1 Course Overview Why this course “formal languages and automata theory?” What do computers really do? What are the practical benefits/application of formal.
Models of Computation by Dr. Michael P. Frank, University of Florida Modified and extended by Longin Jan Latecki, Temple University Rosen 7 th ed., Ch.
1 A well-parenthesized string is a string with the same number of (‘s as )’s which has the property that every prefix of the string has at least as many.
Formal grammars A formal grammar is a system for defining the syntax of a language by specifying sequences of symbols or sentences that are considered.
Week 14 - Friday.  What did we talk about last time?  Simplifying FSAs  Quotient automata.
Lecture 17: Theory of Automata:2014 Context Free Grammars.
Chapter 2. Formal Languages Dept. of Computer Engineering, Hansung University, Sung-Dong Kim.
Formal Languages and Automata FORMAL LANGUAGES FINITE STATE AUTOMATA.
Modeling Arithmetic, Computation, and Languages Mathematical Structures for Computer Science Chapter 8 Copyright © 2006 W.H. Freeman & Co.MSCS SlidesAlgebraic.
PROGRAMMING LANGUAGES
BCT 2083 DISCRETE STRUCTURE AND APPLICATIONS
Discrete Mathematics and its Applications Rosen 7th ed., Ch. 13.1
Context-Free Grammars: an overview
Lecture 1 Theory of Automata
Complexity and Computability Theory I
Natural Language Processing - Formal Language -
Discrete Mathematics and its Applications
Context-Free Languages
Discrete Mathematics and its Applications Rosen 6th ed., Ch. 12.1
Models of Computation by Dr. Michael P
Intro to Data Structures
CHAPTER 2 Context-Free Languages
Compilers Principles, Techniques, & Tools Taught by Jing Zhang
Discrete Mathematics and its Applications Rosen 7th ed., Ch. 13.1
Discrete Mathematics and its Applications Rosen 7th ed., Ch. 13.1
Models of Computation by Dr. Michael P
Discrete Mathematics and its Applications Rosen 8th ed., Ch. 13.1
Models of Computation by Dr. Michael P
Languages and Grammer In TCS
Presentation transcript:

Modeling Computation Rosen, ch. 12

Modeling Computation We learned earlier the concept of an algorithm. A description of a computational procedure. Now, how can we model the computer itself, and what it is doing when it carries out an algorithm? For this, we want to model the abstract process of computation itself.

Early Models of Computation Recursive Function Theory Kleene, Church, Turing, Post, 1930’s (before computers!!) Turing Machines – Turing, 1940’s (defined: computable) RAM Machines – von Neumann, 1940’s (“real computer”) Cellular Automata – von Neumann, 1950’s (Wolfram 2005; physics of our world?) Finite-state machines, pushdown automata various people, 1950’s VLSI models – 1970s Parallel RAMs, etc. – 1980’s

§12.1 – Languages & Grammars Phrase-Structure Grammars Types of Phrase-Structure Grammars Derivation Trees Backus-Naur Form

Computers as Transition Functions A computer (or really any physical system) can be modeled as having, at any given time, a specific state sS from some (finite or infinite) state space S. Also, at any time, the computer receives an input symbol iI and produces an output symbol oO. Where I and O are sets of symbols. Each “symbol” can encode an arbitrary amount of data. A computer can then be modeled as simply being a transition function T:S×I → S×O. Given the old state, and the input, this tells us what the computer’s new state and its output will be a moment later. Every model of computing we’ll discuss can be viewed as just being some special case of this general picture.

Language Recognition Problem Let a language L be any set of some arbitrary objects s which will be dubbed “sentences.” “legal” or “grammatically correct” sentences of the language. Let the language recognition problem for L be: Given a sentence s, is it a legal sentence of the language L? That is, is sL? Surprisingly, this simple problem is as general as our very notion of computation itself! Hmm… Ex: addition ‘language’ “num1-num2-(num1+num2)”

Vocabularies and Sentences Remember the concept of strings w of symbols s chosen from an alphabet Σ An alternative terminology for this concept: Sentences σ of words υ chosen from a vocabulary V. No essential difference in concept or notation! Empty sentence (or string): λ (length 0) Set of all sentences over V: Denoted V*.

Grammars A formal grammar G is any compact, precise mathematical definition of a language L. As opposed to just a raw listing of all of the language’s legal sentences, or just examples of them. A grammar implies an algorithm that would generate all legal sentences of the language. Often, it takes the form of a set of recursive definitions. A popular way to specify a grammar recursively is to specify it as a phrase-structure grammar.

Phrase-Structure Grammars A phrase-structure grammar (abbr. PSG) G = (V,T,S,P) is a 4-tuple, in which: V is a vocabulary (set of words) The “template vocabulary” of the language. T  V is a set of words called terminals Actual words of the language. Also, N :≡ V − T is a set of special “words” called nonterminals. (Representing concepts like “noun”) SN is a special nonterminal, the start symbol. P is a set of productions (to be defined). Rules for substituting one sentence fragment for another. A phrase-structure grammar is a special case of the more general concept of a string-rewriting system, due to Post.

Productions A production pP is a pair p=(b,a) of sentence fragments a, b (not necessarily in L), which may generally contain a mix of both terminals and nonterminals. We often denote the production as b → a. Read “replace b by a” Call b the “before” string, a goes the “after” string. It is a kind of recursive definition meaning that If lbr  LT, then lar  LT. (LT = sentence “templates”) That is, if lbr is a legal sentence template, then so is lar. That is, we can substitute a in place of b in any sentence template.

Languages from PSGs The recursive definition of the language L defined by the PSG: G = (V, T, S, P): Rule 1: S  LT (LT is L’s template language) The start symbol is a sentence template (member of LT). Rule 2: (b→a)P: l,rV*: lbr  LT → lar  LT Any production, after substituting in any fragment of any sentence template, yields another sentence template. Rule 3: (σ  LT: ¬nN: nσ) → σL All sentence templates that contain no nonterminal symbols are sentences in L. Abbreviate this using lbr  lar. (read, “lar is directly derivable from lbr”).

PSG Example – English Fragment We have G = (V, T, S, P), where: V = {(sentence), (noun phrase), (verb phrase), (article), (adjective), (noun), (verb), (adverb), a, the, large, hungry, rabbit, mathematician, eats, hops, quickly, wildly} T = {a, the, large, hungry, rabbit, mathematician, eats, hops, quickly, wildly} S = (sentence) P = (see next slide)

Productions for our Language P = { (sentence) → (noun phrase) (verb phrase), (noun phrase) → (article) (adjective) (noun), (noun phrase) → (article) (noun), (verb phrase) → (verb) (adverb), (verb phrase) → (verb), (article) → a, (article) → the, (adjective) → large, (adjective) → hungry, (noun) → rabbit, (noun) → mathematician, (verb) → eats, (verb) → hops, (adverb) → quickly, (adverb) → wildly }

Backus-Naur Form sentence ::= noun phrase verb phrase noun phrase ::= article [adjective] noun verb phrase ::= verb [adverb] article ::= a | the adjective ::= large | hungry noun ::= rabbit | mathematician verb ::= eats | hops adverb ::= quickly | wildly Square brackets [] mean “optional” Vertical bars mean “alternatives”

A Sample Sentence Derivation (sentence) (noun phrase) (verb phrase) (article) (adj.) (noun) (verb phrase) (art.) (adj.) (noun) (verb) (adverb) the (adj.) (noun) (verb) (adverb) the large (noun) (verb) (adverb) the large rabbit (verb) (adverb) the large rabbit hops (adverb) the large rabbit hops quickly On each step, we apply a production to a fragment of the previous sentence template to get a new sentence template. Finally, we end up with a sequence of terminals (real words), that is, a sentence of our language L.

Another Example V T Let G = ({a, b, A, B, S}, {a, b}, S, {S → ABa, A → BB, B → ab, AB → b}). One possible derivation in this grammar is: S  ABa  Aaba  BBaba  Bababa  abababa. P

Derivability Recall that the notation w0  w1 means that (b→a)P: l,rV*: w0 = lbr  w1 = lar. The template w1 is directly derivable from w0. If w2,…wn-1: w0  w1  w2  …  wn, then we write w0 * wn, and say that wn is derivable from w0. The sequence of steps wi  wi+1 is called a derivation of wn from w0. Note that the relation * is just the transitive closure of the relation .

A Simple Definition of L(G) The language L(G) (or just L) that is generated by a given phrase-structure grammar G=(V,T,S,P) can be defined by: L(G) = {w  T* | S * w} That is, L is simply the set of strings of terminals that are derivable from the start symbol.

Language Generated by a Grammar Example: Let G = ({S,A,a,b},{a,b}, S, {S → aA, S → b, A → aa}). What is L(G)? Easy: We can just draw a tree of all possible derivations. We have: S  aA  aaa. and S  b. Answer: L = {aaa, b}. S aA b Example of a derivation tree or parse tree or sentence diagram. aaa

Generating Infinite Languages A simple PSG can easily generate an infinite language. Example: S → 11S, S → 0 (T = {0,1}). The derivations are: S  0 S  11S  110 S  11S  1111S  11110 and so on… L = {(11)*0} – the set of all strings consisting of some number of concaten- ations of 11 with itself, followed by 0.

Another example Construct a PSG that generates the language L = {0n1n | nN}. 0 and 1 here represent symbols being concatenated n times, not integers being raised to the nth power. Solution strategy: Each step of the derivation should preserve the invariant that the number of 0’s = the number of 1’s in the template so far, and all 0’s come before all 1’s. Solution: S → 0S1, S → λ.

Types of Grammars - Chomsky hierarchy of languages Venn Diagram of Grammar Types: Type 0 – Phrase-structure Grammars Type 1 – Context-Sensitive Type 2 – Context-Free Type 3 – Regular

Defining the PSG Types Type 1: Context-Sensitive PSG: All after fragments are either longer than the corresponding before fragments, or empty: if b → a, then |b| < |a|  a = λ . Type 2: Context-Free PSG: All before fragments have length 1: if b → a, then |b| = 1 (b  N). Type 3: Regular PSGs: All after fragments are either single terminals, or a pair of a terminal followed by a nonterminal. if b → a, then a  T  a  TN.

Classifying grammars Given a grammar, we need to be able to find the smallest class in which it belongs. This can be determined by answering three questions: Are the left hand sides of all of the productions single non-terminals? If yes, does each of the productions create at most one non-terminal and is it on the right? Yes – regular No – context-free If not, can any of the rules reduce the length of a string of terminals and non-terminals? Yes – unrestricted No – context-sensitive

A regular grammar is one where each production takes one of the following forms: (where the capital letters are non-terminals and w is a non-empty string of terminals): S  , S  w, S  T, S  wT. Therefore, the grammar: S → 0S1, S → λ is not regular, it is context-free Only one nonterminal can appear on the right side and it must be at the right end of the right side. Therefore the productions A  aBc and S  TU are not part of a regular grammar, but the production A  abcA is.

Definition: Context-Free Grammars Variables Terminal symbols Start variable Productions of the form: Variable String of variables and terminals

Example  The language { anbncn | n  1} is context-sensitive but not context free. A grammar for this language is given by: S  aSBC | aBC CB  BC aB  ab bB  bb bC  bc cC  cc

A derivation from this grammar is:- S  aSBC  aaBCBC (using S  aBC)  aabCBC (using aB  ab)  aabBCC (using CB  BC)  aabbCC (using bB  bb)  aabbcC (using bC  bc)  aabbcc (using cC  cc)  which derives a2b2c2.