Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections 3.1 - 3.3.1.

Slides:



Advertisements
Similar presentations
Programming Languages Third Edition Chapter 6 Syntax.
Advertisements

C O N T E X T - F R E E LANGUAGES ( use a grammar to describe a language) 1.
Chapter Chapter Summary Languages and Grammars Finite-State Machines with Output Finite-State Machines with No Output Language Recognition Turing.
Chapter 3 Describing Syntax and Semantics Sections 1-3.
PZ02A - Language translation
PZ02B Programming Language design and Implementation -4th Edition Copyright©Prentice Hall, PZ02B - Regular grammars Programming Language Design.
Dr. Muhammed Al-Mulhem 1ICS ICS 535 Design and Implementation of Programming Languages Part 1 Fundamentals (Chapter 4) Compilers and Syntax.
COP4020 Programming Languages
(2.1) Grammars  Definitions  Grammars  Backus-Naur Form  Derivation – terminology – trees  Grammars and ambiguity  Simple example  Grammar hierarchies.
1 Syntax and Semantics The Purpose of Syntax Problem of Describing Syntax Formal Methods of Describing Syntax Derivations and Parse Trees Sebesta Chapter.
CS 2104 Prog. Lang. Concepts Dr. Abhik Roychoudhury School of Computing Introduction.
PZ02B Programming Language design and Implementation -4th Edition Copyright©Prentice Hall, PZ02B - Regular grammars Programming Language Design.
CS 326 Programming Languages, Concepts and Implementation Instructor: Mircea Nicolescu Lecture 2.
1 Chapter 3 Describing Syntax and Semantics. 3.1 Introduction Providing a concise yet understandable description of a programming language is difficult.
Syntax: 10/18/2015IT 3271 Semantics: Describe the structures of programs Describe the meaning of programs Programming Languages (formal languages) -- How.
A sentence (S) is composed of a noun phrase (NP) and a verb phrase (VP). A noun phrase may be composed of a determiner (D/DET) and a noun (N). A noun phrase.
CS Describing Syntax CS 3360 Spring 2012 Sec Adapted from Addison Wesley’s lecture notes (Copyright © 2004 Pearson Addison Wesley)
Grammars CPSC 5135.
ISBN Chapter 3 Describing Syntax and Semantics.
Copyright © by Curt Hill Grammar Types The Chomsky Hierarchy BNF and Derivation Trees.
CMSC 330: Organization of Programming Languages Context-Free Grammars.
Parsing Introduction Syntactic Analysis I. Parsing Introduction 2 The Role of the Parser The Syntactic Analyzer, or Parser, is the heart of the front.
CPS 506 Comparative Programming Languages Syntax Specification.
Chapter 3 Describing Syntax and Semantics
1 Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
ISBN Chapter 3 Describing Syntax and Semantics.
1Computer Sciences Department. Book: INTRODUCTION TO THE THEORY OF COMPUTATION, SECOND EDITION, by: MICHAEL SIPSER Reference 3Computer Sciences Department.
Syntax and Semantics Form and Meaning of Programming Languages Copyright © by Curt Hill.
CSC312 Automata Theory Lecture # 26 Chapter # 12 by Cohen Context Free Grammars.
Chapter 3 – Describing Syntax CSCE 343. Syntax vs. Semantics Syntax: The form or structure of the expressions, statements, and program units. Semantics:
1 Regular grammars Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Section
Chapter 3: Describing Syntax and Semantics
Chapter 3 – Describing Syntax
Describing Syntax and Semantics
PROGRAMMING LANGUAGES
lec02-parserCFG May 8, 2018 Syntax Analyzer
Describing Syntax and Semantics
CS 326 Programming Languages, Concepts and Implementation
CS510 Compiler Lecture 4.
Chapter 3 Context-Free Grammar and Parsing
Chapter 3 – Describing Syntax
Concepts of Programming Languages
Regular grammars Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Section
What does it mean? Notes from Robert Sebesta Programming Languages
Automata and Languages What do these have in common?
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Compiler Construction
Syntax versus Semantics
Compiler Lecture 1 CS510.
CS 363 Comparative Programming Languages
CSE 3302 Programming Languages
Compiler Design 4. Language Grammars
R.Rajkumar Asst.Professor CSE
September 13th Grammars.
Compilers Principles, Techniques, & Tools Taught by Jing Zhang
Chapter 3 Describing Syntax and Semantics.
Regular grammars Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Section
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
High-Level Programming Language
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Discrete Maths 13. Grammars Objectives
lec02-parserCFG May 27, 2019 Syntax Analyzer
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Programming Languages 2nd edition Tucker and Noonan
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections
Regular grammars Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Section
COMPILER CONSTRUCTION
PZ02B - Regular grammars Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Section PZ02B.
Faculty of Computer Science and Information System
Presentation transcript:

Language translation Programming Language Design and Implementation (4th Edition) by T. Pratt and M. Zelkowitz Prentice Hall, 2001 Sections 3.1 - 3.3.1 Language translation

Program structure Syntax What a program looks like BNF (John Bacus and Peter Naur) context free grammars) a useful notation for describing syntax FORTRAN은 문법이 formal하게 정의되지 않았다. Blank 무시 ::: Do10I=110  Do10I=1,10 Language translation

Program structure Semantics Execution behavior Static semantics - Semantics determined at compile time: var A: integer; Type and storage for A int B[10]; Type and storage for array B float MyProcC(float x;float y){...}; Function attributes Dynamic semantics - Semantics determined during execution: X = ``ABC'' SNOBOL4 example: X a string X = 1 + 2; X an integer :(X) X an address; Go to label X Language translation

Aspects of a program Declarations - Information for compiler var A: integer; typedef struct { int A; float B } C; Control - Changes to state of the machine if (A<B) { ... } while (C>D) { ... } Structure often defined by a Backus Naur Form (BNF) grammar (First used in description of Algol in 1958. Peter Naur was chair of Algol committee, and John Backus was secretary of committee, who wrote report.) We will see later - BNF turns out to be same as context free grammars developed by Noam Chomsky, a linguist) Language translation

Stages in translating a program Language translation

Major stages Lexical analysis (Scanner): Breaking a program into primitive components, called tokens (identifiers, numbers, keywords, ...) We will see that regular grammars and finite state automata are formal models of this. Syntactic analysis (Parsing): Creating a syntax tree of the program. We will see that context free grammars and pushdown automata are formal models of this. Symbol table: Storing information about declared objects (identifiers, procedure names, ...) Semantic analysis: Understanding the relationship among the tokens in the program. Optimization: Rewriting the syntax tree to create a more efficient program. Code generation: Converting the parsed program into an executable form. We will briefly look at scanning and parsing. A full treatment of compiling is beyond scope of this course. Language translation

Translation environments Language translation

BNF grammars Nonterminal: A finite set of symbols: <sentence> <subject> <predicate> <verb> <article> <noun> Terminal: A finite set of symbols: the, boy, girl, ran, ate, cake Start symbol: One of the nonterminals: <sentence> Rules (productions): A finite set of replacement rules: <sentence> ::= <subject> <predicate> <subject> ::= <article> <noun> <predicate>::= <verb> <article> <noun> <verb> ::= ran | ate <article> ::= the <noun> ::= boy | girl | cake Replacement Operator: Replace any nonterminal by a right hand side value using any rule (written ) Language translation

Example BNF sentences <sentence>  <subject> <predicate> First rule  <article> <noun> <predicate> Second rule  the <noun> <predicate> Fifth rule ...  the boy ate the cake Also from <sentence> you can derive  the cake ate the boy Syntax does not imply correct semantics Note: Rule <A> ::= <B><C> This BNF rule also written with equivalent syntax: A  BC Language translation

Languages Any string derived from the start symbol is a sentential form. Sentence: String of terminals derived from start symbol by repeated application of replacement operator A language generated by grammar G (written L(G)) is the set of all strings over the terminal alphabet (i.e., sentences) derived from start symbol. That is, a language is the set of sentential forms containing only terminal symbols. Language translation

Derivations A derivation is a sequence of sentential forms starting from start symbol. Derivation trees: Grammar: B  0B | 1B | 0 | 1 Derivation: B  0B  01B  010 From derivation get parse tree But derivations may not be unique S  SS | (S) | () S  SS (S)S (())S (())() S  SS  S() (S)() (())() Different derivations but get the same parse tree Language translation

Ambiguity But from some grammars you can get 2 different parse trees for the same string: ()()() Each corresponds to a unique derivation: S  SS  SSS ()SS ()()S ()()() A grammar is ambiguous if some sentence has 2 distinct parse trees. We desire unambiguous grammars to understand semantics. Language translation

Role of  How to characterize strings of length 0? – Semantically it makes sense to consider such strings. 1. In BNF, -productions: S  SS | (S) | () |  Can always delete them in grammar. For example: X  abYc Y   Delete -production and add production without : X  abc 2. In fsa -  moves means that in initial state, without input you can move to final state. Language translation

Syntax can be used to determine some semantics During Algol era, thought that BNF could be used for semantics of a program: What is the value of: 2 * 3 + 4 * 5? (a) 26 (b) 70 (c) 50 All are reasonable answers? Why? Language translation

Usual grammar for expressions E  E + T | T T  T * P | P P  i | ( E ) “Natural” value of expression is 26 Multiply 2 * 3 = 6 Multiply 4 * 5 = 20 Add 6 + 20 = 26 Language translation

But the “precedence” of operations is only a convention Grammar for 70 E  E * T | T T  T + P | P P  i | ( E ) Grammar for 50 E  E + T | E * T | T T  i | ( E ) All 3 grammars generate exactly the same language, but each has a different semantics (i.e., expression value) for most expressions. Draw parse tree of expression 2*3+4*5 for each grammar Language translation

Classes of grammars BNF: Backus-Naur Form - Context free - Type 2 - Already described Regular grammars: subclass of BNF - Type 3: BNF rules are restricted: A  t N | t where: N = nonterminal, t = terminal Examples: Binary numbers: B  0 B | 1 B | 0 | 1 Identifiers: I  a L | b L | c L |...| z L | a |...| y | z L  1 L | 2 L |...| 9 L | 0 L | 1 |...| 9 | 0 | a L | b L | c L |...| z L | a |...| y | z ab7d Language translation

Other classes of grammars The context free and regular grammars are important for programming language design. We study these in detail. Other classes have theoretical importance, but not in this course: Context sensitive grammar: Type 1 - Rules:    where |  |  |  | [That is, length of   length of , i.e., all sentential forms are length non-decreasing] Unrestricted, recursively enumerable: Type 0 - Rules:   . No restrictions on  and . Language translation