Download presentation
Presentation is loading. Please wait.
Published byMarilyn Norman Modified over 9 years ago
1
Compilation in More Detail Asst. Prof. Dr. Ahmet Sayar Spring-2012 Kocaeli University Computer Engineering Department Principles of Programming Languages
2
Phases of Compilation
3
3 The Structure of a Compiler 1.Lexical Analysis 2.Parsing 3.Semantic Analysis 4.Symbol Table 5.Optimization 6.Code Generation The first 3, at least, can be understood by analogy to how humans comprehend English.
4
4 1. Lexical Analysis First step: recognize words. – Smallest unit above letters This is a sentence. Note the – Capital “T” (start of sentence symbol) – Blank “ “ (word separator) – Period “.” (end of sentence symbol)
5
5 More Lexical Analysis Lexical analysis is not trivial. Consider: ist his ase nte nce Plus, programming languages are typically more cryptic than English: *p->f += -.12345e-5
6
6 And More Lexical Analysis Lexical analyzer divides program text into “words” or “tokens” if x == y then z = 1; else z = 2; Units: if, x, ==, y, then, z, =, 1, ;, else, z, =, 2, ;
7
7 2. Parsing Once words are understood, the next step is to understand sentence structure Parsing = Diagramming Sentences – The diagram is a tree
8
Diagramming a Sentence 8 Thislineisalongersentence verbarticlenounarticleadjectivenoun subjectobject sentence
9
9 Parsing Programs Parsing program expressions is the same Consider: If x == y then z = 1; else z = 2; Diagrammed: if-then-else xyz1z2== assignrelationassign predicateelse-stmtthen-stmt
10
10 3. Semantic Analysis Once sentence structure is understood, we can try to understand “meaning” – But meaning is too hard for compilers Compilers perform limited analysis to catch inconsistencies Some do more analysis to improve the performance of the program Parse tree Semantic Analyzer Intermediate Program
11
11 Semantic Analysis in English Example: Jack said Jerry left his assignment at home. What does “his” refer to? Jack or Jerry? Even worse: Jack said Jack left his assignment at home? How many Jacks are there? Which one left the assignment?
12
Programming languages define strict rules to avoid such ambiguities This C++ code prints “4”; the inner definition is used Semantic Analysis in Programming 12 { int Jack = 3; { int Jack = 4; cout << Jack; }
13
13 More Semantic Analysis Compilers perform many semantic checks besides variable bindings Example: Jack left her homework at home. A “type mismatch” between her and Jack; we know they are different people – Presumably Jack is male
14
After the lexical analyzing and parsing, symbol table is created. Lexical anayzer gives symbol table as an output Below table shows tokens for a pascal statemet – toplam:=değer+10; 4. Symbol Table
15
15 5. Optimization No strong counterpart in English, but akin to editing Automatically modify programs so that they – Run faster – Use less memory – In general, conserve some resource
16
Result = function1(a+b) + function2(a+b) How do you optimize this code? Machine independent optimization
17
Code motion – Invariant expressions should be executed only once – E.g. for (int i = 0; i < x.length; i++) x[i] *= Math.PI * Math.cos(y); double picosy = Math.PI * Math.cos(y); for (int i = 0; i < x.length; i++) x[i] *= picosy; Machine independent optimization
18
Multiplying a number with the power of 2 – Shift to the left Dividing a number with the power of 2 – Shift to the right Machine dependent optimization
19
19 Issues Compiling is almost this simple, but there are many pitfalls. Example: How are erroneous programs handled? Language design has big impact on compiler – Determines what is easy and hard to compile – Course theme: many trade-offs in language design
20
20 Compilers Today The overall structure of almost every compiler adheres to this outline The proportions have changed since FORTRAN – Early: lexing, parsing most complex, expensive – Today: optimization dominates all other phases, lexing and parsing are cheap
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.