Concordia University Department of Computer Science and Software Engineering Click to edit Master title style COMPILER DESIGN Error recovery in top-down.

Slides:



Advertisements
Similar presentations
Error Handling A compiler should: detect errors locate errors recover from errors Errors may occur in any of the three main analysis phases of compiling:
Advertisements

Joey Paquet, 2000, 2002, 2008, Lecture 7 Bottom-Up Parsing II.
Chap. 5, Top-Down Parsing J. H. Wang Mar. 29, 2011.
Lecture # 7 Chapter 4: Syntax Analysis. What is the job of Syntax Analysis? Syntax Analysis is also called Parsing or Hierarchical Analysis. A Parser.
Concordia University Department of Computer Science and Software Engineering Click to edit Master title style COMPILER DESIGN Syntax-Directed Translation.
Parsing III (Eliminating left recursion, recursive descent parsing)
ISBN Chapter 4 Lexical and Syntax Analysis The Parsing Problem Recursive-Descent Parsing.
1 Predictive parsing Recall the main idea of top-down parsing: Start at the root, grow towards leaves Pick a production and try to match input May need.
Top-Down Parsing.
9/27/2006Prof. Hilfinger, Lecture 141 Parsing Odds and Ends Lecture 14 (P. N. Hilfinger, plus slides adapted from R. Bodik)
CSC3315 (Spring 2009)1 CSC 3315 Lexical and Syntax Analysis Hamid Harroud School of Science and Engineering, Akhawayn University
ERROR HANDLING Lecture on 27/08/2013 PPT: 11CS10037 SAHIL ARORA.
Joey Paquet, 2000, 2002, Lecture 3 Syntactic Analysis Part I.
Top-Down Parsing - recursive descent - predictive parsing
Chapter 5 Top-Down Parsing.
Concordia University Department of Computer Science and Software Engineering Click to edit Master title style COMPILER DESIGN Review Joey Paquet,
Lexical Analysis Hira Waseem Lecture
Profs. Necula CS 164 Lecture Top-Down Parsing ICOM 4036 Lecture 5.
Joey Paquet, Lecture 12 Review. Joey Paquet, Course Review Compiler architecture –Lexical analysis, syntactic analysis, semantic.
SYNTAX ANALYSIS & ERROR RECOVERY By: Sarthak Swaroop.
Review 1.Lexical Analysis 2.Syntax Analysis 3.Semantic Analysis 4.Code Generation 5.Code Optimization.
Lesson 5 CDT301 – Compiler Theory, Spring 2011 Teacher: Linus Källberg.
Parsing Lecture 5 Fri, Jan 28, Syntax Analysis The syntax of a language is described by a context-free grammar. Each grammar rule has the form A.
Joey Paquet, 2000, Lecture 5 Error Recovery Techniques in Top-Down Predictive Syntactic Analysis.
1 Problems with Top Down Parsing  Left Recursion in CFG May Cause Parser to Loop Forever.  Indeed:  In the production A  A  we write the program procedure.
Concordia University Department of Computer Science and Software Engineering Click to edit Master title style COMPILER DESIGN Syntactic analysis: Part.
COP4020 Programming Languages Parsing Prof. Xin Yuan.
Chapter 4 Top-Down Parsing Recursive-Descent Gang S. Liu College of Computer Science & Technology Harbin Engineering University.
Recursive Descent Parsers Lecture 6 Mon, Feb 2, 2004.
Joey Paquet, 2000, Lecture 2 Lexical Analysis.
Top-down Parsing lecture slides from C OMP 412 Rice University Houston, Texas, Fall 2001.
More Parsing CPSC 388 Ellen Walker Hiram College.
Top-down Parsing Recursive Descent & LL(1) Comp 412 Copyright 2010, Keith D. Cooper & Linda Torczon, all rights reserved. Students enrolled in Comp 412.
Top-Down Parsing CS 671 January 29, CS 671 – Spring Where Are We? Source code: if (b==0) a = “Hi”; Token Stream: if (b == 0) a = “Hi”; Abstract.
Overview of Previous Lesson(s) Over View  In our compiler model, the parser obtains a string of tokens from the lexical analyzer & verifies that the.
1 Nonrecursive Predictive Parsing  It is possible to build a nonrecursive predictive parser  This is done by maintaining an explicit stack.
Top-down Parsing. 2 Parsing Techniques Top-down parsers (LL(1), recursive descent) Start at the root of the parse tree and grow toward leaves Pick a production.
1 Compiler Construction (CS-636) Muhammad Bilal Bashir UIIT, Rawalpindi.
Top-Down Parsing.
Top-Down Predictive Parsing We will look at two different ways to implement a non- backtracking top-down parser called a predictive parser. A predictive.
CS 330 Programming Languages 09 / 25 / 2007 Instructor: Michael Eckmann.
COMP 3438 – Part II-Lecture 5 Syntax Analysis II Dr. Zili Shao Department of Computing The Hong Kong Polytechnic Univ.
1 Topic #4: Syntactic Analysis (Parsing) CSC 338 – Compiler Design and implementation Dr. Mohamed Ben Othman ( )
Chapter 2 (part) + Chapter 4: Syntax Analysis S. M. Farhad 1.
Joey Paquet, 2000, 2002, 2008, 2012, Lecture 5 Error Recovery Techniques in Top-Down Predictive Syntactic Analysis.
COMP 3438 – Part II-Lecture 6 Syntax Analysis III Dr. Zili Shao Department of Computing The Hong Kong Polytechnic Univ.
Compilers: Bottom-up/6 1 Compiler Structures Objective – –describe bottom-up (LR) parsing using shift- reduce and parse tables – –explain how LR.
Syntax Analysis Or Parsing. A.K.A. Syntax Analysis –Recognize sentences in a language. –Discover the structure of a document/program. –Construct (implicitly.
Syntax Analysis By Noor Dhia Syntax analysis:- Syntax analysis or parsing is the most important phase of a compiler. The syntax analyzer considers.
Error recovery in predictive parsing An error is detected during the predictive parsing when the terminal on top of the stack does not match the next input.
Semantic analysis Jakub Yaghob
Problems with Top Down Parsing
Introduction to Top Down Parser
Chapter 4 Syntax Analysis.
Syntax Analysis Chapter 4.
Compiler design Bottom-up parsing: Canonical LR and LALR
Syntax Analysis Sections :.
Top-Down Parsing CS 671 January 29, 2008.
CS 540 George Mason University
Top-Down Parsing The parse tree is created top to bottom.
Chapter 4 Top Down Parser.
Compiler design Error recovery in top-down predictive parsing
Computing Follow(A) : All Non-Terminals
Compiler design Syntactic analysis – part II First and follow sets
Nonrecursive Predictive Parsing
Compilers Principles, Techniques, & Tools Taught by Jing Zhang
Predictive Parsing Program
Syntactic Analysis Part II
Compiler design Bottom-up parsing: Canonical LR and LALR
Parsing CSCI 432 Computer Science Theory
Presentation transcript:

Concordia University Department of Computer Science and Software Engineering Click to edit Master title style COMPILER DESIGN Error recovery in top-down predictive parsing Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering A syntax error happens when the stream of tokens coming from the lexical analyzer does not comply with the grammatical rules defining the programming language. The next token in input is not expected according to the syntactic definition of the language. One of the main roles of a compiler is to identify all programming errors and give meaningful indications about the location and nature of errors in the input program. Syntax error recovery Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Detect all compile-time errors Report the presence of errors clearly and accurately Recover from each error quickly enough to be able to detect subsequent errors Should not slow down the processing of correct programs Beware of spurious errors that are just a consequence of earlier errors Goals of error recovery Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Give the position of the error in the source file, maybe print the offending line and point at the error location. If the nature of the error is easily identifiable, give a meaningful error message. The compiler should not provide erroneous information about the nature of errors. If the error is easily correctable, a solution can be proposed or applied by the compiler and a warning issued. Reporting errors Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Good error recovery highly depends on how quickly the error is detected. Often, an error will be detected only after the faulty token has passed. It will then be more difficult to achieve good error reporting, as well as good error recovery. Should recover from each error quickly enough to be able to detect subsequent errors. Error recovery should skip as less tokens as possible. Should not identify more errors than there really is. Cascades of errors should be avoided. Should give meaningful information about the errors, while avoiding to give erroneous information. Error recovery should induce processing overhead only when errors are encountered. Should not report errors that are consequences of the application of error recovery, e.g. semantic errors. Good error recovery Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering There are many different strategies that a parser can employ to recover from syntactic errors. Although some are better than others, none of these methods provide a universal solution. Panic mode, or don’t panic (Nicklaus Wirth) Phrase level correction Error productions Global correction Error recovery strategies Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Panic Mode On discovering an error, the parser discards input tokens until an element of a designated set of synchronizing tokens is found. Synchronizing tokens are typically delimiters such as semicolons or end of block delimiters. Skipping tokens often has a side-effect of skipping other errors. Choosing the right set of synchronizing tokens is of prime importance. Simplest method to implement. Can be integrated in most parsing methods. Cannot enter an infinite loop. Error Recovery Strategies Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Phrase Level Correction On discovering an error, the parser performs a local correction on the remaining input, e.g. replace a comma by a semicolon, delete an extraneous semicolon, insert a missing semicolon, etc. Replacements are done in specific contexts. There are myriads of different such contexts. Cannot cope with errors that occurred before the point of detection. Can enter an infinite loop, e.g. insertion of an expected token. Error Recovery Strategies Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Error Productions The grammar is augmented with “error” productions. For each possible error, an error production is added. An error is trapped when an error production is used. Assumes that all kinds of errors are known in advance. One error production is needed for each possible error. Error productions are specific to the rules in the grammar. A change in the grammar implies a change of the corresponding error productions. Extremely hard to maintain. Error Recovery Strategies Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Global Correction Ideally, a compiler should make as few changes as possible in processing an incorrect token stream. Global correction is about choosing the minimal sequence of changes to obtain a least-cost correction. Given an incorrect input token stream x, such an algorithm will find a parse tree for a related token stream y, such that the number of insertions, deletions, and changes of tokens required to transform x into y is as small as possible. Too costly to implement. The closest correct program does not carry the meaning intended by the programmer anyway. Can be used as a benchmark for other error correction techniques. Error Recovery Strategies Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Click to edit Master title style Different variations of “panic mode” error recovery Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Variation 1: Given a non-terminal A on top of the stack, skip input tokens until an element of FOLLOW(A) appears in the token stream. Pop A from the stack and resume parsing. Report on the error found and where the parsing was resumed. Variation 2: Given a non-terminal A on top of the stack, skip input tokens until an element of FIRST(A) appears in the token stream. Report on the error found and where the parsing was resumed. Variation 3 If we combine variation 1 and 2, when there is a parse error and a variable A on top of the stack, we skip input tokens until we see either a token in FIRST(A), in which case we simply continue, a token in FOLLOW(A), in which case we pop A off the stack and continue. Report on the error found and where the parsing was resumed. Panic mode error recovery: variations Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Click to edit Master title style Error Recovery in Recursive Descent Predictive Parsers Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Three possible cases: The lookahead symbol is not in FIRST(LHS). If  is in FIRST(LHS) and the lookahead symbol is not in FOLLOW(LHS). The match() function is called in a no match situation. Solution : Create a skipErrors() function that skips tokens until an element of FIRST(LHS) or FOLLOW(LHS) is encountered. Upon entering any parsing function, call skipErrors(). Error Recovery in Recursive Descent Predictive Parsers Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Error Recovery in Recursive Descent Predictive Parsers Joey Paquet, COMP 442/6421 – Compiler Design skipErrors([FIRST],[FOLLOW]) if ( lookahead is in [FIRST] or  is in [FIRST] and lookahead is in [FOLLOW] ) return true // no error detected, parse continues in this parsing function else write (“syntax error at “ lookahead.location) while (lookahead not in [FIRST  FOLLOW] ) lookahead = nextToken() if ( is not in [FIRST] and lookahead is in [FOLLOW]) return false// error detected and parsing function should be aborted return true // error detected and parse continues in this parsing function match(token) if ( lookahead == token ) lookahead = nextToken() return true else write (“syntax error at” lookahead.location. “expected” token) lookahead = nextToken() return false

Concordia University Department of Computer Science and Software Engineering Error Recovery in Recursive Descent Predictive Parsers Joey Paquet, COMP 442/6421 – Compiler Design LHS(){ // LHSRHS1 | RHS2 | … |  if ( !skipErrors( FIRST(LHS),FOLLOW(LHS) ) ) return false; if (lookahead  FIRST(RHS1) ) if (non-terminals()  match(terminals) ) write(“LHSRHS1”) else success = false else if (lookahead  FIRST(RHS2) ) if (non-terminals()  match(terminals) ) write(“LHSRHS2”) else success = false else if … // other right hand sides else if (lookahead  FOLLOW(LHS) ) // only if LHS exists write(“LHS”) else success = false return (success)

Concordia University Department of Computer Science and Software Engineering Example 17COMP 442/6421 – Compiler Design E(){ if ( !skipErrors([0,1,(],[),$]) ) return false; if (lookahead is in [0,1,(]) if (T();E'();) write(E->TE') else success = false return (success) } E'(){ if ( !skipErrors([+],[),$]) ) return false; if (lookahead is in [+]) if (match('+');T();E'()) write(E'->TE') else success = false else if (lookahead is in [$,)] write(E'->epsilon); else success = false return (success) } T(){ if ( !skipErrors([0,1,(],[+,),$]) ) return false; if (lookahead is in [0,1,(]) if (F();T'();) write(T->FT') else success = false return (success) } T'(){ if ( !skipErrors([*],[+,),$]) ) return false; if (lookahead is in [*]) if (match('*');F();T'()) write(T'->*FT') else success = false else if (lookahead is in [+,),$] write(T'->epsilon) else success = false return (success) } F(){ success = skipErrors([0,1,(],[*,+,$,)]) if (lookahead is in [0]) match('0') write(F->0) else if (lookahead is in [1]) match('1') write(F->1) else if (lookahead is in [(]) if (match('(');E();match(')')) write(F->1); else success = false return (success) } Joey Paquet,

Concordia University Department of Computer Science and Software Engineering Click to edit Master title style Error Recovery in Table-Driven Predictive Parsers Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering All empty cells in the table represent the occurrence of a syntax error Each case represents a specific kind of error Task when an empty (error) cell is read: Recover from the error Either pop the stack or scan tokens Output an error message Error Recovery in Table-Driven Predictive Parsers Joey Paquet, COMP 442/6421 – Compiler Design

Concordia University Department of Computer Science and Software Engineering Two possible cases: pop the stack if the next token is in the FOLLOW set of our current non-terminal on top of the stack. scan tokens until we get one with which we can resume the parse. Building the table with error cases Joey Paquet, COMP 442/6421 – Compiler Design skipError(){ // A is top() write (“syntax error at “ lookahead.location) if ( lookahead is $ or in FOLLOW( top() ) ) pop() // pop - equivalent to A   else while ( lookahead  FIRST( top() ) or   FIRST( top() ) and lookahead  FOLLOW( top() ) ) lookahead = nextToken() // scan }

Concordia University Department of Computer Science and Software Engineering Original table, grammar and sets Joey Paquet, COMP 442/6421 – Compiler Design FLW(E): { $,) } FLW(E’) : { $,) } FLW(T) : { +,$,) } FLW(T’) : { +,$,) } FLW(F) : { ,+,$,) } r1: E  TE r2: E  +TE r3: E   r4: T  FT r5: T  FT r6: T   r7: F  0 r8: F  1 r9: F  ( E ) FST(E): { 0,1,( } FST(E’) : { ,+ } FST(T) : { 0,1,( } FST(T’) : { ,* } FST(F) : { 0,1,( } 01()+*$ Er1 E’r3r2r3 Tr4 T’r6 r5r6 Fr7r8r9

Concordia University Department of Computer Science and Software Engineering Parsing table with error actions Joey Paquet, COMP 442/6421 – Compiler Design 01()+*$ Er1 popscan pop E’scan r3r2scanr3 Tr4 pop scanpop T’scan r6 r5r6 Fr7r8r9pop

Concordia University Department of Computer Science and Software Engineering Parsing algorithm Joey Paquet, COMP 442/6421 – Compiler Design parse(){ push($) push(S) a = nextToken() while ( stack  $ ) do x = top() if ( x  T ) if ( x == a ) pop(x) ; a = nextToken() else skipError() ; success = false else if ( TT[x,a]  ‘error’ ) pop(x) ; inverseRHSPush(TT[x,a]) else skipError() ; success = false if ( (a  $)  (success == false ) ) return(false) else return(true)}

Concordia University Department of Computer Science and Software Engineering Parsing example with error recovery Joey Paquet, COMP 442/6421 – Compiler Design StackInputProductionDerivation 1$E0(*1)$E 2$E0(*1)$ r1: E  TE’  TE’ 3$E’T0(*1)$ R4: T  FT’  FT’E’ 4$E’T’F0(*1)$ R7: F  0  0T’E’ 5$E’T’00(*1)$ 6$E’T’(*1)$error - scan 7$E’T’*1)$ r5: T   FT  0*FT’E’ 8$E’T’F**1)$ 9$E’T’F1)$ r8: F  1  0*1T’E’ 10$E’T’11)$ 11$E’T’)$ r6: T   0*1E’ 12$E’)$ r3: E   0*1 13$)$error - end