Code Shape IV Procedure Calls & Dispatch Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved. Students enrolled in Comp 412.

Slides:



Advertisements
Similar presentations
Procedures in more detail. CMPE12cGabriel Hugh Elkaim 2 Why use procedures? –Code reuse –More readable code –Less code Microprocessors (and assembly languages)
Advertisements

Computer Architecture CSCE 350
The University of Adelaide, School of Computer Science
Prof. Necula CS 164 Lecture 141 Run-time Environments Lecture 8.
Subprogram Control - Data sharing Mechanisms to exchange data Arguments - data objects sent to a subprogram to be processed. Obtained through  parameters.
CSCI 330: Programming Language Concepts Instructor: Pranava K. Jha
(1) ICS 313: Programming Language Theory Chapter 10: Implementing Subprograms.
Code Shape, Part II Addressing Arrays, Aggregates, & Strings Comp 412 Copyright 2010, Keith D. Cooper & Linda Torczon, all rights reserved. Students enrolled.
Code Shape IV Procedure Calls & Dispatch Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved. Students enrolled in Comp 412.
1 Storage Registers vs. memory Access to registers is much faster than access to memory Goal: store as much data as possible in registers Limitations/considerations:
Procedures in more detail. CMPE12cCyrus Bazeghi 2 Procedures Why use procedures? Reuse of code More readable Less code Microprocessors (and assembly languages)
1 Handling nested procedures Method 1 : static (access) links –Reference to the frame of the lexically enclosing procedure –Static chains of such links.
CS 536 Spring Run-time organization Lecture 19.
Code Shape II Expressions & Arrays Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved.
The Procedure Abstraction Part III: Allocating Storage & Establishing Addressability Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all.
3/17/2008Prof. Hilfinger CS 164 Lecture 231 Run-time organization Lecture 23.
ISBN Chapter 10 Implementing Subprograms.
CS 536 Spring Code generation I Lecture 20.
Honors Compilers Addressing of Local Variables Mar 19 th, 2002.
Chapter 10 Storage Management Implementation details beyond programmer’s control Storage/CPU time trade-off Binding times to storage.
Run-Time Storage Organization
The Procedure Abstraction Part I: Basics Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved. Students enrolled in Comp 412.
Run time vs. Compile time
The environment of the computation Declarations introduce names that denote entities. At execution-time, entities are bound to values or to locations:
Run-time Environment and Program Organization
1 Run time vs. Compile time The compiler must generate code to handle issues that arise at run time Representation of various data types Procedure linkage.
The Procedure Abstraction Part II: Symbol Tables, Storage Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved. Students enrolled.
Chapter 8 :: Subroutines and Control Abstraction
Chapter 7: Runtime Environment –Run time memory organization. We need to use memory to store: –code –static data (global variables) –dynamic data objects.
The Procedure Abstraction Part I: Basics Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved. Students enrolled in Comp 412.
Runtime Environments What is in the memory? Runtime Environment2 Outline Memory organization during program execution Static runtime environments.
Runtime Environments Compiler Construction Chapter 7.
Programming Language Principles Lecture 24 Prepared by Manuel E. Bermúdez, Ph.D. Associate Professor University of Florida Subroutines.
Compiler Construction
CSc 453 Runtime Environments Saumya Debray The University of Arizona Tucson.
The Procedure Abstraction, Part VI: Inheritance in OOLs Comp 412 Copyright 2010, Keith D. Cooper & Linda Torczon, all rights reserved. Students enrolled.
The Procedure Abstraction, Part V: Support for OOLs Comp 412 Copyright 2010, Keith D. Cooper & Linda Torczon, all rights reserved. Students enrolled in.
Copyright © 2005 Elsevier Chapter 8 :: Subroutines and Control Abstraction Programming Language Pragmatics Michael L. Scott.
Activation Records CS 671 February 7, CS 671 – Spring The Compiler So Far Lexical analysis Detects inputs with illegal tokens Syntactic analysis.
1 Control Abstraction (Section ) CSCI 431 Programming Languages Fall 2003 A compilation of material developed by Felix Hernandez-Campos and Michael.
COP4020 Programming Languages Subroutines and Parameter Passing Prof. Xin Yuan.
Activation Records (in Tiger) CS 471 October 24, 2007.
1 CIS 461 Compiler Design and Construction Winter 2012 Lecture-Module #16 slides derived from Tevfik Bultan, Keith Cooper, and Linda Torczon.
Implementing Subprograms What actions must take place when subprograms are called and when they terminate? –calling a subprogram has several associated.
Run-Time Storage Organization Compiler Design Lecture (03/23/98) Computer Science Rensselaer Polytechnic.
RUN-Time Organization Compiler phase— Before writing a code generator, we must decide how to marshal the resources of the target machine (instructions,
The Procedure Abstraction, Part IV Addressability Comp 412 Copyright 2010, Keith D. Cooper & Linda Torczon, all rights reserved. Students enrolled in Comp.
國立台灣大學 資訊工程學系 薛智文 98 Spring Run Time Environments (textbook ch# 7.1–7.3 )
CSC 8505 Compiler Construction Runtime Environments.
RUNTIME ENVIRONMENT AND VARIABLE BINDINGS How to manage local variables.
Implementing Subprograms
Procedures and Functions Procedures and Functions – subprograms – are named fragments of program they can be called from numerous places  within a main.
The Procedure Abstraction Part III: Symbol Tables, Storage Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved. Students.
The Procedure Abstraction, Part III Storage Layout & Addressability Comp 412 Copyright 2010, Keith D. Cooper & Linda Torczon, all rights reserved. Students.
LECTURE 3 Translation. PROCESS MEMORY There are four general areas of memory in a process. The text area contains the instructions for the application.
LECTURE 19 Subroutines and Parameter Passing. ABSTRACTION Recall: Abstraction is the process by which we can hide larger or more complex code fragments.
Code Generation Instruction Selection Higher level instruction -> Low level instruction Register Allocation Which register to assign to hold which items?
Lecture 3 Translation.
Storage Classes There are three places in memory where data may be placed: In Data section declared with .data in assembly language in C - Static) On the.
The Procedure Abstraction Part III: Symbol Tables, Storage
The Procedure Abstraction Part IV: Allocating Storage & Establishing Addressability Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights.
Chapter 9 :: Subroutines and Control Abstraction
Chap. 8 :: Subroutines and Control Abstraction
Chap. 8 :: Subroutines and Control Abstraction
The Procedure Abstraction Part I: Basics
Code Shape IV Procedure Calls & Dispatch
Runtime Environments What is in the memory?.
Where is all the knowledge we lost with information? T. S. Eliot
Procedure Linkages Standard procedure linkage Procedure has
Presentation transcript:

Code Shape IV Procedure Calls & Dispatch Copyright 2003, Keith D. Cooper, Ken Kennedy & Linda Torczon, all rights reserved. Students enrolled in Comp 412 at Rice University have explicit permission to make copies of these materials for their personal use.

Procedure Linkages Standard procedure linkage procedure p prolog epilog pre-call post-return procedure q prolog epilog Each procedure has a standard prolog (tasks to be performed after entry) standard epilog (tasks to be performed before exit) Furthermore, each call involves a pre-call sequence (such as storing arguments and the return address) post-return sequence (such as saving the return value) These are completely determined by the actual call (particularly, the number, type, and nature of the actual parameters (arguments)

Implementing Procedure Calls If p calls q … In the code for p, compiler emits pre-call sequence  Evaluates each parameter & stores it appropriately  Loads the return address from a label  (with access links) sets up q ‘s access link  Branches to the entry of q In the code for p, compiler emits post-return sequence  Copy return value into appropriate location  Free q ‘s AR (activation record), if needed  Resume p ‘s execution Invariant parts of pre-call sequence might be moved into the prolog

Implementing Procedure Calls If p calls q … In the prolog, q must  Set up its execution environment  (with display) update the display entry for its lexical level  Allocate space for its ( AR &) local variables & initialize them  If q calls other procedures, save the return address  Establish addressability for static data area(s) In the epilog, q must  Store return value (unless “return” statement already did so)  (with display) restore the display entry for its lexical level  Restore the return address (if saved )  Begin restoring p ’s environment  Load return address and branch to it

Implementing Procedure Calls If p calls q, one of them must Preserve register values ( caller-saves versus callee saves )  Caller-saves registers stored/restored by p in p ‘s AR  Callee-saves registers stored/restored by q in q ‘s AR Allocate the AR  Heap allocation  callee allocates its own AR  Stack allocation  caller & callee cooperate to allocate AR Space tradeoff Pre-call & post-return occur on every call Prolog & epilog occur once per procedure More calls than procedures  Moving operations into prolog/epilog saves space

Implementing Procedure Calls If p calls q, one of them must Preserve register values (caller-saves versus callee saves) If space is an issue Moving code to prolog & epilog saves space As register sets grow, save/restore code does, too  Each saved register costs 2 operations  Can use a library routine to save/restore  Pass it a mask to determine actions & pointer to space  Hardware support for save/restore or storeM/loadM Can decouple who saves from what is saved 32, 64, 128, 256

Implementing Procedure Calls If p calls q, one of them must Preserve register values (caller-saves versus callee saves) If space is an issue All saves in prolog, all restores in epilog  Caller provides a bit mask for caller-saves registers  Callee provides a bit mask for callee-saves registers  Store all of them in same AR ( either caller or callee )  Efficient use of time and code space  May waste some register save space in the AR Caller-save & callee-save assign responsibility not work

Examples VAX used callee saving of general purpose registers, and automatic save and restore of certain special purpose registers such as SP and FP. R0,R1 which were expected to be used to hold the function return value. SPARC uses overlapping register windows, but only 32 are visible at any one time. For each procedure, 8 registers are global, 8 hold inparams (& possibly temps), 8 hold locals, 8 hold outparams (& possibly temps). Any procedure wishing not to follow the standard plan must do caller saving. Sparc has a separate floating point processor w 32 FP registers, but it does not directly address memory nor participate directly in parameter passing and calls. MIPS also has 32 general purpose registers, but it also has special register pair for multiply and divide, and 32s.p./16d.p. floating point registers. Floating point instructions automatically reference these registers. Certain registers are expected to be unsaved temps (caller can save), fcn results, fcn args, and callee saved temps.

Implementing Procedure Calls Evaluating parameters Call by reference  evaluate parameter to an lvalue Call by value  evaluate parameter to an rvalue & store it Aggregates, arrays, & strings are usually c-b-r Language definition issues Alternative is copying them at each procedure call ( $$ )  Small structures can be passed in registers ( in & out )  Can pass large c-b-v objects c-b-r and copy on modification AIX does this for C

Implementing Procedure Calls Evaluating parameters Call by reference  evaluate parameter to an lvalue Call by value  evaluate parameter to an rvalue & store it Procedure-valued parameters Must pass starting address of procedure With access links, need the lexical level as well  Procedure value is a tuple  May also need shared data areas (file-level scopes )  In-file & out-of-file calls have (slightly ) different costs  This lets the caller set up the appropriate access link

Implementing Procedure Calls What about arrays as actual parameters? Whole arrays, as call-by-reference parameters Callee needs dimension information  build a dope vector Store the values in the calling sequence Pass the address of the dope vector in the parameter slot Generate complete address polynomial at each reference Some improvement is possible Save len i and low i rather than low i and high i Pre-compute the fixed terms in prologue sequence What about call-by-value? Most c-b-v languages pass arrays by reference This is a language design low 1 high 1 low 2 high 2

Implementing Procedure Calls What about A[12] as an actual parameter? If corresponding parameter is a scalar, it’s easy Pass the address or value, as needed Must know about both formal & actual parameter Language definition must force this interpretation What is corresponding parameter is an array? Must know about both formal & actual parameter Meaning must be well-defined and understood Cross-procedural checking of conformability  Again, we’re treading on language design issues Fortran 77 lets amazing things happen in this case…

An Aside That Doesn’t Fit Well Anywhere … What about code for access to variable-sized arrays? Local arrays dimensioned by actual parameters Same set of problems as parameter arrays Requires dope vectors ( or equivalent )  Place dope vector at fixed offset in activation record  Different access costs for textually similar references This presents lots of opportunities for a good optimizer Common subexpressions in the address polynomial Contents of dope vector are fixed during each activation Should be able to recover much of the lost ground  Handle them like parameter arrays

Implementing Procedure Calls What about a string-valued argument? Call by reference  pass a pointer to the start of the string  Works with either length/contents or null-terminated string Call by value  copy the string & pass it  Can store it in caller’s AR or callee’s AR  Callee’s AR works well with stack-allocated ARs  Can pass by reference & have callee copy it if necessary … Pointer functions as a “descriptor” for the string, stored in the appropriate location (register or slot in the AR)

Implementing Procedure Calls What about a structure-valued parameter? Again, pass a descriptor Call by reference  descriptor (pointer) refers to original Call by value  create copy & pass its descriptor  Can allocate it in either caller’s AR or callee’s AR  Callee’s AR works well with stack-allocated ARs  Can pass by reference & have callee copy it if necessary … If it is actually an array of structures, then use a dope vector If it is an element of an array of structures, then …

What About Calls in an OOL (Dispatch)? In an OOL, most calls are indirect calls Compiled code does not contain address of callee  Finds it by indirection through class’ method table  Required to make subclass calls find right methods  Code compiled in class C cannot know of subclass methods that override methods in C and C ‘s superclasses In the general case, need dynamic dispatch  Map method name to a search key  Perform a run-time search through hierarchy  Start with object’s class, search for 1 st occurrence of key  This can be expensive  Use a method cache to speed search  Cache holds How big? Bigger  more hits & longer search Smaller  fewer hits, faster search

What About Calls in an OOL (Dispatch)? Improvements are possible in special cases If class has no subclasses, can generate direct call  Class structure must be static or class must be FINAL If class structure is static  Can generate complete method table for each class  Single indirection through class pointer ( 1 or 2 operations )  Keeps overhead at a low level If class structure changes infrequently  Build complete method tables at run time  Initialization & any time class structure changes If running program can create new classes, …  Well, not all things can be done quickly

What About Calls in an OOL (Dispatch)? Unusual issues in OOL call Need to pass receiver’s object record as (1 st ) parameter  Becomes self or this Typical OOL has lexical scoping in method  Limited to block-style scoping  no need for access links  Can overlay successive blocks in same method ( reuse ) Method needs access to its class  Object record has static pointer to superclass, and so on …  Class pointers don’t need updating like access-links Method is a full-fledged procedure  It still needs an AR …  Can often stack allocate them ( HotSpot does … )

What About setjmp() and longjmp() ? Unix system calls to implement abnormal returns Setjmp() stores a descriptor for use with longjmp() Invoking longjump(d ) causes execution to continue at the point after the setjump() call that created d How can we implement setjmp() & longjmp() ? Setjmp() must store ARP and return address in descriptor  What about values of registers and variables?  If they are to be preserved, must compute a closure Longjmp() must restore environment at setjmp()  Restore ARP & discard ARs creates since setjmp()  Cheap with stack-allocated ARs, might cost more with heap