x86-64 Programming III & The Stack CSE 351 Winter 2018

Slides:



Advertisements
Similar presentations
University of Washington Procedures and Stacks II The Hardware/Software Interface CSE351 Winter 2013.
Advertisements

Machine/Assembler Language Putting It All Together Noah Mendelsohn Tufts University Web:
Machine-Level Programming 3 Control Flow Topics Control Flow Switch Statements Jump Tables.
Machine-Level Programming II: Control Flow Topics Condition codes Conditional branches Loops Switch statements CS 105 “Tour of the Black Holes of Computing”
University of Washington x86 Programming III The Hardware/Software Interface CSE351 Winter 2013.
Machine-Level Programming III: Switch Statements and IA32 Procedures Seoul National University.
1 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition Carnegie Mellon Machine-Level Programming II: Control : Introduction.
Machine-Level Programming III: Procedures Topics IA32 stack discipline Register-saving conventions Creating pointers to local variables CS 105 “Tour of.
Machine-Level Programming 3 Control Flow Topics Control Flow Switch Statements Jump Tables.
1 Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition Carnegie Mellon Machine-Level Programming II: Control Carnegie Mellon.
Bryant and O’Hallaron, Computer Systems: A Programmer’s Perspective, Third Edition Carnegie Mellon Instructor: San Skulrattanakulchai Machine-Level Programming.
Machine-Level Programming II Control Flow Sept. 10, 1998 Topics Control Flow –Varieties of Loops –Switch Statements class06.ppt “The course that.
Lecture 7 Flow of Control Topics Finish Lecture 6 slides Lab 02 = datalab comments Assembly Language flow of control Test 1 – a week from wednesday February.
Spring 2016Assembly Review Roadmap 1 car *c = malloc(sizeof(car)); c->miles = 100; c->gals = 17; float mpg = get_mpg(c); free(c); Car c = new Car(); c.setMiles(100);
Reading Condition Codes (Cont.)
Credits and Disclaimers
CSCE 212 Computer Architecture
Final exam: Wednesday, March 20, 2:30pm
Arrays CSE 351 Spring 2017 Instructor: Ruth Anderson
Roadmap C: Java: Assembly language: OS: Machine code: Computer system:
Roadmap C: Java: Assembly language: OS: Machine code: Computer system:
The Stack & Procedures CSE 351 Spring 2017
x86 Programming I CSE 351 Autumn 2016
The Hardware/Software Interface CSE351 Winter 2013
Machine-Level Programming III: Procedures
Carnegie Mellon Machine-Level Programming III: Switch Statements and IA32 Procedures / : Introduction to Computer Systems 7th Lecture, Sep.
Machine-Level Programming II: Control
Procedures & The Stack Announcements Lab 1 graded HW 2 out
Switch & Procedures & The Stack I CSE 351 Winter 2017
Machine-Level Programming II: Control Flow
x86-64 Programming III & The Stack CSE 351 Winter 2018
x86-64 Programming II CSE 351 Autumn 2017
Carnegie Mellon Machine-Level Programming II: Control : Introduction to Computer Systems 6th Lecture, Sept. 13, 2018.
Machine-Level Programming III: Switch Statements and IA32 Procedures
Assembly Programming IV CSE 351 Spring 2017
x86-64 Programming II CSE 351 Winter 2018
Roadmap C: Java: Assembly language: OS: Machine code: Computer system:
Machine-Level Programming III: Procedures /18-213/14-513/15-513: Introduction to Computer Systems 7th Lecture, September 18, 2018.
Procedures & The Stack I CSE 351 Autumn 2016
Arrays CSE 351 Winter
Carnegie Mellon Machine-Level Programming III: Procedures : Introduction to Computer Systems October 22, 2015 Instructor: Rabi Mahapatra Authors:
Assembly Programming IV CSE 410 Winter 2017
Condition Codes Single Bit Registers
Roadmap C: Java: Assembly language: OS: Machine code: Computer system:
Roadmap C: Java: Assembly language: OS: Machine code: Computer system:
Carnegie Mellon Wrap-up of Machine-Level Programming II: Control : Introduction to Computer Systems Sept. 18, 2018.
Roadmap C: Java: Assembly language: OS: Machine code: Computer system:
CSE 351 Section 10 The END…Almost 3/7/12
Machine-Level Programming V: Control: loops Comp 21000: Introduction to Computer Organization & Systems Systems book chapter 3* * Modified slides from.
x86-64 Programming III & The Stack CSE 351 Autumn 2017
Assembly Programming II CSE 410 Winter 2017
Machine-Level Programming III: Procedures Sept 18, 2001
The Stack & Procedures CSE 351 Winter 2018
x86 Programming III CSE 351 Autumn 2016
The Stack & Procedures CSE 410 Winter 2017
Arrays CSE 351 Winter 2018 Instructor: Mark Wyse Teaching Assistants:
Roadmap C: Java: Assembly language: OS: Machine code: Computer system:
Ithaca College Machine-Level Programming VII: Procedures Comp 21000: Introduction to Computer Systems & Assembly Lang Spring 2017.
x86-64 Programming I CSE 351 Winter 2018
x86-64 Programming III CSE 351 Autumn 2018
x86-64 Programming I CSE 351 Winter 2018
Ithaca College Machine-Level Programming VII: Procedures Comp 21000: Introduction to Computer Systems & Assembly Lang Spring 2017.
Machine-Level Programming V: Control: loops Comp 21000: Introduction to Computer Organization & Systems Systems book chapter 3* * Modified slides from.
CS201- Lecture 8 IA32 Flow Control
Loops, Switches, Intro to Stack & Procedures CSE 351 Winter 2019
x86-64 Programming III CSE 351 Winter 2019
Ithaca College Machine-Level Programming VII: Procedures Comp 21000: Introduction to Computer Systems & Assembly Lang Spring 2017.
Carnegie Mellon Ithaca College
Credits and Disclaimers
Presentation transcript:

x86-64 Programming III & The Stack CSE 351 Winter 2018 Instructor: Mark Wyse Teaching Assistants: Kevin Bi Parker DeWilde Emily Furst Sarah House Waylon Huang Vinny Palaniappan http://xkcd.com/1652/

Administrative Homework 2 (x86) due Monday (1/29) Lab 2 due next Friday (2/2) Midterm: 2/5 You will be provided a fresh reference sheet Must bring your UW Student ID to the exam! Topics are Lectures 1 – 12, Ch 1.0 – 3.7 Review packet / suggested practice problems to be posted soon

x86 Control Flow Condition codes Conditional and unconditional branches Loops Switches

Expressing with Goto Code long absdiff(long x, long y) { long result; if (x > y) result = x-y; else result = y-x; return result; } long absdiff_j(long x, long y) { long result; int ntest = (x <= y); if (ntest) goto Else; result = x-y; goto Done; Else: result = y-x; Done: return result; } cmp jle jmp labels cmp and jle -> conditional jump jmp -> unconditional jump C allows goto as means of transferring control (jump) Closer to assembly programming style Generally considered bad coding style

Compiling Loops Other loops compiled similarly while ( sum != 0 ) { <loop body> } C/Java code: loopTop: testq %rax, %rax je loopDone <loop body code> jmp loopTop loopDone: Assembly code: Test !Test Other loops compiled similarly Will show variations and complications in coming slides, but may skip a few examples in the interest of time Most important to consider: When should conditionals be evaluated? (while vs. do-while) How much jumping is involved?

Compiling Loops What are the Goto versions of the following? Do…while: Test and Body For loop: Init, Test, Update, and Body while ( Test ) { Body } C/Java code: Loop: if ( !Test ) goto Exit; Body goto Loop; Exit: Goto version Loop: Body if (Test) goto Loop; Do…while Init Loop: if (!Test) goto Exit; Body Update goto Loop; Exit: For loop

Compiling Loops While Loop: Do-while Loop: While Loop (ver. 2): C: while ( sum != 0 ) { <loop body> } x86-64: loopTop: testq %rax, %rax je loopDone <loop body code> jmp loopTop loopDone: !Test Test do { <loop body> } while ( sum != 0 ) loopTop: <loop body code> testq %rax, %rax jne loopTop loopDone: x86-64: C: Do-while Loop: Test While Loop (ver. 2): C: while ( sum != 0 ) { <loop body> } x86-64: testq %rax, %rax je loopDone loopTop: <loop body code> jne loopTop loopDone: !Test Do-while loop Test

For Loop → While Loop For Version for (Init; Test; Update) Body Caveat: C and Java have break and continue Conversion works fine for break Jump to same label as loop exit condition But not continue: would skip doing Update, which it should do with for-loops Introduce new label at Update While Version Init; while (Test) { Body Update; } You can expect to see a loop of some form, in assembly, on the midterm!

x86 Control Flow Condition codes Conditional and unconditional branches Loops Switches

Switch Statement Example long switch_ex (long x, long y, long z) { long w = 1; switch (x) { case 1: w = y*z; break; case 2: w = y/z; /* Fall Through */ case 3: w += z; case 5: case 6: w -= z; default: w = 2; } return w; Switch Statement Example Multiple case labels Here: 5 & 6 Fall through cases Here: 2 Missing cases Here: 4 Implemented with: Jump table Indirect jump instruction Switch on some integral value Execute code at the corresponding label, then if a break is present go to instructions following the switch block New concepts: Jump table Indirect jump instruction

Jump Table Structure Switch Form Jump Table Jump Targets JTab: Targ0: switch (x) { case val_0: Block 0 case val_1: Block 1 • • • case val_n-1: Block n–1 } JTab: Targ0 Targ1 Targ2 Targn-1 • Targ0: Code Block 0 Targ1: Code Block 1 Targ2: Code Block 2 Addresses (8-bytes wide) So what is a Jump Table? Like an array of pointers JTab: address of the jump table Approximate Translation • target = JTab[x]; goto target; Targn-1: Code Block n–1

Jump Table Structure Memory C code: switch (x) { case 1: <some code> break; case 2: <some code> case 3: <some code> case 5: case 6: <some code> default: <some code> } Code Blocks Jump table is an array of pointers somewhere in memory Each entry is an address that points to code for corresponding cases Use the jump table when x ≤ 6: 1 2 3 4 5 6 if (x <= 6) target = JTab[x]; goto target; else goto default; Jump Table

Switch Statement Example Register Use(s) %rdi 1st argument (x) %rsi 2nd argument (y) %rdx 3rd argument (z) %rax Return value Switch Statement Example long switch_ex(long x, long y, long z) { long w = 1; switch (x) { . . . } return w; Note compiler chose to not initialize w switch_eg: movq %rdx, %rcx cmpq $6, %rdi # x:6 ja .L8 # default jmp *.L4(,%rdi,8) # jump table Take a look! https://godbolt.org/g/DnOmXb cmpq: comparing x to 6 (x – 6) ja: jump to default case (.L8) if x > 6 (unsigned) Switch has cases 1 through 6 (except 4) Else, execute jmp instruction to perform lookup in the jump table jump above – unsigned > catches negative default cases

Switch Statement Example long switch_ex(long x, long y, long z) { long w = 1; switch (x) { . . . } return w; Jump table .section .rodata .align 8 .L4: .quad .L8 # x = 0 .quad .L3 # x = 1 .quad .L5 # x = 2 .quad .L9 # x = 3 .quad .L8 # x = 4 .quad .L7 # x = 5 .quad .L7 # x = 6 switch_eg: movq %rdx, %rcx cmpq $6, %rdi # x:6 ja .L8 # default jmp *.L4(,%rdi,8) # jump table switch_eg: movq %rdx, %rcx cmpq $6, %rdi # x:6 ja .L8 jmp *.L4(,%rdi,8) Indirect jump: Note that, given the definition of a label x in the .data section of an assembly program, using x to indicate the memory location, as in: mov x,%eax #AT&T is just absolute addressing (i.e. using just a displacement), where the assembler essentially replaces the name x with the address corresponding to x. .quad -> following data is a “quad word”, 8-bytes .L8 -> is an address .L4(,%rdi,8) == D + Ri * S D is address of jump table, Ri is x, S is sizeof(void*) Indirect jump

Assembly Setup Explanation Jump table .section .rodata .align 8 .L4: .quad .L8 # x = 0 .quad .L3 # x = 1 .quad .L5 # x = 2 .quad .L9 # x = 3 .quad .L8 # x = 4 .quad .L7 # x = 5 .quad .L7 # x = 6 Table Structure Each target requires 8 bytes (address) Base address at .L4 Direct jump: jmp .L8 Jump target is denoted by label .L8 Indirect jump: jmp *.L4(,%rdi,8) Start of jump table: .L4 Must scale by factor of 8 (addresses are 8 bytes) Fetch target from effective address .L4 + x*8 Only for 0 ≤ x ≤ 6 Indirect jump uses * notation just like dereference operator! Dereference, then update %rip. .L4(,%rdi,8) is an address. But there is no code there! Instead, we need to get the address stored at that location and jump to the stored address. This is an indirect jump. The effective address holds another address that will be the next instruction to execute.

Jump Table declaring data, not instructions 8-byte memory alignment switch(x) { case 1: // .L3 w = y*z; break; case 2: // .L5 w = y/z; /* Fall Through */ case 3: // .L9 w += z; case 5: case 6: // .L7 w -= z; default: // .L8 w = 2; } .section .rodata .align 8 .L4: .quad .L8 # x = 0 .quad .L3 # x = 1 .quad .L5 # x = 2 .quad .L9 # x = 3 .quad .L8 # x = 4 .quad .L7 # x = 5 .quad .L7 # x = 6 .rodata – read-only data section this data is 64-bits wide

Code Blocks (x == 1) Register Use(s) %rdi 1st argument (x) %rsi 2nd argument (y) %rdx 3rd argument (z) %rax Return value Code Blocks (x == 1) switch(x) { case 1: // .L3 w = y*z; break; . . . } .L3: movq %rsi, %rax # y imulq %rdx, %rax # y*z ret

Handling Fall-Through long w = 1; . . . switch (x) { case 2: // .L5 w = y/z; /* Fall Through */ case 3: // .L9 w += z; break; } case 2: w = y/z; goto merge; case 3: w = 1; More complicated choice than “just fall-through” forced by “migration” of w = 1; Example compilation trade-off merge: w += z;

Code Blocks (x == 2, x == 3) Register Use(s) %rdi 1st argument (x) %rsi 2nd argument (y) %rdx 3rd argument (z) %rax Return value Code Blocks (x == 2, x == 3) long w = 1; . . . switch (x) { case 2: // .L5 w = y/z; /* Fall Through */ case 3: // .L9 w += z; break; } .L5: # Case 2: movq %rsi, %rax # y in rax cqto # Div prep idivq %rcx # y/z jmp .L6 # goto merge .L9: # Case 3: movl $1, %eax # w = 1 .L6: # merge: addq %rcx, %rax # w += z ret

Code Blocks (rest) Register Use(s) %rdi 1st argument (x) %rsi 2nd argument (y) %rdx 3rd argument (z) %rax Return value Code Blocks (rest) switch (x) { . . . case 5: // .L7 case 6: // .L7 w -= z; break; default: // .L8 w = 2; } .L7: # Case 5,6: movl $1, %eax # w = 1 subq %rdx, %rax # w -= z ret .L8: # Default: movl $2, %eax # 2

Roadmap C: Java: Assembly language: OS: Machine code: Computer system: Memory & data Integers & floats x86 assembly Procedures & stacks Executables Arrays & structs Memory & caches Processes Virtual memory Memory allocation Java vs. C car *c = malloc(sizeof(car)); c->miles = 100; c->gals = 17; float mpg = get_mpg(c); free(c); Car c = new Car(); c.setMiles(100); c.setGals(17); float mpg = c.getMPG(); Assembly language: get_mpg: pushq %rbp movq %rsp, %rbp ... popq %rbp ret OS: Machine code: 0111010000011000 100011010000010000000010 1000100111000010 110000011111101000011111 Computer system:

Mechanisms required for procedures Passing control To beginning of procedure code Back to return point Passing data Procedure arguments Return value Memory management Allocate during procedure execution Deallocate upon return All implemented with machine instructions! An x86-64 procedure uses only those mechanisms required for that procedure P(…) { • y = Q(x); print(y) } int Q(int i) { int t = 3*i; int v[10]; • return v[t]; } Procedures take many forms: functions, methods, subroutines, handlers, etc. Well-designed software uses procedures as an abstraction mechanism to hide implementation details and provide a clear interface They all share a set of general features

Procedures Stack Structure Calling Conventions Passing control Passing data Managing local data Register Saving Conventions Illustration of Recursion Here is the outline for our study of procedures. We will only talk about the stack structure today

Simplified Memory Layout 2N-1 High Addresses Stack local variables; procedure context Dynamic Data (Heap) Addr 0x0 is lowest. Addr 0xF…F is highest addr. Stack and Heap are Dynamic over process execution Static Data, Literals, and Instructions/Code are fixed in size variables allocated with new or malloc Memory Addresses Static Data static variables (including global variables (C)) Literals large constants (e.g. “example”) Instructions program code Low Addresses

Memory Permissions segmentation faults? Stack Dynamic Data (Heap) Managed “automatically” (by compiler) writable; not executable Dynamic Data (Heap) writable; not executable Static stuff: Initialized when prog starts Heap managed by programmer Stack usually automatic, but we’ll have to look at how it works in assembly Stack and Heap grow towards each other to maximize use of space HW/OS enforces permissions on regions of memory Will talk about it more later in the course SegFault = accessing memory in a way that is not allowed Managed by programmer Static Data writable; not executable Initialized when process starts Literals read-only; not executable Initialized when process starts Instructions read-only; executable Initialized when process starts

x86-64 Stack Stack “Bottom” High Addresses Stack Pointer: %rsp Stack “Top” Stack “Bottom” Region of memory managed with stack “discipline” Grows toward lower addresses Customarily shown “upside-down” Register %rsp contains lowest stack address %rsp = address of top element, the most-recently-pushed item that is not-yet-popped Increasing Addresses Memory is still just an array But we treat the upper part like a stack that grows downwards Last In First Out rsp points at the lowest stack address, the top of the stack This will be the first thing to be popped off + to rsp is “shrinking” the stack - from rsp is “growing” the stack Remember: stacks only have 2 operations: push and pop Stack Grows Down Low Addresses 0x00…00

x86-64 Stack: Push pushq src Example: Stack “Bottom” High Addresses Stack “Bottom” pushq src Fetch operand at src Src can be reg, memory, immediate Decrement %rsp by 8 Store value at address given by %rsp Example: pushq %rcx Adjust %rsp and store contents of %rcx on the stack Increasing Addresses Stack Grows Down Stack Pointer: %rsp Stack “Top” -8 Low Addresses 0x00…00

x86-64 Stack: Pop popq dst Stack “Bottom” High Addresses Stack “Bottom” popq dst Load value at address given by %rsp Store value at dst (must be register) Increment %rsp by 8 Example: popq %rcx Stores contents of top of stack into %rcx and adjust %rsp Increasing Addresses Stack Grows Down +8 Stack Pointer: %rsp Stack “Top” Low Addresses 0x00…00 Those bits are still there; we’re just not using them.