Presentation is loading. Please wait.

Presentation is loading. Please wait.

COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Hao Ji.

Similar presentations


Presentation on theme: "COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Hao Ji."— Presentation transcript:

1 COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Hao Ji

2 Review Last Class Conditional Instructions Procedure This Class Handling Character Data Starting and Loading a Program Linking Dynamic Linking Multiplication Quiz

3 Character Data Byte-encoded character sets ASCII (American Standard Code for Information Interchange): 128 characters 95 graphic, 33 control Latin-1: 256 characters ASCII, +96 more graphic characters Unicode: 32-bit character set Used in Java, C++ wide characters, … Most of the world’s alphabets, plus symbols UTF-16, UTF-32: variable-length encodings §2.9 Communicating with People

4 ASCII

5 Character Data String “Cal” 4 bytes the decimal numbers: 67,97,108,0

6 Unicode

7 Byte Operation Load Byte (lb) Load a byte from memory Place it in the rightmost 8 bits of a register Example lb $t0, 0($sp) Store Byte (sb) Take a byte from the rightmost 8 bits of a register Write it to the memory Example sb $t0, 0($sp)

8 Byte/Halfword Operations Could use bitwise operations MIPS byte/halfword load/store String processing is a common case lb rt, offset(rs) lh rt, offset(rs) Sign extend to 32 bits in rt lbu rt, offset(rs) lhu rt, offset(rs) Zero extend to 32 bits in rt sb rt, offset(rs) sh rt, offset(rs) Store just rightmost byte/halfword

9 String Copy Example C code (naïve): Null-terminated string void strcpy (char x[], char y[]) { int i; i = 0; while ((x[i]=y[i])!='\0') i += 1; } Addresses of x, y in $a0, $a1 i in $s0

10 String Copy Example MIPS code: strcpy: addi $sp, $sp, -4 # adjust stack for 1 item sw $s0, 0($sp) # save $s0 add $s0, $zero, $zero # i = 0 L1: add $t1, $s0, $a1 # addr of y[i] in $t1 lbu $t2, 0($t1) # $t2 = y[i] add $t3, $s0, $a0 # addr of x[i] in $t3 sb $t2, 0($t3) # x[i] = y[i] beq $t2, $zero, L2 # exit loop if y[i] == 0 addi $s0, $s0, 1 # i = i + 1 j L1 # next iteration of loop L2: lw $s0, 0($sp) # restore saved $s0 addi $sp, $sp, 4 # pop 1 item from stack jr $ra # and return

11 Translation and Startup Many compilers produce object modules directly Static linking

12 Assembler Pseudoinstructions Most assembler instructions represent machine instructions one-to-one Pseudoinstructions: figments of the assembler’s imagination For simplifying translation and programming move $t0, $t1 → add $t0, $zero, $t1 blt $t0, $t1, L → slt $at, $t0, $t1 bne $at, $zero, L $at (register 1): assembler temporary

13 Producing an Object Module Assembler (or compiler) translates program into machine instructions Provides information for building a complete program from the pieces Header: described contents of object module Text segment: translated instructions Static data segment: data allocated for the life of the program Relocation info: for contents that depend on absolute location of loaded program Symbol table: global definitions and external refs Debug info: for associating with source code

14 Linking Object Modules Produces an executable image 1.Merges segments 2.Resolve labels (determine their addresses) 3.Patch location-dependent and external refs Could leave location dependencies for fixing by a relocating loader But with virtual memory, no need to do this Program can be loaded into absolute location in virtual memory space

15 Loading a Program Load from image file on disk into memory 1.Read header to determine segment sizes 2.Create virtual address space 3.Copy text and initialized data into memory Or set page table entries so they can be faulted in 4.Set up arguments on stack 5.Initialize registers (including $sp, $fp, $gp) 6.Jump to startup routine Copies arguments to $a0, … and calls main When main returns, do exit syscall

16 Dynamic Linking Statically linked program Keeping using the older version (library) All routines in the library are loaded. Only link/load library procedure when it is called Requires procedure code to be relocatable Avoids image bloat caused by static linking of all (transitively) referenced libraries Automatically picks up new library versions

17 Starting Java Applications Simple portable instruction set for the JVM Interprets bytecodes Compiles bytecodes of “hot” methods into native code for host machine

18 Time for a Break (10 mins)

19 Review Last Session Handling Character Data Starting and Loading a Program Linking Dynamic Linking This Class Multiplication Quiz

20 Chapter 3

21 Sequential Version of Multiplication Algorithm and Hardware Start with long-multiplication approach 1000 × 1001 1000 0000 1000 1001000 Length of product is the sum of operand lengths multiplicand multiplier product

22 Multiplication Hardware First version of Multiplication Hardware 64-bit ALU 64-bit Multiplicand 64-bit product (initialized to 0) 32-bit multiplier

23 Multiplication Hardware Initially 0 How many steps? 32 Each step has two operations Shift Add

24 Example 2*3

25 Refined Multiplier 32-bit multiplicand and ALU Multiplier is placed in the right half of the product register 64-bit product register

26 Faster Multiplier Uses multiple adders Cost/performance tradeoff Done in 5 steps But needs 31 ALUs Can be pipelined Several multiplication performed in parallel

27 MIPS Multiplication Two 32-bit registers for product HI: most-significant 32 bits LO: least-significant 32-bits Instructions mult rs, rt / multu rs, rt 64-bit product in HI/LO mfhi rd / mflo rd Move from HI/LO to rd Can test HI value to see if product overflows 32 bits mul rd, rs, rt Least-significant 32 bits of product –> rd

28 Back to the Example A better way to do 2*3? 3 << 1 using SLL Almost all compilers will perform a strength reduction optimization by replacing multiplication of power of 2 by shifting

29 What I want you to do Review Chapters 3.3 and 3.4 Work on your assignment 3


Download ppt "COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Hao Ji."

Similar presentations


Ads by Google