CDA 3101 Fall 2013 Introduction to Computer Organization

Slides:

Advertisements

Similar presentations

Spring 2013 Advising Starts this week! CS2710 Computer Organization1.

Advertisements

Fixed Point Numbers The binary integer arithmetic you are used to is known by the more general term of Fixed Point arithmetic. Fixed Point means that we.

Lecture 16: Computer Arithmetic Today’s topic –Floating point numbers –IEEE 754 representations –FP arithmetic Reminder –HW 4 due Monday 1.

Floating Point Numbers

1 CS61C L9 Fl. Pt. © UC Regents CS61C - Machine Structures Lecture 9 - Floating Point, Part I September 27, 2000 David Patterson

Integer Multiplication (1/3)

CS61C L11 Floating Point II (1) Garcia, Fall 2005 © UCB Lecturer PSOE, new dad Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C.

1 Lecture 9: Floating Point Today’s topics:  Division  IEEE 754 representations  FP arithmetic Reminder: assignment 4 will be posted later today.

Inst.eecs.berkeley.edu/~cs61c CS61CL : Machine Structures Lecture #6 – Number Representation, IEEE FP Greet class Jeremy Huddleston.

CS 61C L16 : Floating Point II (1) Kusalo, Spring 2005 © UCB Lecturer NSOE Steven Kusalo inst.eecs.berkeley.edu/~cs61c CS61C : Machine Structures Lecture.

CS 61C L16 : Floating Point II (1) Garcia, Spring 2004 © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C.

CS 61C L15 Floating Point I (1) Garcia, Spring 2004 © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C :

CS61C L14 MIPS Instruction Representation II (1) Garcia, Spring 2007 © UCB Google takes on Office!  Google Apps: premium “services” ( , instant messaging,

CS61C L15 Floating Point I (1) Garcia, Fall 2011 © UCB Lecturer SOE Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C : Machine.

CS 61C L15 Floating Point I (1) Krause, Spring 2005 © UCB This day in history… TA Danny Krause inst.eecs.berkeley.edu/~cs61c-td inst.eecs.berkeley.edu/~cs61c.

CS61C L16 Floating Point II (1) Garcia, Fall 2006 © UCB Prof Smoot given Nobel!!  Prof George Smoot joins the ranks of the most distinguished faculty.

CS61C L16 Floating Point II (1)Mowery, Spring 2008 © UCB TA Oddinaire Keaton Mowery inst.eecs.berkeley.edu/~cs61c UC Berkeley.

CS 61C L15 Floating Point I (1) Garcia, Fall 2004 © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C : Machine.

CSE 378 Floating-point1 How to represent real numbers In decimal scientific notation –sign –fraction –base (i.e., 10) to some power Most of the time, usual.

CS61C L11 Floating Point II (1) Beamer, Summer 2007 © UCB Scott Beamer, Instructor inst.eecs.berkeley.edu/~cs61c CS61C : Machine Structures Lecture #11.

COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Hao Ji.

CS 61C L10 Floating Point (1) A Carle, Summer 2005 © UCB inst.eecs.berkeley.edu/~cs61c/su05 CS61C : Machine Structures Lecture #10: Floating Point

CS 61C L16 : Floating Point II (1) Garcia, Fall 2004 © UCB Lecturer PSOE Dan Garcia inst.eecs.berkeley.edu/~cs61c CS61C.

IT 251 Computer Organization and Architecture Introduction to Floating Point Numbers Chia-Chi Teng.

Information Representation (Level ISA3) Floating point numbers.

Computer Organization and Architecture Computer Arithmetic Chapter 9.

Computer Arithmetic Nizamettin AYDIN

Csci 136 Computer Architecture II – Floating-Point Arithmetic

CEN 316 Computer Organization and Design Computer Arithmetic Floating Point Dr. Mansour AL Zuair.

CS61C L11 Floating Point (1) Pearce, Summer 2010 © UCB inst.eecs.berkeley.edu/~cs61c UCB CS61C : Machine Structures Lecture 11 Floating Point

Computing Systems Basic arithmetic for computers.

Floating Point. Agenda  History  Basic Terms  General representation of floating point  Constructing a simple floating point representation  Floating.

Floating Point (a brief look) We need a way to represent –numbers with fractions, e.g., –very small numbers, e.g., –very large numbers,

CPS3340 COMPUTER ARCHITECTURE Fall Semester, /14/2013 Lecture 16: Floating Point Instructor: Ashraf Yaseen DEPARTMENT OF MATH & COMPUTER SCIENCE.

Lecture 9: Floating Point

Floating Point Representation for non-integral numbers – Including very small and very large numbers Like scientific notation – –2.34 × –

CSCI-365 Computer Organization Lecture Note: Some slides and/or pictures in the following are adapted from: Computer Organization and Design, Patterson.

FLOATING POINT ARITHMETIC P&H Slides ECE 411 – Spring 2005.

CS 61C L3.2.1 Floating Point 1 (1) K. Meinz, Summer 2004 © UCB CS61C : Machine Structures Lecture Floating Point Kurt Meinz inst.eecs.berkeley.edu/~cs61c.

CHAPTER 3 Floating Point.

CSE 340 Simulation Modeling | MUSHFIQUR ROUF CSE340:

10/7/2004Comp 120 Fall October 7 Read 5.1 through 5.3 Register! Questions? Chapter 4 – Floating Point.

King Fahd University of Petroleum and Minerals King Fahd University of Petroleum and Minerals Computer Engineering Department Computer Engineering Department.

UNIT 1 INTRODUCTION NUMBER SYSTEMS AND CONVERSION Click the mouse to move to the next page. Use the ESC key to exit this chapter. This chapter in the book.

Floating Point Representations

Morgan Kaufmann Publishers Arithmetic for Computers

Computer Architecture & Operations I

Floating Point Numbers: x 10-18

April 2006 Saeid Nooshabadi

Floating Point Arithmetics

Arithmetic for Computers

Number Representations

inst.eecs.berkeley.edu/~cs61c UC Berkeley CS61C : Machine Structures

CS 61C: Great Ideas in Computer Architecture Floating Point Arithmetic

How to represent real numbers

Computer Arithmetic Multiplication, Floating Point

Hello to Robb Neuschwander listening from Louisville, KY!

October 17 Chapter 4 – Floating Point Read 5.1 through 5.3 1/16/2019

Chapter 3 Arithmetic for Computers

Floating Point Numbers

Lecturer PSOE Dan Garcia

Topic 3d Representation of Real Numbers

Number Representations

CDA 3101 Spring 2016 Introduction to Computer Organization

Presentation transcript:

CDA 3101 Fall 2013 Introduction to Computer Organization Floating Point Theory, Notation, MIPS 30 September, 2 October 2013

Overview Floating point numbers Scientific notation Decimal scientific notation Binary scientific notation IEEE 754 FP Standard Floating point representation inside a computer Greater range vs. precision Decimal to Floating Point conversion Type is not associated with data MIPS floating point instructions, registers

Computer Numbers Computers are made to deal with numbers What can we represent in n bits? Unsigned integers: 0 to 2n - 1 Signed integers: -2(n-1) to 2(n-1) - 1 What about other numbers? Very large numbers? (seconds/century) 3,155,760,00010 (3.1557610 x 109) Very small numbers? (atomic diameter) 0.0000000110 (1.010 x 10-8) Rationals (repeating pattern) 2/3 (0.666666666. . .) Irrationals: 21/2 (1.414213562373. . .) Transcendentals : e (2.718...),  (3.141...)

Scientific Notation exponent mantissa 6.02 x 1023 radix (base) decimal point radix (base) Normalized form: no leadings 0s (exactly one digit to left of decimal point) Alternatives to representing 1/1,000,000,000 Normalized: 1.0 x 10-9 Not normalized: 0.1 x 10-8, 10.0 x 10-10

Binary Scientific Notation Exponent Mantissa 1.0two x 2-1 “binary point” radix (base) Floating point arithmetic Binary point is not fixed (as it is for integers) Declare such variable in C as float

Floating Point Representation Normal format: +1.xxxxxxxxxxtwo*2yyyytwo Multiple of Word Size (32 bits) 31 S Exponent 30 23 22 Significand 1 bit 8 bits 23 bits S represents Sign Exponent represents y’s Significand represents x’s Represent numbers as small as 2.0 x 10-38 to as large as 2.0 x 1038

Overflow and Underflow Result is too large (> 2.0x1038 ) Exponent larger than represented in 8-bit Exponent field Underflow Result is too small >0, < 2.0x10-38 Negative exponent larger than represented in 8-bit Exponent field How to reduce chances of overflow or underflow?

Double Precision FP Use two words (64 bits) 31 30 20 19 S Exponent Significand 1 bit 11 bits 20 bits Significand (cont’d) 32 bits C variable declared as double Represent numbers almost as small as 2.0 x 10-308 to almost as large as 2.0 x 10308 Primary advantage is greater accuracy (52 bits)

Floating Point Representation Normalized scientific notation: +1.xxxxtwo*2yyyytwo 31 S Exponent 30 23 22 Significand 1 bit 8 bits 23 bits Single Precision Significand (cont’d) S Exponent 20 19 Significand 1 bit 11 bits 20 bits 32 bits 31 30 Double Precision Exponent: biased notation Significand: sign – magnitude notation Bias 127 (SP) 1023 (DP)

IEEE 754 FP Standard Used in almost all computers (since 1980) Porting of FP programs Quality of FP computer arithmetic Sign bit: Significand: Leading 1 implicit for normalized numbers 1 + 23 bits single, 1 + 52 bits double always true: 0 < Significand < 1 0 has no leading 1 Reserve exponent value 0 just for number 0 1 means negative 0 means positive (-1)S * (1 + Significand) * 2Exp

IEEE 754 Exponent Use FP numbers even without FP hardware Sort records with FP numbers using integer compares Break FP number into 3 parts: compare signs, then compare exponents, then compare significands Faster (single comparison, ideally) Highest order bit is sign ( negative < positive) Exponent next, so big exponent => bigger # Significand last: exponents same => bigger #

Biased Notation for Exponents Two’s complement does not work for exponent Most negative exponent: 00000001two Most positive exponent: 11111110two Bias: number added to real exponent 127 for single precision 1023 for double precision 1.0 * 2-1 0 0111 1110 0000 0000 0000 0000 0000 000 (-1)S * (1 + Significand) * 2(Exponent - Bias)

Significand Method 1 (Fractions): Method 2 (Place Values): In decimal: 0.34010 => 34010/100010 => 3410/10010 In binary: 0.1102 => 1102/10002 (610/810) => 112/1002 (310/410) Helps understand the meaning of the significand Method 2 (Place Values): Convert from scientific notation In decimal: 1.6732 = (1x100) + (6x10-1) + (7x10-2) + (3x10-3) + (2x10-4) In binary: 1.1001 = (1x20) + (1x2-1) + (0x2-2) + (0x2-3) + (1x2-4) Interpretation of value in each position extends beyond the decimal/binary point Good for quickly calculating significand value Use this method for translating FP numbers

Binary to Decimal FP Sign: 0 => positive Exponent: Significand: 0110 1000 101 0101 0100 0011 0100 0010 Sign: 0 => positive Exponent: 0110 1000two = 104ten Bias adjustment: 104 - 127 = -23 Significand: 1 + 1x2-1+ 0x2-2 + 1x2-3 + 0x2-4 + 1x2-5 +... =1+2-1+2-3 +2-5 +2-7 +2-9 +2-14 +2-15 +2-17 +2-22 = 1.0 + 0.666115 Represents: 1.666115*2-23 ~ 1.986*10-7

Decimal to Binary FP (1/2) Simple Case: If denominator is a power of 2 (2, 4, 8, 16, etc.), then it’s easy. Example: Binary FP representation of -0.75 -0.75 = -3/4 -11two/100two = -0.11two Normalized to -1.1two x 2-1 (-1)S x (1 + Significand) x 2(Exponent-127) (-1)1 x (1 + .100 0000 ... 0000) x 2(126-127) 1 0111 1110 100 0000 0000 0000 0000 0000

Decimal to Binary FP (2/2) Denominator is not an exponent of 2 Number can not be represented precisely Lots of significand bits for precision Difficult part: get the significand Rational numbers have a repeating pattern Conversion Write out binary number with repeating pattern. Cut it off after correct number of bits (different for single vs. double precision). Derive sign, exponent and significand fields.

Decimal to Binary Significand: 101 0101 0101 0101 0101 0101 - 3 . 3 3 3 3 3 3… 0.33333333 x 2 0 .66666666 1 1 . 0 1 0 1 0 1 0 . . . - 0.66666666 x 2 1 .33333332 0.33333332 x 2 0 .66666664 => - 1.1010101.. x 21 Significand: 101 0101 0101 0101 0101 0101 Sign: negative => 1 Exponent: 1+ 127 = 128ten = 1000 0000two 1 1000 0000 101 0101 0101 0101 0101 0101

Types and Data 1.986 *10-7 878,003,010 “4UCB” ori $s5, $v0, 17218 0011 0100 0101 0101 0100 0011 0100 0010 1.986 *10-7 878,003,010 “4UCB” ori $s5, $v0, 17218 Data can be anything; operation of instruction that accesses operand determines its type! Power/danger of unrestricted addresses/pointers: Use ASCII as FP, instructions as data, integers as instructions, ... Security holes in programs

Expressible Negative Numbers Expressible Positive Numbers IEEE 754 Special Values Negative Underflow Positive Underflow Negative Overflow Expressible Negative Numbers Expressible Positive Numbers Positive Overflow -(1-2-24)*2128 -.5*2-127 .5*2-127 (1-2-24)*2128 Special Value Exponent Significand +/- 0 0000 0000 Denormalized number Nonzero NaN 1111 1111 +/- infinity

Value: Not a Number Don’t use NaN What is the result of: sqrt(-4.0)or 0/0? If infinity is not an error, these shouldn’t be either. Called Not a Number (NaN) Exponent = 255, Significand nonzero Applications NaNs help with debugging They contaminate: op(NaN, X) = NaN Don’t use NaN Ask math majors

Value: Denorms Problem: There’s a gap among representable FP numbers around 0 Smallest pos num: a = 1.0… 2 * 2-126 = 2-126 2nd smallest pos num: b = 1.001 2 * 2-126 = 2-126 + 2-150 a - 0 = 2-126 b - a = 2-150 Solution: Denormalized numbers: no leading 1 Smallest pos num: a = 2-150 2nd smallest num: b = 2-149 b a + - Gap! + -

Rounding Math on real numbers => rounding Rounding also occurs when converting types Double  single precision  integer Round towards +infinity ALWAYS round “up”: 2.001 => 3; -2.001 => -2 Round towards -infinity ALWAYS round “down”: 1.999 => 1; -1.999 => -2 Truncate Just drop the last bits (round towards 0) Round to (nearest) even (default) 2.5 => 2; 3.5 => 4

FP Fallacy FP Add, subtract associative: FALSE! x = – 1.5 x 1038, y = 1.5 x 1038, and z = 1.0 x + (y + z) = –1.5x1038 + (1.5x1038 + 1.0) = –1.5x1038 + (1.5x1038) = 0.0 (x + y) + z = (–1.5x1038 + 1.5x1038) + 1.0 = (0.0) + 1.0 = 1.0 Floating Point add, subtract are not associative! Why? FP result approximates real result 1.5 x 1038 is so much larger than 1.0 that 1.5 x 1038 + 1.0 in floating point representation is still 1.5 x 1038

Computational Errors with FP Coalition vs. Iraq (91/92) Military base in Dhahran (Saudi Arabia) was protected by 6 US Patriot Missile batteries (mobile, operates only a few hours). Tracks and intercepts enemy missiles. One hit US Army barracks 02/05/91, killing 28 americans Lost of precision int -> float

FP Addition / Subtraction Much more difficult than with integers Can’t just add significands Algorithm De-normalize to match exponents Add (subtract) significands to get resulting one Keep the same exponent Normalize (possibly changing exponent) Note: If signs differ, just perform a subtract instead.

MIPS FP Architecture (1/2) Separate floating point instructions: Single Precision: add.s, sub.s, mul.s, div.s Double Precision: add.d, sub.d, mul.d, div.d These instructions are far more complicated than their integer counterparts Problems: It’s inefficient to have different instructions take vastly differing amounts of time. Generally, a particular piece of data will not change from FP to int, or vice versa, within a program. Some programs do not do floating point calculations It takes lots of hardware relative to integers to do FP fast

MIPS FP Architecture (2/2) 1990 Solution: separate chip that handles only FP. Coprocessor 1: FP chip Contains 32 32-bit registers: $f0, $f1, … Most registers specified in .s and .d instructions ($f) Separate load and store: lwc1 and swc1 (“load word coprocessor 1”, “store …”) Double Precision: by convention, even/odd pair contain one DP FP number: $f0/$f1, … , $f30/$f31 1990 Computer contains multiple separate chips: Processor: handles all the normal stuff Coprocessor 1: handles FP and only FP; Move data between main processor and coprocessors: mfc0, mtc0, mfc1, mtc1, etc.

Floating Point Hardware (FP Add)

C => MIPS float f2c (float fahr) { return ((5.0 / 9.0) * (fahr – 32.0)); } F2c: lwc1 $f16, const5($gp) # $f16 = 5.0 lwc1 $f18, const9($gp) # $f18 = 9.0 div.s $f16, $f16, $f18 # $f16 = 5.0/9.0 lwc1 $f20, const32($gp) # $f20 = 32.0 sub.s $f20, $f12, $f20 # $f20 = fahr – 32.0 mul.s $f0, $f16, $f20 # $f0 = (5/9)*(fahr-32) jr $ra # return

Conclusion Floating Point numbers approximate values that we want to use. IEEE 754 Floating Point Standard is most widely accepted attempt to standardize FP arithmetic New MIPS architectural elements Registers ($f0-$f31) Single Precision (32 bits, 2x10-38… 2x1038) add.s, sub.s, mul.s, div.s Double Precision (64 bits , 2x10-308…2x10308) add.d, sub.d, mul.d, div.d Type is not associated with data, bits have no meaning unless given in context (e.g., int vs. float)