Download presentation
Presentation is loading. Please wait.
1
(original notes from Prof. J. Kelly Flanagan)
Computer Arithmatic Pradondet Nilagupta Spring 2005 (original notes from Prof. J. Kelly Flanagan) Digital System Architecture April 21, 2017
2
204521 Digital System Architecture
Computer Arithmetic This lecture will introduce basic number representations and introduce basic integer and floating point arithmetic. Digital System Architecture April 21, 2017
3
204521 Digital System Architecture
Arithmetic Where we've studied so far: Boolean algebra and basic logic circuits Abstractions: Instruction Set Assembly Language and Machine Language What's up ahead: Implementing the Architecture So we understand instructions better 32 operation result a b ALU Digital System Architecture April 21, 2017
4
Number Representation
Any number can be represented in any base by the simple sum of each digit's value, positional notation: d * basei The value 1011 (binary) equals 1 * * * * 20 = 11 (decimal). Digital System Architecture April 21, 2017
5
Number Representation
as a 32 bit number this would be represented as follows: The most significant bit (MSB) is on the left and the least significant bit (LSB) is on the right. In the number above, the LSB is called bit 0 and the MSB is called bit 31. Digital System Architecture April 21, 2017
6
204521 Digital System Architecture
Range of Values With 32 bits it is possible to have unsigned numbers that range from 0 to = 4,294,967,295. This is quite reasonable for address calculations of present machines, but future machines will surely have more than 232 memory locations. In addition to more memory it is highly useful to be able to represent negative as well as positive integers and numbers smaller and larger then those possible with this format. Digital System Architecture April 21, 2017
7
204521 Digital System Architecture
MIPS 32 bit signed numbers: two = 0ten two = + 1ten two = + 2ten two = + 2,147,483,646ten two = + 2,147,483,647ten two = – 2,147,483,648ten two = – 2,147,483,647ten two = – 2,147,483,646ten two = – 3ten two = – 2ten two = – 1ten maxint minint Digital System Architecture April 21, 2017
8
204521 Digital System Architecture
Signed Integers Since in the binary number system we have an even number of unique representations for a given number of bits we have two choices: 1. Have a balanced system with the same number of negative and positive values, but what about zero? If we represent zero we have an odd number of values to represent. This can be done by allowing zero to be represented by two different bit patterns. Digital System Architecture April 21, 2017
9
204521 Digital System Architecture
Signed Integers 2. Have an unbalanced system where zero has one representation, but there is either an extra positive or negative value. We would like a balanced system, but this would require two different representations for the value zero. This evil (having two zeros) is better to avoid and not worth the benefits of having a balanced system. Consequently, an unbalanced system has been nearly universally adopted. Digital System Architecture April 21, 2017
10
204521 Digital System Architecture
One's Complement How can we represent negative and positive numbers in the binary system? The 1's complement number system using 32 bits has a range from -(2^31 -1) to +(2^31 - 1). Zero can be represented as either positive or negative. The two forms of zero are represented by (all zeros) or (all ones). Digital System Architecture April 21, 2017
11
204521 Digital System Architecture
One's Complement A negative number is formed by representing its magnitude in normal, positive binary format and then inverting each bit. 8-bit examples: = -1 = -0 = +0 = +1 Having two forms of zero is a problem, but arithmetic is simple. Digital System Architecture April 21, 2017
12
204521 Digital System Architecture
Two's Complement The 2's complement number system using 32 bits has a range from -2^31 to 2^ Zero has only one representation: all zeros, but there is one extra negative value. Negative numbers always have the MSB set to 1 -- thus making sign detection extremely simple. Converting 2's complement numbers to signed decimal is extremely simple. Digital System Architecture April 21, 2017
13
204521 Digital System Architecture
Example 2’s complement Convert to signed decimal. 1 * * * * * * * * 20 = = -70. This approach is not feasible in hardware due to speed considerations. Digital System Architecture April 21, 2017
14
Two's Complement Shortcuts
Negation in two's complement is accomplished by inverting each bit of the number and then adding one to the result. Example: Convert -70 (decimal) to two's complement binary. |-70| = 70 = (binary) ~ = (where ~ denotes bit inversion) = = -70 (2's comp). Digital System Architecture April 21, 2017
15
204521 Digital System Architecture
Sign Extension To add a 16-bit value to a 32-bit value, the 16-bit value must first be sign extended. The two numbers are right aligned and the sign bit of the shorter number is replicated to the left until the two numbers are equal in length. Digital System Architecture April 21, 2017
16
Example Sign Extension
we must sign extend the second number and then add: The extra bit on the left is simply discarded, leaving us with for our answer. Digital System Architecture April 21, 2017
17
Possible Number Representations
Sign Magnitude: One's Complement Two's Complement 000 = = = = = = = = = = = = = = = = = = = = = = = = -1 Digital System Architecture April 21, 2017
18
204521 Digital System Architecture
Integer Addition Straight forward approach consists of adding each bit together from right to left and propagating the carry from one bit to the next. Perform the operation 7 + 6 = 1101 Digital System Architecture April 21, 2017
19
204521 Digital System Architecture
Integer Addition Add the numbers 2,147,483, This is known as overflow Overflow can be handled in several ways An exception or interrupt can be asserted A bit in a status register can be set It can be completely ignored Digital System Architecture April 21, 2017
20
204521 Digital System Architecture
Integer Subtraction Straight forward approach consists of negating one term and performing integer addition. Perform the operation 7 - 6 = 0001 Perform the operation 6 - 7 = 1111 Digital System Architecture April 21, 2017
21
204521 Digital System Architecture
Integer Subtraction Add the numbers -2,147,483, This is also overflow Overflow can be handled in several ways An exception or interrupt can be asserted A bit in a status register can be set It can be completely ignored Digital System Architecture April 21, 2017
22
204521 Digital System Architecture
Integer Overflow Overflow can occur whenever the sum of two N bit numbers cannot be represented by another N bit number. Unsigned numbers must be treated differently than signed numbers. Unsigned numbers are usually used for address calculations and it is convenient to ignore overflow. In many architectures this is accomplished by having arithmetic instructions that ignore the overflow condition. Digital System Architecture April 21, 2017
23
204521 Digital System Architecture
Detecting Overflow No overflow when adding a positive and a negative number No overflow when signs are the same for subtraction Overflow occurs when the value affects the sign: overflow when adding two positives yields a negative or, adding two negatives gives a positive or, subtract a negative from a positive and get a negative or, subtract a positive from a negative and get a positive Consider the operations A + B, and A – B Can overflow occur if B is 0 ? Can overflow occur if A is 0 ? Digital System Architecture April 21, 2017
24
204521 Digital System Architecture
Effects of Overflow Most processors hardware automatically generates an exception (interrupt) Control jumps to predefined address for exception Interrupted address is saved for possible resumption Details will be studied in a later chapter Don't always want to detect overflow — new MIPS instructions: addu, addiu, subu — Here, “u” stands for “un-overflowed”. note: addiu still sign-extends! note: sltu, sltiu for unsigned comparisons Digital System Architecture April 21, 2017
25
204521 Digital System Architecture
Designing an ALU Not easy to decide the “best” way to build something Don't want too many inputs to a single gate Don’t want to have to go through too many gates for our purposes, ease of comprehension is important Let's look at a 1-bit ALU for addition: How could we build a 1-bit ALU for add, and, and or? How could we build a 32-bit ALU? cout = a b + a cin + b cin sum = a xor b xor cin Digital System Architecture April 21, 2017
26
204521 Digital System Architecture
Building a 32 bit ALU Each box is a 1bit ALU Digital System Architecture April 21, 2017
27
What about subtraction (a – b) ?
Two's complement approach: just negate b and add. How do we negate? A very clever solution: 1-bit ALU w/ negation Digital System Architecture April 21, 2017
28
Tailoring the ALU to the MIPS
Need to support the set-on-less-than instruction (slt) remember: slt is an arithmetic instruction produces a 1 if rs < rt and 0 otherwise use subtraction: (a-b) < 0 implies a < b Need to support test for equality (beq $t5, $t6, $t7) use subtraction: (a-b) = 0 implies a = b Digital System Architecture April 21, 2017
29
Supporting Set Less Than (slt)
Can we figure out the idea? Digital System Architecture April 21, 2017
30
204521 Digital System Architecture
April 21, 2017
31
204521 Digital System Architecture
Test for equality Notice control lines: 000 = and 001 = or 010 = add 110 = subtract 111 = slt Note: zero is a 1 when the result is zero! Digital System Architecture April 21, 2017
32
204521 Digital System Architecture
Conclusion We can build an ALU to support the MIPS instruction set key idea: use multiplexor to select the output we want we can efficiently perform subtraction using two’s complement we can replicate a 1-bit ALU to produce a 32-bit ALU Important points about hardware all of the gates are always working the speed of a gate is affected by the number of inputs to the gate the speed of a circuit is affected by the number of gates in series (on the “critical path” or the “deepest level of logic”) Our primary focus: comprehension, however, Clever changes to organization can improve performance (similar to using better algorithms in software) we’ll look at two examples for addition and multiplication Digital System Architecture April 21, 2017
33
Problem: ripple carry adder is slow
Is a 32-bit ALU as fast as a 1-bit ALU? Is there more than one way to do addition? two extremes: ripple carry and sum-of-products Can you see the ripple? How could you get rid of it? c1 = b0c0 + a0c0 + a0b0 c2 = b1c1 + a1c1 + a1b1 c2 = c3 = b2c2 + a2c2 + a2b2 c3 = c4 = b3c3 + a3c3 + a3b3 c4 = Not feasible! Why? Digital System Architecture April 21, 2017
34
Carry-look-ahead adder
An approach in-between our two extremes Motivation: If we didn't know the value of carry-in, what could we do? When would we always generate a carry? gi = ai bi When would we propagate the carry? pi = ai + bi Did we get rid of the ripple? c1 = g0 + p0c0 c2 = g1 + p1c1 c2 = c3 = g2 + p2c2 c3 = c4 = g3 + p3c3 c4 = Feasible! Why? Digital System Architecture April 21, 2017
35
Use principle to build bigger adders
Can’t build a 16 bit adder this way... (too big) Could use ripple carry of 4-bit CLA adders Better: use the CLA principle again! Digital System Architecture April 21, 2017
36
Longhand Multiplication
Multiply 1000 * 1001 (either decimal or binary will work!) Multiplicand Multiplier 1000 0000 Product Digital System Architecture April 21, 2017
37
Longhand Multiplication
The first number is the multiplicand and the second the multiplier. The result is larger than either the multiplicand or the multiplier. If the size of the multiplicand is N bits and the size of the multiplier is M bits then the result is M+N bits long. Digital System Architecture April 21, 2017
38
Simple Multiplication Algorithm
The Multiplicand is stored in a 64 bit register and the Multiplier is stored in a 32 bit register while the product register is 64 bits long and initialized to zero. Test bit 0 of the multiplier If this bit is 0 Continue If this bit is 1 Add the multiplicand to the product and store back in the product register Shift the multiplicand register left 1 bit Shift the multiplier register right 1 bit If this is the 32nd iteration END else continue Digital System Architecture April 21, 2017
39
MULTIPLY HARDWARE Version 1
64-bit Multiplicand reg, 64-bit ALU, 64-bit Product reg, 32-bit multiplier reg 64-bit Product 32-bit Multiplier 64-bit Multiplicand 64-bit ALU Shift Left Shift Right Write Control Digital System Architecture April 21, 2017
40
Multiply Algorithm Version 1
Start Multiplier0 = 1 Test Multiplier0 Multiplier0 = 0 1. Add multiplicand to product & place the result in Product register 2. Shift the Multiplicand register left 1 bit. 3. Shift the Multiplier register right 1 bit. M’ier: 0011 M’and: P: 1a. 1=>P=P+Mcand M’ier: 0011 Mcand: P: 2. Shl Mcand M’ier: 0011 Mcand: P: 3. Shr M’ier M’ier: 0001 Mcand: P: 1a. 1=>P=P+Mcand M’ier: 0001 Mcand: P: 2. Shl Mcand M’ier: 0001 Mcand: P: 3. Shr M’ier M’ier: 0000 Mcand: P: 1. 0=>nop M’ier: 0000 Mcand: P: 2. Shl Mcand M’ier: 0000 Mcand: P: 3. Shr M’ier M’ier: 0000 Mcand: P: 1. 0=>nop M’ier: 0000 Mcand: P: 2. Shl Mcand M’ier: 0000 Mcand: P: 3. Shr M’ier M’ier: 0000 Mcand: P: 32nd repetition? No: < 32 repetitions Yes: 32 repetitions Done Digital System Architecture April 21, 2017
41
204521 Digital System Architecture
Try It Yourself, V1 4x4 bit Multiply, Version 1 Show each step for 5 x 11 = 55 Multiplier Multiplicand Product Operation initial load Any questions so far? Let’s take a 5-minute break. +5 = 50 min. (Y:30) Digital System Architecture April 21, 2017
42
204521 Digital System Architecture
Solution, V1 5 x 11 = 55 Multiplier Multiplicand Product Operation initial load add shift no add shift add shift no add shift Any questions so far? Let’s take a 5-minute break. +5 = 50 min. (Y:30) Digital System Architecture April 21, 2017
43
Observations on Multiply Version 1
1 clock per cycle --> 32 x 3 = 100 clocks per multiply Ratio of multiply frequency to add is 5:1 to 100:1 But remember Amdahl’s Law! 1/2 bits in multiplicand always 0 64-bit adder is wasted 0’s inserted in right side of multiplicand as shifted least significant bits of product never changed once formed Instead of shifting multiplicand to left, shift product to right? Digital System Architecture April 21, 2017
44
204521 Digital System Architecture
A Better Multiplier In the previous algorithm half of the bits in the multiplicand register are zero and contain no needed information. This algorithm requires only a 32 bit ALU and multiplicand register If multiplier bit 0 = 0 Continue if multiplier bit 0 = 1 Add the multiplicand to the left half of the product register and store the result in the left half Shift the product and multiplier registers right 1 bit. If this is the 32nd iteration END otherwise continue Digital System Architecture April 21, 2017
45
MULTIPLY HARDWARE Version 2
32-bit Multiplicand reg, 32 -bit ALU, 64-bit Product reg, 32-bit Multiplier reg 32-bit Multiplicand 32-bit Multiplier Shift Right 32-bit ALU Shift Right 64-bit Product Control Write Digital System Architecture April 21, 2017
46
Multiply Algorithm Version 2
Start Multiplier0 = 1 Test Multiplier0 Multiplier0 = 0 1. Add multiplicand to the left half of product & place the result in the left half of Product register 2. Shift the Product register right 1 bit. 3. Shift the Multiplier register right 1 bit. M’ier: 0011 Mcand: 0010 P: 1a. 1=>P=P+Mcand M’ier: 0011 Mcand: 0010 P: 2. Shr P M’ier: 0011 Mcand: 0010 P: 3. Shr M’ier M’ier: 0001 Mcand: 0010 P: 1a. 1=>P=P+Mcand M’ier: 0001 Mcand: 0010 P: 2. Shr P M’ier: 0001 Mcand: 0010 P: 3. Shr M’ier M’ier: 0000 Mcand: 0010 P: 1. 0=>nop M’ier: 0000 Mcand: 0010 P: 2. Shr P M’ier: 0000 Mcand: 0010 P: 3. Shr M’ier M’ier: 0000 Mcand: 0010 P: 1. 0=>nop M’ier: 0000 Mcand: 0010 P: 2. Shr P M’ier: 0000 Mcand: 0010 P: 3. Shr M’ier M’ier: 0000 Mcand: 0010 P: 32nd repetition? No: < 32 repetitions Yes: 32 repetitions Done Digital System Architecture April 21, 2017
47
204521 Digital System Architecture
Try It Yourself, V2 4x4 bit Multiply, Version 2 Show each step for 5 x 11 = 55 Multiplier Multiplicand Product Operation initial load Any questions so far? Let’s take a 5-minute break. +5 = 50 min. (Y:30) Digital System Architecture April 21, 2017
48
204521 Digital System Architecture
Solution, V2 5 x 11 = 55 Multiplier Multiplicand Product Operation initial load add shift no add shift add shift no add shift Any questions so far? Let’s take a 5-minute break. +5 = 50 min. (Y:30) Digital System Architecture April 21, 2017
49
Observations on Multiply Version 2
Right half of product register wastes space that exactly matches size of multiplier So put multiplier in right half of product register ? Digital System Architecture April 21, 2017
50
204521 Digital System Architecture
Another Multiplier In the previous algorithm the lower bits of the product register are wasted and could be used to store the multiplier. If product bit 0 = 0 Continue if product bit 0 = 1 Add the multiplicand to the left half of the product register and store the result in the left half Shift the product register right 1 bit. If this is the 32nd iteration END otherwise continue Digital System Architecture April 21, 2017
51
MULTIPLY HARDWARE Version 3
32-bit Multiplicand reg, 32 -bit ALU, 64-bit Product / Multiplier reg 32-bit Multiplicand 32-bit ALU Shift Right 64-bit Product / Multiplier Control Write Digital System Architecture April 21, 2017
52
Multiply Algorithm Version 3
Start Product0 = 1 Test Product0 Product0 = 0 1. Add multiplicand to the left half of product & place the result in the left half of Product register 2. Shift the Product register right 1 bit. 32nd repetition? No: < 32 repetitions Yes: 32 repetitions Done Digital System Architecture April 21, 2017
53
204521 Digital System Architecture
Try It Yourself, V3 4x4 bit Multiply, Version 3 Show each step for 5 x 11 = 55 Multiplicand Product / Multiplier Operation initial load Any questions so far? Let’s take a 5-minute break. +5 = 50 min. (Y:30) Digital System Architecture April 21, 2017
54
204521 Digital System Architecture
Solution, V3 5 x 11 = 55 Multiplicand Product / Multiplier Operation initial load add shift no add shift add shift no add shift Any questions so far? Let’s take a 5-minute break. +5 = 50 min. (Y:30) Digital System Architecture April 21, 2017
55
Observations on Multiply Version 3
2 steps per bit because Multiplier & Product combined MIPS registers Hi and Lo are left and right half of Product Gives us MIPS instruction MultU What about signed multiplication? easiest solution is to make both positive & remember whether to complement product when done (leave out the sign bit, run for 31 steps) Booth’s Algorithm is more elegant way to multiply signed numbers using same hardware as before Digital System Architecture April 21, 2017
56
204521 Digital System Architecture
Booth's Multiplier In the previous algorithms only unsigned numbers were dealt with. Booth's algorithm deals with signed numbers as well. If the current and previous bits are 00 -- no addition or subtraction 01 -- end of string of one's so add the multiplicand to the left half of the product 10 -- beginning of a string of one's so subtract the multiplicand from the left half of the product 11 -- middle of a set of one's no addition or subtraction Shift the product register right 1 bit. If this is the 32nd iteration END otherwise continue Digital System Architecture April 21, 2017
57
Example of Booth's Algorithm
5 = x -3 = x 11101 Product subtract and shift add and shift subtract and shift shift only shift only result = = = -15 (base10) Digital System Architecture April 21, 2017
58
Floating Point Numbers
Although 2's complement numbers have been used for nearly 25 years, many floating point number representations were used up until about 1985. Different representations make it very difficult to transfer data from one machine to another. A standard was and is needed. Digital System Architecture April 21, 2017
59
204521 Digital System Architecture
Reals and Floats Consider the following numbers: 2.718 = 1 * 109 3,155,760,000 = * 109 The first three numbers cannot be represented by binary integers for obvious reasons, and the fourth cannot because signed 32-bit values are not large enough. Digital System Architecture April 21, 2017
60
204521 Digital System Architecture
Reals and Floats Binary numbers can also be represented in scientific notation. The general form of this is s 1.m * 10 ^ e (binary) where s is either + or - representing the sign of the number, m is a string of 1s and 0s representing the fractional part of the mantissa, and e is a + or - string of 1s and 0s representing an exponent, or the number's order of magnitude. Note that this is almost identical in form to standard decimal scientific notation, except that the 10 represents 2 (decimal), not 10 (decimal). Digital System Architecture April 21, 2017
61
Single Precision Floating Point (FP) Numbers
Current computer systems dictate that FP numbers must fit in 32- or 64-bit registers. 32-bit or single precision FP numbers are organized as follows: seee eeee emmm mmmm mmmm mmmm mmmm mmmm where s is the sign of the number, e represents the biased exponent, and m represents the mantissa. 32-bit values range in magnitude from to 1038. Digital System Architecture April 21, 2017
62
Double Precision Floating Point Numbers
64 bit double precision floating point numbers are organized as follows: The MSB is the sign bit The next 11 bits are the exponent The remaining 52 bits are the significand The range in magnitude is from to 10308 The growth of significand and exponent is a compromise between accuracy and range. Digital System Architecture April 21, 2017
63
Implied Normalization
In the IEEE 754 floating point standard, a leading 1 in the significand is assumed. This increases the effective length of the significand in both single and double precision arithmetic Numbers are therefore represented by: (-1)S * (1.M) * 2E-127 where S is the sign, M is the significand and E is the exponent Digital System Architecture April 21, 2017
64
204521 Digital System Architecture
Why IEEE 754? Example if(x == 0.0) y = 17.0 else y = z/x Can this cause a divide by zero error? Digital System Architecture April 21, 2017
65
204521 Digital System Architecture
Why IEEE 754? In the CDC6600 the adder and subtractor examine 13 bits while the multiplier/divider only examines 12 bits. Very small x's cause a problem. Solution: if(1.0 * x == 0.0) CRAY overflow, Why? y = 17.0 else y = z/x Digital System Architecture April 21, 2017
66
Floating Point Addition
It should be clear that addition and subtraction are performed in a similar manner. The steps for adding two floating point numbers is as follows: 1.We must adjust the smaller number until its exponent matches that of the larger number. 2.We add the significands together and round 3.Then the result is normalized by shifting the significand left or right and adjusting the exponent. 4.The number must again be rounded Digital System Architecture April 21, 2017
67
Floating Point Addition
Add the numbers 0.5 and In normalized form 0.5 = 1.000*2-1 = *2-2 Shift the significand of the smaller number right until the exponents of the two numbers are the same and add the significands -0.111* *2-1 = 0.001*2-1 Digital System Architecture April 21, 2017
68
Floating Point Addition
Normalize the result 0.001*2-1 = 0.010*2-2 = 0.100*2-3 = 1.000*2-4 Round the result This result already fits in the 4 bit field and no rounding is needed. Digital System Architecture April 21, 2017
69
Floating Point Addition Continued
Shift the significand of the smaller number right until the exponents of the two numbers are the same and add the significands *10-1 = *101 Digital System Architecture April 21, 2017
70
Floating Point Addition Continued
Round the significand = *101 0.016* *101 = *101 Normalize the result 10.015*101 = *102 Round the result *102 Digital System Architecture April 21, 2017
71
Floating Point Multiplication
1.Add the biased exponents and subtract off the bias. 2.Multiply the significands, this is an unsigned multiply 3.Normalize the product shifting it right and incrementing the exponent 4.Check for overflow or underflow 5.Round the significand 6.Check to see if it is still normalized 7.Set the sign bit to the appropriate value, this is simply the XOR of the initial two sign bits Digital System Architecture April 21, 2017
72
More FP Multiplication
0.5 * 1.000* *2-2 1.000*2-1 = many more sig. bits -1.110*2-2 = many more sig. bits Digital System Architecture April 21, 2017
73
More FP Multiplication
Add exponents and subtract bias = = 251 = = 124 Digital System Architecture April 21, 2017
74
More FP Multiplication
Multiply the significands 1.000 x 0000 1000 = = *2-3 = many more sig. bits Digital System Architecture April 21, 2017
75
Guard, Round, and Sticky Bits
The guard and round bits are two bits to the right of the significand used for intermediate results. 2.56* *102 = * *102 Guard bit holds 5 Round bit holds 6 Now we round using two digits, the guard and round digits 0 to 49 round down 51 to 99 round up 50+sticky round up else even Digital System Architecture April 21, 2017
76
Guard, Round, and Sticky Bits
We round up with a 56 = 2.37*102 Without guard and round bits we would have truncated the intermediate result and acquired = 2.36*102 Digital System Architecture April 21, 2017
77
Floating Point Division
Don't fear we are not going to do anything in detail with division. The hardware and algorithms are quite similar to multiplication, but instead of using addition subtraction is used heavily. Digital System Architecture April 21, 2017
78
204521 Digital System Architecture
Logical Operations There are several logical operations that are useful or needed on a computer system AND OR NAND NOR NOT or Invert XOR XNOR Digital System Architecture April 21, 2017
79
204521 Digital System Architecture
Logical Operations These functions are usually used to test bits fields or patterns within a machine word. These are quite different than the logical operations used in statements like if((a = b) || (c == d)) if((a !=b) & & (c == d)) Digital System Architecture April 21, 2017
80
204521 Digital System Architecture
Time and Space If we are performing a 32 bit addition all of the inputs are known immediately, but we must wait for the carry to ripple the length of the adder. In an ancient VLSI adder I designed and simulated the time for the carry propagation was 7ns. This would result in a 32 bit add time of 224ns resulting in a usable system clock rate of 4.46MHz. This is way too slow. Digital System Architecture April 21, 2017
81
204521 Digital System Architecture
Time and Space Several types of improved adders have been designed: Carry Lookahead Adders Carry Skip Adders Carry Select Adders Adder Type Time Size bit adder, 7ns ripple Ripple O(n) O(n) * 32 = 224 Carry Lookahead O(log n) O(n log n) 7 * log 32 = 35 Carry Skip O(sqrt(n)) O(n) * sqrt 32 = 40 Carry Select O(sqrt(n)) O(n) * sqrt 32 = 40 Digital System Architecture April 21, 2017
82
204521 Digital System Architecture
Time and Space Notice that the Carry Lookahead Adder is the fastest, but also the largest! The Carry Lookahead Adder generates the carry signals by generating the P and G signals described in the text. This added circuitry is responsible for the large size of this adder. The Carry Skip Adder only generates the P signals The Carry Select Adder uses two complete adders, one with a carry in of 1 and the other 0. Digital System Architecture April 21, 2017
83
204521 Digital System Architecture
Fast Multipliers Many adders can be used in parallel to speed multiplication. These units are called parallel multipliers. The important issue is that increased speed can be achieved by throwing hardware at the problem. Parallel multipliers can be pipelined to allow a multiplication to begin on each clock cycle. The result still takes N clock cycles, but then a new result becomes available each clock cycle after the first. Digital System Architecture April 21, 2017
84
204521 Digital System Architecture
Conclusions Algorithms such as binary addition for integer or floating point can usually be made faster by throwing more hardware at the problem. It is the job of the architect and the implementor to make the speed versus cost trade-offs. Understanding the way the datapath of a machine is implemented is crucial for compiler and operating system writers. Digital System Architecture April 21, 2017
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.