VLSI Arithmetic Lecture 5

Slides:

Advertisements

Similar presentations

Recent Developments in Theory and Implementation of Parallel Prefix Adders Neil Burgess Division of Electronics Cardiff School of Engineering Cardiff University.

Advertisements

UNIVERSITY OF MASSACHUSETTS Dept

VLSI Arithmetic Adders & Multipliers

Chapter 4 -- Modular Combinational Logic. Decoders.

EE141 © Digital Integrated Circuits 2nd Arithmetic Circuits 1 Digital Integrated Circuits A Design Perspective Arithmetic Circuits Jan M. Rabaey Anantha.

EE141 Adder Circuits S. Sundar Kumar Iyer.

CSE-221 Digital Logic Design (DLD)

CSE 246: Computer Arithmetic Algorithms and Hardware Design Instructor: Prof. Chung-Kuan Cheng Lecture 4: Adders.

VLSI Arithmetic Adders Prof. Vojin G. Oklobdzija University of California

ECE C03 Lecture 61 Lecture 6 Arithmetic Logic Circuits Hai Zhou ECE 303 Advanced Digital Design Spring 2002.

Lecture 8 Arithmetic Logic Circuits

VLSI Arithmetic Adders & Multipliers

Parallel Prefix Adders A Case Study

Institute of Applied Microelectronics and Computer Engineering College of Computer Science and Electrical Engineering, University of Rostock Slide 1 Spezielle.

VLSI Arithmetic Adders & Multipliers Prof. Vojin G. Oklobdzija University of California

VLSI Arithmetic Adders & Multipliers Prof. Vojin G. Oklobdzija University of California

Abdullah Aldahami ( ) Feb26, Introduction 2. Feedback Switch Logic 3. Arithmetic Logic Unit Architecture a.Ripple-Carry Adder b.Kogge-Stone.

1 Design Space Exploration for Power-Efficient Mixed-Radix Ling Adders Chung-Kuan Cheng Computer Science and Engineering Depart. University of California,

A 240ps 64b Carry-Lookahead Adder in 90nm CMOS Faezeh Montazeri Advanced VLSI Course Presentation University of Tehran December.

Design of a 32-Bit Hybrid Prefix-Carry Look-Ahead Adder

Computing Systems Designing a basic ALU.

1 Lecture 12 Time/space trade offs Adders. 2 Time vs. speed: Linear chain 8-input OR function with 2-input gates Gates: 7 Max delay: 7.

EE 466/586 VLSI Design Partha Pande School of EECS Washington State University

Unrolling Carry Recurrence

1 Carry Lookahead Logic Carry Generate Gi = Ai Bi must generate carry when A = B = 1 Carry Propagate Pi = Ai xor Bi carry in will equal carry out here.

CPEN Digital System Design

Lecture #23: Arithmetic Circuits-1 Arithmetic Circuits (Part I) Randy H. Katz University of California, Berkeley Fall 2005.

EE141 Arithmetic Circuits 1 Chapter 14 Arithmetic Circuits Rev /12/2003 Rev /05/2003.

EE141 Arithmetic Circuits 1 Chapter 14 Arithmetic Circuits Rev /12/2003.

Carry-Lookahead & Carry-Select Adders

Lecture 12 Logistics Last lecture Today HW4 due today Timing diagrams

Subtitle: How to design the data path of a processor.

UNIVERSITY OF MASSACHUSETTS Dept

Conditional-Sum Adders Parallel Prefix Network Adders

Jackson Adders Prof. David Money Harris Matthew Keeter, Andrew Macrae,

Swamynathan.S.M AP/ECE/SNSCT

Divide-and-Conquer Design

Adder wrap-up Lecture

Parallel Adders.

CSE 575 Computer Arithmetic Spring 2005 Mary Jane Irwin (www. cse. psu

Prof. Vojin G. Oklobdzija

ECE/CS 552: Carry Lookahead

VLSI Arithmetic Lecture 8

Combinational Circuits

ECE 331 – Digital System Design

CSE Winter 2001 – Arithmetic Unit - 1

VLSI Arithmetic Lecture 4

VLSI Arithmetic Adders & Multipliers

Lecture 14 Logistics Last lecture Today

King Fahd University of Petroleum and Minerals

Arithmetic Functions & Circuits

VLSI Arithmetic Lecture 10: Multipliers

Arithmetic Circuits (Part I) Randy H

Digital Integrated Circuits A Design Perspective

Instructor: Prof. Chung-Kuan Cheng

ARM implementation the design is divided into a data path section that is described in register transfer level (RTL) notation control section that is viewed.

UNIVERSITY OF MASSACHUSETTS Dept

Review: Basic Building Blocks

Overview Part 1 – Design Procedure Part 2 – Combinational Logic

Part III The Arithmetic/Logic Unit

Lecture 14 Logistics Last lecture Today

74LS283 4-Bit Binary Adder with Fast Carry

Lecture 9 Digital VLSI System Design Laboratory

Carry-Lookahead, Carry-Select, & Hybrid Adders

Carry-Lookahead, Carry-Select, & Hybrid Adders

Carry-Lookahead & Carry-Select Adders

Arithmetic Building Blocks

Arithmetic Circuits.

Conditional-Sum Adders Parallel Prefix Network Adders

Presentation transcript:

VLSI Arithmetic Lecture 5 Prof. Vojin G. Oklobdzija University of California http://www.ece.ucdavis.edu/acsel

Review Lecture 4

Ling’s Adder Huey Ling, “High-Speed Binary Adder” IBM Journal of Research and Development, Vol.5, No.3, 1981. Used in: IBM 3033, IBM S370/168, Amdahl V6, HP etc.

Ling’s Derivations define: ai bi ci si ci+1 define: gi implies Ci+1 which implies Hi+1 , thus: gi= gi Hi+1 ai bi pi gi ti 1 Oklobdzija 2004 Computer Arithmetic

Ling’s Derivations From: Now we need to derive Sum equation and because: fundamental expansion Now we need to derive Sum equation Oklobdzija 2004 Computer Arithmetic

Ling Adder Ling’s equations: Variation of CLA: Ling, IBM J. Res. Dev, 5/81 Oklobdzija 2004 Computer Arithmetic

Ling Adder Ling’s equation: Variation of CLA: ai bi ci si ci+1 ai-1 bi-1 ci-1 si-1 gi, ti gi-1, ti-1 Hi+1 Hi Ling uses different transfer function. Four of those functions have desired properties (Ling’s is one of them) see: Doran, IEEE Trans on Comp. Vol 37, No.9 Sept. 1988. Oklobdzija 2004 Computer Arithmetic

Ling Adder Conventional: Ling: Fan-in of 5 Fan-in of 4 Oklobdzija 2004 Computer Arithmetic

Advantages of Ling’s Adder Uniform loading in fan-in and fan-out H16 contains 8 terms as compared to G16 that contains 15. H16 can be implemented with one level of logic (in ECL), while G16 can not (with 8-way wire-OR). (Ling’s adder takes full advantage of wired-OR, of special importance when ECL technology is used - his IBM limitation was fan-in of 4 and wire-OR of 8) Oklobdzija 2004 Computer Arithmetic

Ling: Weinberger Notes Oklobdzija 2004 Computer Arithmetic

Ling: Weinberger Notes Oklobdzija 2004 Computer Arithmetic

Ling: Weinberger Notes Oklobdzija 2004 Computer Arithmetic

Advantage of Ling’s Adder 32-bit adder used in: IBM 3033, IBM S370/ Model168, Amdahl V6. Implements 32-bit addition in 3 levels of logic Implements 32-bit AGEN: B+Index+Disp in 4 levels of logic (rather than 6) 5 levels of logic for 64-bit adder used in HP processor Oklobdzija 2004 Computer Arithmetic

Implementation of Ling’s Adder in CMOS (S Implementation of Ling’s Adder in CMOS (S. Naffziger, “A Subnanosecond 64-b Adder”, ISSCC ‘ 96) Oklobdzija 2004 Computer Arithmetic

S. Naffziger, ISSCC’96 Oklobdzija 2004 Computer Arithmetic

S. Naffziger, ISSCC’96 Oklobdzija 2004 Computer Arithmetic

S. Naffziger, ISSCC’96 Oklobdzija 2004 Computer Arithmetic

S. Naffziger, ISSCC’96 Oklobdzija 2004 Computer Arithmetic

S. Naffziger, ISSCC’96 Oklobdzija 2004 Computer Arithmetic

S. Naffziger, ISSCC’96 Oklobdzija 2004 Computer Arithmetic

S. Naffziger, ISSCC’96 Oklobdzija 2004 Computer Arithmetic

S. Naffziger, ISSCC’96 Oklobdzija 2004 Computer Arithmetic

S. Naffziger, ISSCC’96 Oklobdzija 2004 Computer Arithmetic

S. Naffziger, ISSCC’96 Oklobdzija 2004 Computer Arithmetic

S. Naffziger, ISSCC’96 Oklobdzija 2004 Computer Arithmetic

Ling Adder Critical Path Oklobdzija 2004 Computer Arithmetic

Ling Adder: Circuits Oklobdzija 2004 Computer Arithmetic

LCS4 – Critical G Path Oklobdzija 2004 Computer Arithmetic

LCS4 – Logical Effort Delay Oklobdzija 2004 Computer Arithmetic

See: S. Naffziger, “A Subnanosecond 64-b Adder”, ISSCC ‘ 96 Results: 0.5u Technology Speed: 0.930 nS Nominal process, 80C, V=3.3V See: S. Naffziger, “A Subnanosecond 64-b Adder”, ISSCC ‘ 96 Oklobdzija 2004 Computer Arithmetic

Prefix Adders and Parallel Prefix Adders

from: Ercegovac-Lang Oklobdzija 2004 Computer Arithmetic

Prefix Adders (g, p)o(g’,p’)=(g+pg’, pp’) (g0, p0) Gi, Pi = Following recurrence operation is defined: (g, p)o(g’,p’)=(g+pg’, pp’) such that: (g0, p0) i=0 Gi, Pi = (gi, pi)o(Gi-1, Pi-1 ) 1 ≤ i ≤ n ci+1 = Gi for i=0, 1, ….. n (g-1, p-1)=(cin,cin) c1 = g0+ p0 cin This operation is associative, but not commutative It can also span a range of bits (overlapping and adjacent) Oklobdzija 2004 Computer Arithmetic

from: Ercegovac-Lang Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: variety of possibilities from: Ercegovac-Lang Oklobdzija 2004 Computer Arithmetic

Pyramid Adder: M. Lehman, “A Comparative Study of Propagation Speed-up Circuits in Binary Arithmetic Units”, IFIP Congress, Munich, Germany, 1962. Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: variety of possibilities from: Ercegovac-Lang Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: variety of possibilities from: Ercegovac-Lang Oklobdzija 2004 Computer Arithmetic

Hybrid BK-KS Adder Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: S. Knowles 1999 operation is associative: h>i≥j≥k operation is idempotent: h>i≥j≥k produces carry: cin=0 Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: Ladner-Fisher Exploits associativity, but not idempotency. Produces minimal logical depth Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: Ladner-Fisher (16,8,4,2,1) Two wires at each level. Uniform, fan-in of two. Large fan-out (of 16; n/2); Large capacitive loading combined with the long wires (in the last stages) Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: Kogge-Stone Exploits idempotency to limit the fan-out to 1. Dramatic increase in wires. The wire span remains the same as in Ladner-Fisher. Buffers needed in both cases: K-S, L-F Oklobdzija 2004 Computer Arithmetic

Kogge-Stone Adder Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: Brent-Kung Set the fan-out to one Avoids explosion of wires (as in K-S) Makes no sense in CMOS: fan-out = 1 limit is arbitrary and extreme much of the capacitive load is due to wire (anyway) It is more efficient to insert buffers in L-F than to use B-K scheme Oklobdzija 2004 Computer Arithmetic

Brent-Kung Adder Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: Han-Carlson Is a hybrid synthesis of L-F and K-S Trades increase in logic depth for a reduction in fan-out: effectively a higher-radix variant of K-S. others do it similarly by serializing the prefix computation at the higher fan-out nodes. Others, similarly trade the logical depth for reduction of fan-out and wire. Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: variety of possibilities from: Knowles bounded by L-F and K-S at ends Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: variety of possibilities Knowles 1999 Following rules are used: Lateral wires at the jth level span 2j bits Lateral fan-out at jth level is power of 2 up to 2j Lateral fan-out at the jth level cannot exceed that a the (j+1)th level. Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: variety of possibilities Knowles 1999 The number of minimal depth graphs of this type is given in: at 4-bits there is only K-S and L-F, afterwards there are several new possibilities. Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: variety of possibilities Knowles 1999 example of a new 32-bit adder [4,4,2,2,1] Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: variety of possibilities Knowles 1999 Example of a new 32-bit adder [4,4,2,2,1] Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: variety of possibilities Knowles 1999 Delay is given in terms of FO4 inverter delay: w.c. (nominal case is 40-50% faster) K-S is the fastest K-S adders are wire limited (requiring 80% more area) The difference is less than 15% between examined schemes Oklobdzija 2004 Computer Arithmetic

Parallel Prefix Adders: variety of possibilities Knowles 1999 Conclusion Irregular, hybrid schmes are possible The speed-up of 15% is achieved at the cost of large wiring, hence area and power Circuits close in speed to K-S are available at significantly lower wiring cost Oklobdzija 2004 Computer Arithmetic