A 240ps 64b Carry-Lookahead Adder in 90nm CMOS Faezeh Montazeri Advanced VLSI Course Presentation University of Tehran December.

Slides:



Advertisements
Similar presentations
Feb. 17, 2011 Midterm overview Real life examples of built chips
Advertisements

Dynamic and Leakage Power Reduction in MTCMOS Circuits Using an Automated Efficient Gate Clustering Technique Mohab Anis, Shawki Areibi *, Mohamed Mahmoud.
CPE 626 CPU Resources: Adders & Multipliers Aleksandar Milenkovic Web:
EE141 © Digital Integrated Circuits 2nd Arithmetic Circuits 1 Digital Integrated Circuits A Design Perspective Arithmetic Circuits Jan M. Rabaey Anantha.
EE141 Adder Circuits S. Sundar Kumar Iyer.
A Leakage Current Replica Keeper for Dynamic Circuits  Based on the work presented in “A leakage Current Replica Keeper for Dynamic Circuits” by: Yolin.
Introduction to CMOS VLSI Design Lecture 19: Design for Skew David Harris Harvey Mudd College Spring 2004.
Introduction to CMOS VLSI Design Clock Skew-tolerant circuits.
Clock Design Adopted from David Harris of Harvey Mudd College.
A 16-Bit Kogge Stone PS-CMOS adder with Signal Completion Seng-Oon Toh, Daniel Huang, Jan Rabaey May 9, 2005 EE241 Final Project.
Designing Combinational Logic Circuits: Part2 Alternative Logic Forms:
EE141 © Digital Integrated Circuits 2nd Arithmetic Circuits 1 [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
VLSI Arithmetic Adders Prof. Vojin G. Oklobdzija University of California
Introduction to CMOS VLSI Design Lecture 11: Adders
Digital Design – Optimizations and Tradeoffs
S. Reda EN1600 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 25: Datapath Subsystems 1/4 Prof. Sherief Reda Division of Engineering,
1 UCSD VLSI CAD Laboratory ISQED-2009 Revisiting the Linear Programming Framework for Leakage Power vs. Performance Optimization Kwangok Jeong, Andrew.
Digital Integrated Circuits© Prentice Hall 1995 Arithmetic Arithmetic Building Blocks.
Lecture 17: Adders.
Adders. Full-Adder The Binary Adder Express Sum and Carry as a function of P, G, D Define 3 new variable which ONLY depend on A, B Generate (G) = AB.
Lec 17 : ADDERS ece407/507.
Introduction to CMOS VLSI Design Lecture 11: Adders David Harris Harvey Mudd College Spring 2004.
VLSI Arithmetic Adders & Multipliers Prof. Vojin G. Oklobdzija University of California
A 30-GS/sec Track and Hold Amplifier in 0.13-µm CMOS Technology
Graduate Institute of Electronics Engineering National Taiwan University 國立台灣大學電子工程學研究所 Short Talk Optimal Designs of CMOS Circuits via Geometric Programming.
Abdullah Aldahami ( ) Feb26, Introduction 2. Feedback Switch Logic 3. Arithmetic Logic Unit Architecture a.Ripple-Carry Adder b.Kogge-Stone.
1 Design Space Exploration for Power-Efficient Mixed-Radix Ling Adders Chung-Kuan Cheng Computer Science and Engineering Depart. University of California,
Arithmetic Building Blocks
EE141 © Digital Integrated Circuits 2nd Arithmetic Circuits 1 Digital Integrated Circuits A Design Perspective Arithmetic Circuits Reference: Digital Integrated.
Tinoosh Mohsenin and Bevan M. Baas VLSI Computation Lab, ECE Department University of California, Davis Split-Row: A Reduced Complexity, High Throughput.
DCSL & LVDCSL: A High Fan-in, High Performance Differential Current Switch Logic Families Dinesh Somasekhaar, Kaushik Roy Presented by Hazem Awad.
Chapter 14 Arithmetic Circuits (I): Adder Designs Rev /12/2003
ECE 8053 – Project Fall’02 Design of 64-bit Low Power Spanning Tree Carry Lookahead Adder Presented by Suderson Soundararajan.
Optimal digital circuit design Mohammad Sharifkhani.
Low Power – High Speed MCML Circuits (II)
Design of a 32-Bit Hybrid Prefix-Carry Look-Ahead Adder
26 th International Conference on VLSI January 2013 Pune,India Optimum Test Schedule for SoC with Specified Clock Frequencies and Supply Voltages Vijay.
A 30 GS/s 4-Bit Binary Weighted DAC in SiGe BiCMOS Technology
A Low-Leakage 2.5GHZ Skewed CMOS 32-Bit Adder For Nanometer CMOS Technologies Advanced VLSI Course Seminar December 28, 2006 Peresented by: Rabe’e Majidi.
EE 466/586 VLSI Design Partha Pande School of EECS Washington State University
EE141 © Digital Integrated Circuits 2nd Arithmetic Circuits 1 Digital Integrated Circuits A Design Perspective Arithmetic Circuits Jan M. Rabaey Anantha.
UC Berkeley, Dept of EECS EE141, Fall 2005, Project 2 Speed/Area Optimized 8-bit Adder Design Name 1 Name 2 Delay(  s) ·Area(  m 2 ) = {your number}
Integrated VLSI Systems EEN4196 Title: 4-bit Parallel Full Adder.
Basics of Energy & Power Dissipation
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA Project Guide: Smt. Latha Dept of E & C JSSATE, Bangalore. From: N GURURAJ M-Tech,
Unrolling Carry Recurrence
Digital Integrated Circuits© Prentice Hall 1995 Arithmetic Arithmetic Building Blocks.
Dynamic Logic Dynamic Circuits will be introduced and their performance in terms of power, area, delay, energy and AT2 will be reviewed. We will review.
Multi-Split-Row Threshold Decoding Implementations for LDPC Codes
EE466: VLSI Design Lecture 13: Adders
CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 19: Adder Design
64 bit Kogge-Stone Adders in different logic styles – A study Rob McNish Satyanand Nalam.
Seok-jae, Lee VLSI Signal Processing Lab. Korea University
Joshua L. Garrett Digital Circuits Design GroupUniversity of California, Berkeley Compact DSM MOS Modeling for Energy/Delay Estimation Joshua Garrett,
EE141 Project: 32x32 SRAM Abhinav Gupta, Glen Wong Optimization goals: Balance between area and performance Minimize area without sacrificing performance.
A 64b Adder Using Self-Calibrating Differential Output Prediction Logic K. H. Chong and Larry McMurchie Dept. of Electrical Engineering University of Washington.
EE141 Arithmetic Circuits 1 Chapter 14 Arithmetic Circuits Rev /12/2003 Rev /05/2003.
Characterizing Processors for Energy and Performance Management Harshit Goyal and Vishwani D. Agrawal Department of Electrical and Computer Engineering,
Tinoosh Mohsenin 2, Houshmand Shirani-mehr 1, Bevan Baas 1 1 University of California, Davis 2 University of Maryland Baltimore County Low Power LDPC Decoder.
EE141 Arithmetic Circuits 1 Chapter 14 Arithmetic Circuits Rev /12/2003.
Jackson Adders Prof. David Money Harris Matthew Keeter, Andrew Macrae,
VLSI Arithmetic Lecture 5
VLSI Arithmetic Lecture 4
Digital Integrated Circuits A Design Perspective
Circuit Design Techniques for Low Power DSPs
332:578 Deep Submicron VLSI Design Lecture 14 Design for Clock Skew
EE115C – Winter 2009 Digital Electronic Circuits
EE216A – Fall 2010 Design of VLSI Circuits and Systems
Arithmetic Building Blocks
Arithmetic Circuits.
Presentation transcript:

A 240ps 64b Carry-Lookahead Adder in 90nm CMOS Faezeh Montazeri Advanced VLSI Course Presentation University of Tehran December 2006 Based on : A 240ps 64b Carry-Lookahead Adder in 90nm CMOS Sean Kao, Radu Zlatanovici, Borivoje Nikolić University of California, Berkeley

2 What Is an Optimal Adder? Optimal adder: Minimum delay for given energy Minimum energy for given delay 64-bit Adders on IEEE Xplore [1]

3 This Work Multi-issue 64-bit microprocessor environment: Optimize a set of representative 64-bit adders in the energy – delay space Analyze the design tradeoffs Implement the optimal adder in 1.0V 90nm GP CMOS

4 Outline Energy – delay optimization Design tradeoffs for 64-bit adders Test chip implementation Measured results Summary

5 Energy – Delay Optimization Delay Energy Domino CLA Adder Goal: obtain the energy – delay optimal adder CAD tool: optimize custom digital circuits in the energy – delay space [3] Static CLA Adder [1]

6 Circuit Optimization Framework Optimizer (Matlab) Delay, Energy Static timer (C++) ModelsNetlistOptimization Goal Optimal Design Variables Design Variables Static timer (C++) Optimization Core [1]

7 Adder Optimization Setup Minimize DELAY subject to Maximum ENERGY [1]

8 CLA: Full Tree Comparison 6 stages Moderate branching 3 stages Larger branching Radix- 4 closer to optimum number of stages Radix-2 Radix-4 [1]

9 CLA vs. Ling Conventional CLA Higher stack in first stage Simple sum precompute Ling CLA Lower stack in first stage Complex sum precompute Higher speed [1] [2]

10 Full vs. Sparse Comparison FULL SP2Ling CLA [1]

11 Full vs. Sparse Comparison FULL SP2Ling CLA SP2 R2+ R4+ [1]

12 Full vs. Sparse Comparison Sparseness benefits adders with large carry trees FULL SP4Ling CLA SP2SP4 R2++ R4+– [1]

13 Optimal Adder Ling’s equations Radix-4 sparse-2 Domino carry tree Static sum-precompute Delay of fastest adder: 7.3 FO4 [1]

14 Radix-4 Sparse-2 Carry Tree Computes every other Ling pseudo-carry: H0, H2, H4 … Each output selects two sums [1]

15 Adder Core Block Diagram Critical paths implemented in clock-delayed domino Non-critical paths implemented in static At-speed BIST [1]

16 Timing Diagram 20 ps margin on all edges; Adjustable hard edges Delay spread places precharge in critical path [1]

17 Layout Floorplan Bitslice height: 24 metal tracks Aligned clock lines Sum precompute occupies space freed by sparse carry tree [1]

18 90 nm Test Chip 1.7 mm 1.6 mm 90 nm GP 7M 1P SVT transistors V DD = 1V 8 adder cores + test circuitry Core 1: this work Cores 2-8: Supply noise measurements and supply grid experiments [4]. Adder core size: 417 x 75  m 2 [1]

19 [1]

20 Chip Packaging Chip-on-board: Bond wires 60% shorter Cleaner supply  10 ps shorter delays Advance ProgramDigest [1]

21 Measured Results: Delay CHIP-ON-BOARD: V DD = 1 V –Average: 240 ps –Fastest: 226 ps V DD = 1.3 V –Average: 180 ps D avg = 7.5 FO4 [1]

22 Measured Results: Power V DD = 1V:P max = 260 mW V DD = 1.3V:P max = 606 mW Adder core Clk gen BIST Leakage [1]

23 Conclusion 90 nm GP 7M 1P SVT transistors V DD = 1V 8 adder cores + test circuitry Adder core size: 417 x 75  m 2

24 64-bit Adders on IEEE Xplore Summary Ling radix-4 sparse-2 domino carry tree 90nm GP CMOS: 240ps, [1]

25 References [1]. S. Kao, R. Zlatanovici, B. Nikolic, “A 240ps 64-bit Carry-Lookahead Adder in 90nm CMOS,” ISSCC2006, Feb [2]. H. Ling, “High Speed Binary Adder,” IBM J. R&D, vol. 25, no. 3, pp , May, [3]. R. Zlatanovici, B. Nikolic, “Power – Performance Optimization for Custom Digital Circuits,” Proc. PATMOS, pp , Sept., [4] V. Abramzon, E. Alon, M. Horowitz Stanford University