High-Speed/Low Power At Architectural Level

Slides:



Advertisements
Similar presentations
Digital Integrated Circuits© Prentice Hall 1995 Combinational Logic COMBINATIONAL LOGIC.
Advertisements

Feb. 17, 2011 Midterm overview Real life examples of built chips
TOPIC : SYNTHESIS DESIGN FLOW Module 4.3 Verilog Synthesis.
Gregory Shklover, Ben Emanuel Intel Corporation MATAM, Haifa 31015, Israel Simultaneous Clock and Data Gate Sizing Algorithm with Common Global Objective.
Introduction To VHDL for Combinational Logic
Digital Design with VHDL Presented by: Amir Masoud Gharehbaghi
1 VLSI Digital System Design Clocking. 2 Clocked System Basic structure Q DlogicQ D clock.
WATERLOO ELECTRICAL AND COMPUTER ENGINEERING 40s: Circuits 1 WATERLOO ELECTRICAL AND COMPUTER ENGINEERING 40s Circuits Department of Electrical and Computer.
Digital Integrated Circuits© Prentice Hall 1995 Devices The MOS Transistor.
High-Level Constructors and Estimators Majid Sarrafzadeh and Jason Cong Computer Science Department
CSE241 Formal Verification.1Cichy, UCSD ©2003 CSE241A VLSI Digital Circuits Winter 2003 Recitation 6: Formal Verification.
SEQUENTIAL LOGIC Digital Integrated Circuits© Prentice Hall 1995 Introduction.
Digital Integrated Circuits© Prentice Hall 1995 Timing ISSUES IN TIMING.
Spring 07, Feb 20 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Reducing Power through Multicore Parallelism Vishwani.
Power-Aware Placement
Spring 08, Jan 15 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
Dec. 6, 2005ELEC Glitch Power1 Low power design: Insert delays to eliminate glitches Yijing Chen Dec.6, 2005 Auburn university.
Spring 07, Jan 16 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Introduction Vishwani D. Agrawal James J. Danaher.
1 ACS Unit of Viterbi Decoder Audy,Garrick Ng, Ichang Wu, Wen-Jiun Yong Advisor: Dave Parent Spring 2005.
Digital Integrated Circuits© Prentice Hall 1995 Sequential Logic SEQUENTIAL LOGIC.
1 Farhan Mohamed Ali (W2-1) Jigar Vora (W2-2) Sonali Kapoor (W2-3) Avni Jhunjhunwala (W2-4) Presentation 5 MAD MAC nd February, 2006 Top Level Integration.
1 5-bit Flash Encoder Nam Van Do, Dave Flores, Shawn Smith Advisor: Dr. David Parent December 6, 2004.
More Realistic 16-Tap FIR Presented By Lihua, DONG Deyan, LIU.
1 32-bit parallel load register with clock gating ECE Department, 200 Broun Hall, Auburn University, Auburn, AL 36849, USA Lan Luo ELEC.
1 3/22/02 Benchmark Update u Carnegie Cell Library: “Free to all who Enter” s Need to build scaling model of standard cell library s Based on our open.
Digital Integrated Circuits© Prentice Hall 1995 Combinational Logic COMBINATIONAL LOGIC.
EGRE 427 Advanced Digital Design Figures from Application-Specific Integrated Circuits, Michael John Sebastian Smith, Addison Wesley, 1997 Chapter 3 ASIC.
ECE 331 – Digital System Design Power Dissipation and Propagation Delay.
GOOD MORNING.
156 / MAPLD 2005 Rollins 1 Reducing Energy in FPGA Multipliers Through Glitch Reduction Nathan Rollins and Michael J. Wirthlin Department of Electrical.
Digital Integrated Circuits© Prentice Hall 1995 Sequential Logic SEQUENTIAL LOGIC.
Power Saving at Architectural Level Xiao Xing March 7, 2005.
Institute of Applied Microelectronics and Computer Engineering College of Computer Science and Electrical Engineering, University of Rostock Slide 1 Spezielle.
1 Digital System Design Subject Name : Digital System Design Course Code : IT- 308 Instructor : Amit Prakash Singh Home page :
Principles Of Digital Design Chapter 1 Introduction Design Representation Levels of Abstraction Design Tasks and Design Processes CAD Tools.
Low Power Architecture and Implementation of Multicore Design Khushboo Sheth, Kyungseok Kim Fan Wang, Siddharth Dantu ELEC6270 Low Power Design of Electronic.
TO THE COURSE ON DIGITAL DESIGN FOR INSTRUMENTATION TO THE COURSE ON DIGITAL DESIGN FOR INSTRUMENTATION.
What is an And Gate? It is a digital circuit that produce logical operations The logical operations are call Boolean logical Boolean operation consist.
Sequential Arithmetic ELEC 311 Digital Logic and Circuits Dr. Ron Hayne Images Courtesy of Cengage Learning.
ECE 8053 – Project Fall’02 Design of 64-bit Low Power Spanning Tree Carry Lookahead Adder Presented by Suderson Soundararajan.
CPEN Digital System Design
1 ECE1352F – Topic Presentation - ADPLL By Selvakkumaran S.
Low-Power and Area-Efficient Carry Select Adder on Reconfigurable Hardware Presented by V.Santhosh kumar, B.Tech,ECE,4 th Year, GITAM University Under.
Modern VLSI Design 3e: Chapters 3 & 8Partly from 2002 Prentice Hall PTR week6-1 Lectures 16 Transfer Characteristics (Delay and Power) Feb. 10, 2003.
Area: VLSI Signal Processing.
Electrical and Computer Engineering University of Cyprus LAB 1: VHDL.
LOGIC OPTIMIZATION USING TECHNOLOGY INDEPENDENT MUX BASED ADDERS IN FPGA Project Guide: Smt. Latha Dept of E & C JSSATE, Bangalore. From: N GURURAJ M-Tech,
DEFENSE EXAMINATION GEORGIA TECH ECE P. 1 Fully Parallel Learning Neural Network Chip for Real-time Control Jin Liu Advisor: Dr. Martin Brooke Dissertation.
FPGA-Based System Design: Chapter 6 Copyright  2004 Prentice Hall PTR Topics n Low power design. n Pipelining.
Class Report 林常仁 Low Power Design: System and Algorithm Levels.
Class Report 何昭毅 : Voltage Scaling. Source of CMOS Power Consumption  Dynamic power consumption  Short circuit power consumption  Leakage power consumption.
Institute of Applied Microelectronics and Computer Engineering College of Computer Science and Electrical Engineering, University of Rostock Slide 1 Spezielle.
Real-Time System-On-A-Chip Emulation.  Introduction  Describing SOC Designs  System-Level Design Flow  SOC Implemantation Paths-Emulation and.
Institute of Applied Microelectronics and Computer Engineering College of Computer Science and Electrical Engineering, University of Rostock Slide 1 Spezielle.
TOPIC : Introduction to Sequential Circuits UNIT 1: Modeling and Simulation Module 4 : Modeling Sequential Circuits.
VU-Advanced Computer Architecture Lecture 1-Introduction 1 Advanced Computer Architecture CS 704 Advanced Computer Architecture Lecture 1.
-1- Soft Core Viterbi Decoder EECS 290A Project Dave Chinnery, Rhett Davis, Chris Taylor, Ning Zhang.
SUBJECT : DIGITAL ELECTRONICS CLASS : SEM 3(B) TOPIC : INTRODUCTION OF VHDL.
Gopakumar.G Hardware Design Group
Final Project Report 64 points FFT
ELEC 7770 Advanced VLSI Design Spring 2016 Introduction
VLSI Testing Lecture 5: Logic Simulation
Very low voltage 16-bit counter in high leakage static CMOS technology
Output Prediction Logic (OPL)
ELEC 7770 Advanced VLSI Design Spring 2014 Introduction
Week 5, Verilog & Full Adder
ELEC 7770 Advanced VLSI Design Spring 2012 Introduction
ELEC 7770 Advanced VLSI Design Spring 2010 Introduction
Comparison of Various Multipliers for Performance Issues
Chapter 14 Arithmetic Circuits (II): Multiplier Rev /12/2003
Presentation transcript:

High-Speed/Low Power At Architectural Level The Circuit Under Study Behavioral Simulations to Verify Functionality Speed & Area & Power Results, Through Synthesis with Synopsis, @ 1.6V Default in CMOSP18 Cell Library Xiao Xing

The Circuit Under Study

Behavior Simulations (Combinational)

Behavior Simulation (2-Stage Pipeline)

Behavior Simulation (4-Stages Pipeline)

Behavior Simulation (8-Stages Pipeline)

Behavior Simulation ( 2 Combinational Multipliers In Parallel)

Behavior Simulation (2 2-Stage Pipelines In Parallel

Version of Array Multiplier Purely-Combinational 2 Pipe-Lines 4 Pipe-Lines 8 Pipe-Lines 2 Combos In Parallel 2 2-Stage Pipelines In Parallel Area Combinational 5366.566 6626.903 9964.770 10009.5 13091.204 14.973.586 Sequential   2931.304 6460.256 13412.431 1496.144 7370.954 Total Cell Area 5366.568 9558.213 16425.031 23421.957 14587.317 22344.514 Frequency/Throughput 1 / 14.38 ns = 66 MHz 1/8ns = 125 Hz 1/4ns = 250 MHz 1/3ns = 333 MHz 1/15ns = 66 MHz 1/9ns = 111 MHz Power Cell Internal Power 3.9025mW (43%) 1.088 mW 3.2437 mW 6.6204 mW 656.6855uW (60%) 2.188 mW (73%) Net Switching Power 5.2012 mW (57%) 384.571 uW (26%) 814.7785 uW (20%) 908.6389 uW (12%) 431.1658 uW (40%) 817.109 uW (27%) Total Dynamic Power 9.1037 mW (100%) 1.4726 mW (100%) 4.0619 mW (100%) 7.5291 mW (100%) 1.0879 mW (100%) 3.0051 mW (100%) Cell Leakage Power 169.6141 nW 515.6460 nW 823.4979 nW 953.3696 nW 881.6794 nW 1.1654uW    

Comparison of Total Cell Area

Comparison of Frequency/Through-Put

Comparison of Power

Summary 8-Stage Pipeline is the most Area, Power Consuming, but also the Fastest of all Array Multipliers Implemented While 8-Stages is twice of 4-stages in terms of area and Power, its only about 1.3 Times Better in Speed (Diminishing Return), confirming the prediction that one can NOT Gain Infinitely Speed, in Simply Increasing the number of Pipeline Stages With Parallelism, one Should Gain consistent Twice the Speed, at Twice the Area and Power Consumed; But Clock/Data Skew becomes a problem with Larger Parallelism, even if Power And Area is of no Issue 2 2-Stage Parallel Array Multiplier, is the 2nd Fastest, Most Power/Area Consuming; 4-Stage Pipeline come close to it in every department

Possible Enhancements Gate-Level Net List Simulation to Verify, real life Functionalities and Performances Correct the Synthesis Result for the Purely Combinational Array Multiplier Performance and cost of 4 and 8 Pipelines In Parallel

References [1] Jia Di, J. S. Yuan and R. DeMara, Improving Power-awareness of Pipelined Array Multipliers using 2-Dimensional Pipeline Gating and its Application on FIR Design, Integration, the VLSI Journal, Vol. 38,No. 3, February 2005 [2] Jan, M Raebey, Digital Integrated Circuits, 2nd Edition, Prentice Hall,2003 [3] John R, Calvert, Design of a Synchronous Pipelined Multiplier and Analysis of Clock Skew in High-Speed Digital Systems, NAVAL POSTGRADUATE SCHOOL MONTEREY CA , Dec 2000 [4] Daniel J. Sorin, Pipelining and Advanced Topics, http://www.ee.duke.edu/~sorin/ece152/lectures/7.3-pipelining.pdf, Jan 2005