Introduction to VLSI Circuits and Systems, NCUT 2007 Chapter 15 System-Level Physical Design Introduction to VLSI Circuits and Systems 積體電路概論 賴秉樑 Dept.

Slides:



Advertisements
Similar presentations
Introduction to CMOS VLSI Design Sequential Circuits.
Advertisements

Lecture 11: Sequential Circuit Design. CMOS VLSI DesignCMOS VLSI Design 4th Ed. 11: Sequential Circuits2 Outline  Sequencing  Sequencing Element Design.
Storey: Electrical & Electronic Systems © Pearson Education Limited 2004 OHT 10.1 Sequential Logic  Introduction  Bistables  Memory Registers  Shift.
CHAPTER 3 Sequential Logic/ Circuits.  Concept of Sequential Logic  Latch and Flip-flops (FFs)  Shift Registers and Application  Counters (Types,
Digital Logic Chapter 5 Presented by Prof Tim Johnson
Sequential Circuits1 DIGITAL LOGIC DESIGN by Dr. Fenghui Yao Tennessee State University Department of Computer Science Nashville, TN.
(Neil west - p: ). Finite-state machine (FSM) which is composed of a set of logic input feeding a block of combinational logic resulting in a set.
EKT 124 / 3 DIGITAL ELEKTRONIC 1
Chapter 08 Designing High-Speed CMOS Logic Networks
Introduction to CMOS VLSI Design Lecture 19: Design for Skew David Harris Harvey Mudd College Spring 2004.
Chapter 09 Advanced Techniques in CMOS Logic Circuits
Introduction to CMOS VLSI Design Clock Skew-tolerant circuits.
Sequential Definitions  Use two level sensitive latches of opposite type to build one master-slave flipflop that changes state on a clock edge (when the.
EE141 © Digital Integrated Circuits 2nd Timing Issues 1 Digital Integrated Circuits A Design Perspective Timing Issues Jan M. Rabaey Anantha Chandrakasan.
Synchronous Digital Design Methodology and Guidelines
CSE477 L19 Timing Issues; Datapaths.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 19: Timing Issues; Introduction to Datapath.
Clock Design Adopted from David Harris of Harvey Mudd College.
Chapter 11 Timing Issues in Digital Systems Boonchuay Supmonchai Integrated Design Application Research (IDAR) Laboratory August 20, 2004; Revised - July.
ECE 424 – Introduction to VLSI Design Emre Yengel Department of Electrical and Communication Engineering Fall 2014.
FPGA-Based System Design: Chapter 3 Copyright  2004 Prentice Hall PTR SRAM-based FPGA n SRAM-based LE –Registers in logic elements –LUT-based logic element.
Henry Hexmoor1 Chapter 7 Henry Hexmoor Registers and RTL.
Introduction to VLSI Circuits and Systems, NCUT 2007 Chapter 6 Electrical Characteristic of MOSFETs Introduction to VLSI Circuits and Systems 積體電路概論 賴秉樑.
Digital Design: Chapters Chapter 1. Introduction Digital Design - Logic Design? Analog versus Digital Once-analog now goes digital –Still pictures.
Embedded Systems Hardware:
Introduction to VLSI Circuits and Systems, NCUT 2007 Chapter 12 Arithmetic Circuits in CMOS VLSI Introduction to VLSI Circuits and Systems 積體電路概論 賴秉樑 Dept.
Modern VLSI Design 2e: Chapter 5 Copyright  1998 Prentice Hall PTR Topics n Memory elements. n Basics of sequential machines.
Programmable logic and FPGA
11/10/2004EE 42 fall 2004 lecture 301 Lecture #30 Finite State Machines Last lecture: –CMOS fabrication –Clocked and latched circuits This lecture: –Finite.
Chapter 01 An Overview of VLSI
Logic and Computer Design Fundamentals Registers and Counters
Embedded Systems Hardware: Storage Elements; Finite State Machines; Sequential Logic.
COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Hao Ji.
Chapter #6: Sequential Logic Design 6.2 Timing Methodologies
Introduction to CMOS VLSI Design Lecture 10: Sequential Circuits Credits: David Harris Harvey Mudd College (Material taken/adapted from Harris’ lecture.
Comparators  A comparator compares two input words.  The following slide shows a simple comparator which takes two inputs, A, and B, each of length 4.
Tanenbaum, Structured Computer Organization, Fifth Edition, (c) 2006 Pearson Education, Inc. All rights reserved The Digital Logic Level.
Chapter 6 Memory and Programmable Logic Devices
Sequential Circuit  It is a type of logic circuit whose output depends not only on the present value of its input signals but on the past history of its.
Chapter 3: Sequential Logic Circuit EKT 121 / 4 ELEKTRONIK DIGIT 1.
Digital Integrated Circuits for Communication
Lecture 10 Topics: Sequential circuits Basic concepts Clocks
CS3350B Computer Architecture Winter 2015 Lecture 5.2: State Circuits: Circuits that Remember Marc Moreno Maza [Adapted.
© Digital Integrated Circuits 2nd Sequential Circuits Digital Integrated Circuits A Design Perspective Designing Sequential Logic Circuits Jan M. Rabaey.
Introduction to Digital Logic Design Appendix A of CO&A Dr. Farag
Registers CPE 49 RMUTI KOTAT.
Logic and Computer Design Dr. Sanjay P. Ahuja, Ph.D. FIS Distinguished Professor of CIS ( ) School of Computing, UNF.
Chapter 07 Electronic Analysis of CMOS Logic Gates
Digital Logic Design Review Dr. Ahmad Almulhem ahmadsm AT kfupm Phone: Office: Ahmad Almulhem, KFUPM 2010.
© 2009 Pearson Education, Upper Saddle River, NJ All Rights ReservedFloyd, Digital Fundamentals, 10 th ed Digital Fundamentals with PLD Programming.
Microelectronic Circuits - Fourth Edition Sedra/Smith 0 PowerPoint Overheads to Accompany Sedra/Smith Microelectronic Circuits 4/e ©1999 Oxford University.
Memory and Storage Dr. Rebhi S. Baraka
Basic Sequential Components CT101 – Computing Systems Organization.
2017/4/24 1.
June 2005Computer Architecture, Background and MotivationSlide 1 Part I Background and Motivation.
Floyd, Digital Fundamentals, 10 th ed Digital Fundamentals Tenth Edition Floyd © 2008 Pearson Education Chapter 1.
Introduction to Clock Tree Synthesis
EKT 121 / 4 ELEKTRONIK DIGIT I
Hardwired Control Department of Computer Engineering, M.S.P.V.L Polytechnic College, Pavoorchatram. A Presentation On.
CS151 Introduction to Digital Design Chapter 5: Sequential Circuits 5-1 : Sequential Circuit Definition 5-2: Latches 1Created by: Ms.Amany AlSaleh.
REGISTER TRANSFER LANGUAGE (RTL) INTRODUCTION TO REGISTER Registers1.
Processor Organization and Architecture Module III.
RTL Hardware Design by P. Chu Chapter 9 – ECE420 (CSUN) Mirzaei 1 Sequential Circuit Design: Practice Shahnam Mirzaei, PhD Spring 2016 California State.
EE141 Timing Issues 1 Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003 Rev /05/2003.
Chapter 3 Boolean Algebra and Digital Logic T103: Computer architecture, logic and information processing.
Sequential Logic Design
LATCHED, FLIP-FLOPS,AND TIMERS
Chapter #6: Sequential Logic Design
Topics SRAM-based FPGA fabrics: Xilinx. Altera..
CPE/EE 422/522 Advanced Logic Design L03
Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003
Presentation transcript:

Introduction to VLSI Circuits and Systems, NCUT 2007 Chapter 15 System-Level Physical Design Introduction to VLSI Circuits and Systems 積體電路概論 賴秉樑 Dept. of Electronic Engineering National Chin-Yi University of Technology Fall 2007

Introduction to VLSI Circuits and Systems, NCUT 2007 Outline  Clocked Flip-flops  CMOS Clocking Styles  Pipelined Systems  Clock Generation and Distribution  System Design Considerations

Introduction to VLSI Circuits and Systems, NCUT 2007 Clocked Flip-flops  Synchronous design employs clocking signals to coordinate the movement of data through the system  In Figure 15.2, the data bit D is loaded into the DFF only on a rising clock edge  In high speed design, the limiting circuit factor is the DFF delay time t ff that is determined by the electronics and the load » Decreasing t ff allows for a higher frequency clock » The tradeoff of speed and power Figure 15.1 Ideal clocking signal Figure 15.2 Timing in a DFF (15.1) Where t o is the rise edge, and t ff is the time delay when the output has this value

Introduction to VLSI Circuits and Systems, NCUT 2007 Classical State Machines (1/2)  Two models for state machines that use single-clock timing are shown in Figure 15.3 » Moore machine and Mealy machine » Huffman model (Figure 15.4): contains both the Moore and Mealy models Figure 15.3 Moore and mealy state machines Figure 15.4 Huffman model of a state machine (a) Moore machine (b) Mealy machine

Introduction to VLSI Circuits and Systems, NCUT 2007 Classical State Machines (2/2)  FPGA design are also heavily based on classical state machine theory » For examples, in combinational logic, Individual gates, PLAs, Programming Logic Device (PLD), and groups of multiplexors » Programming is achieved with EPROMs, fuses, SRAM arrays, and some contain lookup tables (LUT, e.g. FPGA) to aid in the design  In Figure 15.5 Figure 15.5 Huffman state machine using PLA logic Where, t ff is the delay time form input to output of the flip-flop t d is the logic delay time through the PLA t su is the “setup time” of the flip-flip (15.2) (15.3) (The AND-plane of the PLA can be programmed to produce minterms m r ) (The clock T must be large enough to allow completion)

Introduction to VLSI Circuits and Systems, NCUT 2007 Outline  Clocked Flip-flops  CMOS Clocking Styles  Pipelined Systems  Clock Generation and Distribution  System Design Considerations

Introduction to VLSI Circuits and Systems, NCUT 2007 Clocked Logic Cascades  Ideal waveforms for the complimentary signal Figure 15.6 Complementary clocks Figure 15.7 Clock-controlled transistors (a) FETs(b) Transmission gate Figure 15.8 A clocked cascade (15.4) (Figure 15.6) (Figure 15.7) (15.5)

Introduction to VLSI Circuits and Systems, NCUT 2007 Timing Circles and Clock Skew  Timing circles: are simple constructs that can be useful for visualizing data transfer » In Figure 15.9, this defines what is known as a 50% duty cycle  Clock skew: the timing of a clock is out of phase with the system reference Figure 15.9 Timing circle for a single-clock, dual-phase cascade Figure Clock generation circuit Figure Clock skew Figure Timing circle with clock skew

Introduction to VLSI Circuits and Systems, NCUT 2007 Circuit Effects and Clock Frequency (1/2)  The logic-level description of the clocked cascade masks the circuit characteristics that determine the ultimate speed Figure Shift register circuit Figure Charge leakage in the shift register (a) Circuit (b) Voltage decay (15.7) (15.8) (15.9) (15.10) (15.11) (15.12) (15.13) (15.14) (15.15)

Introduction to VLSI Circuits and Systems, NCUT 2007 Circuit Effects and Clock Frequency (2/2) Figure Clocking waveforms with finite rise and fall times Figure Static shift register design

Introduction to VLSI Circuits and Systems, NCUT 2007 Dual Non-overlapping Clocks  In this technique, two distinct non-overlapping clocks ψ 1 and ψ 2 are used such that (Fig and 15.18)  Finite-state machines that are based on dual- clock schemes can provide powerful interactive capabilities (Fig ) Figure Dual non-overlapping clocks Figure Timing circuit for a 2-clock network (15.17) Figure A dual-clock finite-state machine design

Introduction to VLSI Circuits and Systems, NCUT 2007 Other Multiple-clock Schemes  It is possible to create different mutliple- clock schemes to control clocked logic cascades and state machine » For example, a triple, non-overlapping clock set  However, in modern high-speed VLSI, complicated clocking schemes introduce too many problems » Solution: speed gains are accomplished by improved circuit design, processing, and architectural modifications » The most popular approach is to use a single-clock, dual-phase system Figure Triple, non-overlapping clock signals Figure Timing circle for a 3-clock non-overlapping network

Introduction to VLSI Circuits and Systems, NCUT 2007 Dynamic Logic Cascades (1/2)  Dynamic logic circuits achieve synchronized data flow by controlling the internal operational states of the logic gate circuits » Typical domino logic state: In section 9.5 of Chapter 9 » P: per-charge phase » E: Evaluation phase  In Figure 15.23, Figure Operation of a domino logic state Figure A dynamic logic cascade

Introduction to VLSI Circuits and Systems, NCUT 2007 Dynamic Logic Cascades (2/2)  In Figure 15.24, the data transfer into and out of a dynamic logic cascade is sequenced with the clock » The number of stages that can be included in the chain is determined by the delay for the case where every stage switches » However, this introduces charge leakage problem » Solution: charge keeper circuits  True Single-Phase Clock (TSPC): use only a single clock ψ throughout » Two TSPC latches are shown in Figure Figure Timing sequence in the domino cascade Figure True single-phase clock latches (a) n-block(b) p-block

Introduction to VLSI Circuits and Systems, NCUT 2007 Outline  Clocked Flip-flops  CMOS Clocking Styles  Pipelined Systems  Clock Generation and Distribution  System Design Considerations

Introduction to VLSI Circuits and Systems, NCUT 2007 Basic Concept of Pipelining  Pipelining is a tech. that is used to increase the throughput of a sequential set of distinct data inputs through a synchronous logic cascade Figure Waveform quantities for timing analysis Figure Basic pipelined stage for timing analysis (15.18) (15.19) (15.20) Where, t ff : flip-flop delay time t d : logic delay time t su : setup time of the flip-flip t hold : hold time of the flip-flip (PW: pulse width of the clock)

Introduction to VLSI Circuits and Systems, NCUT 2007 Pipelining (1/2)  Pipelined systems are designed to increase the overall throughput of a set of sequential input states by dividing the cascade into small visualization of the problem (Figure 15.28)  Once a circuit completes a calculation and passes the result on to the next stage, it remain idle for the rest of the clock cycle (Figure 15.29) Figure Circuit activity in a logic cascade Figure Logic chains in a clocked system

Introduction to VLSI Circuits and Systems, NCUT 2007 Pipelining (2/2)  Since the delay through a logic gate varies its complexity and parasitics, the logic propagation rate will not be uniform  In Figure 15.31, dividing the long logic into small groups, add registers between the sections, and use a faster clock, then most of the circuits will be active at any given time Figure A 4-stage pipeline Figure Progression times in the logic cascade (15.21) (15.22) (15.23) Figure Pipeline with positive edge and negative edge-triggering

Introduction to VLSI Circuits and Systems, NCUT 2007 Outline  Clocked Flip-flops  CMOS Clocking Styles  Pipelined Systems  Clock Generation and Distribution  System Design Considerations

Introduction to VLSI Circuits and Systems, NCUT 2007 Clock Distribution  When frequencies f reach the 1GHz (10 9 Hz) level corresponding to a clock period of  However, distribution of the clocking signal to various points of the chip is complicated because the intrinsic RC time delay τ increases as the square of the line length l according to  Problems: clock skew and signal distortion will be very difficult to deal with in large chips Figure A 4-stage pipeline (15.24) (15.25) (15.26) (15.27)

Introduction to VLSI Circuits and Systems, NCUT 2007 Clock Stabilization and Generation (1/2) Figure A basic clock stabilization network Figure Phase-locked loop (PLL) stabilization circuit

Introduction to VLSI Circuits and Systems, NCUT 2007 Clock Stabilization and Generation (2/2) Figure Inverter-based clock generation circuit Figure Skew minimization circuit Figure Generating complementary clocks using a latch Figure Circuit for producing non-overlapping clocks (15.28)

Introduction to VLSI Circuits and Systems, NCUT 2007 Clock Routing and Driver Trees Figure Simplified view of the clock routing problem Figure Partitioning steps for defining clocking groups (a) Clocking points(b) First grouping (c) Interior routing (a) First(b) Second(c) Third(d) Routing

Introduction to VLSI Circuits and Systems, NCUT 2007 Driver Tree (1/2) Figure Geometrical analysis of the letter “H” Figure Macro-level H-type distribution tree Figure Driver tree arrangement (a) Driver tree(b) Application to H-tree Figure Driver tree design with interconnect parasitics

Introduction to VLSI Circuits and Systems, NCUT 2007 Driver Tree (2/2) Figure Distribution scheme with equivalent driver segments Figure Electrical circuit for a non-symmetrical distribution Figure Single driver tree with multiple outputs (a) Driver tree(b) Chip distribution Figure Asynchronous system clocking Figure Operation of a self-timed element

Introduction to VLSI Circuits and Systems, NCUT 2007 Outline  Clocked Flip-flops  CMOS Clocking Styles  Pipelined Systems  Clock Generation and Distribution  System Design Considerations

Introduction to VLSI Circuits and Systems, NCUT 2007 Bit Slice Design (1/2)  For example, an ALU circuit design (Fig )  Bit slice design is based on the fact that logic deals with data at the bit-level Figure An n-bit ALUFigure Block description (15.30)

Introduction to VLSI Circuits and Systems, NCUT 2007 Bit Slice Design (2/2)  With the bit slice philosophy, an n-bit ALU is created by paralleling n identical slices as shown in Figure » The repetition will also appear at the logic, circuit, and silicon levels » Advantage: library and instance building for large system design  For example, in Verilog Figure An ALU bit slice Figure ALU design with bit slices (1-bit data-type) (32-bit data-type)

Introduction to VLSI Circuits and Systems, NCUT 2007 Cache Memory  Cache memory is designed to be place in between the CPU and the main memory to speed up the system operation » SRAM memory structure » Instruction-cache (I-cache) : is used to hold instructions that are fetched from the M.M. where the program to run freely » Data-cache (D-cache): is used to hold the results of computations  Figure shows a simple block diagram for a dual-issue superscalar design using I- cache and D-cache (Computer Architecture) Figure Adding cache memory Figure Block diagram of a dual- issue superscalar machine (a) Basic system (b) Cache modification

Introduction to VLSI Circuits and Systems, NCUT 2007 Systolic Systems and Parallel Processing  In a systolic system, the movement of the data is controlled by a clock, moving one phase on each cycle  In parallel processing, a PE (processor element) can be as simple as an AND gate, or as a complex as a general-purpose microprocessor (Fig ) » Individual process elements communicate via switching arrays, which are controlled by either local or global signals » The strongest aspects of VLSI: Wafer scale integration, WSI Figure Regular patterning in a parallel processing network