Design Economics. IC designer should able to predict the cost and the time to design a particular IC. This guides the choice of implementation strategy.

Slides:



Advertisements
Similar presentations
Packaging.
Advertisements

Topics Electrical properties of static combinational gates:
Lecture 23: I/O. CMOS VLSI DesignCMOS VLSI Design 4th Ed. 23: I/O2 Outline  Basic I/O Pads  I/O Channels –Transmission Lines –Noise and Interference.
555 Timer ©Paul Godin Updated February Oscillators ◊We have looked at simple oscillator designs using an inverter, and had a brief look at crystal.
   >>> 
EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Introduction to CMOS VLSI Design Lecture 21: Scaling and Economics
Introduction to CMOS VLSI Design Lecture 19: Design for Skew David Harris Harvey Mudd College Spring 2004.
Introduction to CMOS VLSI Design Clock Skew-tolerant circuits.
Lecture 21: Packaging, Power, & Clock
Clock Design Adopted from David Harris of Harvey Mudd College.
Lecture 21: Packaging, Power, & Clock
Digital Integrated Circuits© Prentice Hall 1995 Interconnect COPING WITH INTERCONNECT.
Introduction to CMOS VLSI Design Lecture 20: Package, Power, and I/O
S. Reda EN160 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 14: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Design and Implementation of VLSI Systems (EN1600)
Lecture 8: Clock Distribution, PLL & DLL
11/5/2004EE 42 fall 2004 lecture 281 Lecture #28 PMOS LAST TIME: NMOS Electrical Model – NMOS physical structure: W and L and d ox, TODAY: PMOS –Physical.
Prof. John Nestor ECE Department Lafayette College Easton, Pennsylvania ECE VLSI Circuit Design Lecture 20 - Chip-Level.
04/09/02EECS 3121 Lecture 25: Interconnect Modeling EECS 312 Reading: 8.3 (text), 4.3.2, (2 nd edition)
Institute of Digital and Computer Systems 1 Fabio Garzia / Finding Peak Performance in a Process23/06/2015 Chapter 5 Finding Peak Performance in a Process.
From Compaq, ASP- DAC00. Power Consumption Power consumption is on the rise due to: - Higher integration levels (more devices & wires) - Rising clock.
Interconnessioni e parassiti1 Progettazione di circuiti e sistemi VLSI Anno Accademico Lezione Interconnessioni e parassiti.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Microwave Interference Effects on Device,
VLSI Digital System Design
Lecture 5 – Power Prof. Luke Theogarajan
Lecture 7: Power.
Microcomputer Buses Outline –What is a Bus? –Interfaces –Open Collector Buses –Tristate Buses –Bus Contention –Transmission Lines Goal –Understand bus.
Introduction to CMOS VLSI Design Lecture 21: Scaling and Economics Credits: David Harris Harvey Mudd College (Material taken/adapted from Harris’ lecture.
EKT214 - ANALOG ELECTRONIC CIRCUIT II
IC packaging and Input - output signals
The printed circuit board (PCB) design
6.893: Advanced VLSI Computer Architecture, September 28, 2000, Lecture 4, Slide 1. © Krste Asanovic Krste Asanovic
EE414 VLSI Design Design Metrics in Design Metrics in VLSI Design [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Switched capacitor DC-DC converter ASICs for the upgraded LHC trackers M. Bochenek 1,2, W. Dąbrowski 2, F. Faccio 1, S. Michelis 1 1. CERN, Conseil Européen.
By Premananda B.S. Design of VLSI Systems By Premananda B.S.
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 13.1 EE4800 CMOS Digital IC Design & Analysis Lecture 13 Packaging, Power and Clock Distributions.
Modern VLSI Design 4e: Chapter 7 Copyright  2008 Wayne Wolf Topics Global interconnect. Power/ground routing. Clock routing. Floorplanning tips. Off-chip.
Lecture 23: I/O. CMOS VLSI DesignCMOS VLSI Design 4th Ed. 23: I/O2 Outline  Basic I/O Pads  I/O Channels –Transmission Lines –Noise and Interference.
Practical Digital Design Considerations Part 1 Last Mod: January 2008 ©Paul R. Godin.
Research on Analysis and Physical Synthesis Chung-Kuan Cheng CSE Department UC San Diego
Logic Synthesis for Low Power(CHAPTER 6) 6.1 Introduction 6.2 Power Estimation Techniques 6.3 Power Minimization Techniques 6.4 Summary.
Digital Integrated Circuits© Prentice Hall 1995 Interconnect COPING WITH INTERCONNECT.
FPGA-Based System Design: Chapter 2 Copyright  2003 Prentice Hall PTR Topics n Off-chip connections.
Modern VLSI Design 2e: Chapter 3 Copyright  1998 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
1 OUTPUT Pad and Driver. 2 CLOCK DRIVER 3 Buffering S = scaling or tapering factor CL = S N+1 Cg ……………… All inverters have identical delay of t o = delay.
Introduction to CMOS VLSI Design Lecture 25: Package, Power, Clock, and I/O David Harris Harvey Mudd College Spring 2007.
EE4800 CMOS Digital IC Design & Analysis
Modern VLSI Design 4e: Chapter 3 Copyright  2008 Wayne Wolf Topics n Pseudo-nMOS gates. n DCVS logic. n Domino gates. n Design-for-yield. n Gates as IP.
Advanced VLSI Design Unit 04: Combinational and Sequential Circuits.
1 Interconnect/Via. 2 Delay of Devices and Interconnect.
Modern VLSI Design 3e: Chapter 7 Copyright  1998, 2002 Prentice Hall PTR Topics n Power/ground routing. n Clock routing. n Floorplanning tips. n Off-chip.
Bi-CMOS Prakash B.
11-1 Integrated Microsystems Lab. EE372 VLSI SYSTEM DESIGNE. Yoon Latch-up & Power Consumption Latch-up Problem Latch-up condition  1   2 >1 GND Vdd.
Introduction to Clock Tree Synthesis
FPGA-Based System Design: Chapter 2 Copyright  2004 Prentice Hall PTR Topics n Logic gate delay. n Logic gate power consumption. n Driving large loads.
Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
PACKAGE FABRICATION TECHNOLOGY Submitted By: Prashant singh.
MICROPROCESSOR DESIGN1 IR/Inductive Drop Introduction One component of every chip is the network of wires used to distribute power from the input power.
High Speed Properties of Digital Gates, Copyright F. Canavero, R. Fantino Licensed to HDT - High Design Technology
Power Distribution Copyright F. Canavero, R. Fantino Licensed to HDT - High Design Technology.
TERMINATIONS Copyright F. Canavero, R. Fantino Licensed to HDT - High Design Technology.
IC packaging and Input - output signals
Topics Off-chip connections..
CMOS VLSI Design Chapter 13 Clocks, DLLs, PLLs
Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003
CMOS VLSI Design Chapter 13 Clocks, DLLs, PLLs
332:578 Deep Submicron VLSI Design Lecture 14 Design for Clock Skew
Presentation transcript:

Design Economics

IC designer should able to predict the cost and the time to design a particular IC. This guides the choice of implementation strategy. Selling price S total = C total / (1-m) Where m = profit margin and C total = total cost Cost to produce an IC are divided into: –Nonrecurring engineering costs (NRE) –Recurring costs –Fixed costs

Non-recurring Engineering Costs Cost once spent during the design of an IC, they include: –Engineering design cost –Prototype manufacturing cost –i.e., F total = E total + P total NRE can be viewed as an investment for which there is a required rate of return. Engineering design costs, include: –Personnel costs –Support costs Prototype manufacturing costs, include: –Mask cost –Test fixture costs –Package tooling

The personnel cost include labor for: –Architectural design –Logic capture –Simulation for functionality –Layout of modules and chip –Timing verification –DRC and tapeout procedures –Test generation The support costs are: –Computer costs –CAD software costs –Training

Recurring Costs The cost that recurs every time an IC is sold. The total cost is Ct = Cprocess + Cpack + Ctest –Cprocess = W / (N.Yd.Ypack.) Fabrication –Wafer cost / (Dice per wafer * Yield) –Wafer cost: $500 - $3000 –Dice per wafer: –Yield: Y = e -AD For small A, Y  1, cost proportional to area For large A, Y  0, cost increases exponentially Packaging Test

Fixed Costs Data sheets and application notes Marketing and advertising

Schedule Estimate the design cost and design time for the system. Selecting the strategy by which the ICs will be available in the right time and price. Experienced person. To estimate schedule some idea of the amount effort required to complete the design. Schedule is a function of personpower. Methods for improving the schedules: –Using a high productivity design method –Improving the productivity of a given technique –Decreasing the complexity of the design task by partitioning

Personpower Tasks required are: Architectural design HDL capture Functional verification PAR Timing verification, signal integrity, reliability verification DRC and tapeout procedures Test generation

Example You want to start a company to build a wireless communications chip. How much venture capital must you raise? Because you are smarter than everyone else, you can get away with a small team in just two years: –Seven digital designers –Three analog designers –Five support personnel

Solution Digital designers: –salary –overhead –computer –CAD tools –Total: Analog designers –salary –overhead –computer –CAD tools –Total: Support staff –salary –overhead –computer –Total: Fabrication –Back-end tools: –Masks: –Total: Summary

Digital designers: –$70k salary –$30k overhead –$10k computer –$10k CAD tools –Total: $120k * 7 = $840k Analog designers –$100k salary –$30k overhead –$10k computer –$100k CAD tools –Total: $240k * 3 = $720k Support staff –$45k salary –$20k overhead –$5k computer –Total: $70k * 5 = $350k Fabrication –Back-end tools: $1M –Masks: $1M –Total: $2M / year Summary –2 $3.91M / year –$8M design & prototype

Cost Breakdown New chip design is fairly capital-intensive Maybe you can do it for less?

Special-purpose Subsystems

Agenda Packaging –Package options –Chip-to-package connections –Package parasitics –Heat dissipation Power Distribution –On-chip power distribution network –Supply noise I/O –Basic I/O pad circuits Clock –Clock system architecture –Global clock generation & distribution –Local clock gaters

Packaging Package functions: –Electrical connection of signals and power from chip to board, with little delay or distortion –Mechanical connection of chip to board –Removes heat produced on chip –Protects chip from mechanical damage –Compatible with thermal expansion –Inexpensive to manufacture and test

Package Options Through-hole vs. surface mount

Multichip Modules Pentium Pro MCM –Fast connection of CPU to cache –Expensive, requires known good dice

Chip-to-Package Bonding Traditionally, chip is surrounded by pad frame: –Metal pads on 100 – 200  m pitch –Gold bond wires attach pads to package –Lead frame distributes signals in package –Metal heat spreader helps with cooling

Advanced Packages Metal leads contribute parasitic inductance and coupling capacitors to their neighbors Fancy packages have many signal, power layers –Like tiny printed circuit boards Flip-chip places connections across surface of die rather than around periphery –Top level metal pads covered with solder balls –Chip flips upside down –Carefully aligned to package (done blind!) –Heated to melt balls –Introduces new testing problems

Package Parasitics Use many V DD, GND in parallel –Inductance, I DD

Bond wires and lead frame contribute parasitic inductance to the signal traces. They also have mutual inductance and capacitive coupling to nearby signal traces, causing crosstalk when multiple signal switch. V DD & GND wires also have inductance from both bond wires and lead frame. They have nonzero resistance, which becomes important for chips drawing large supply current High performances packages often include bypass capacitors between V DD & GND.

Heat Dissipation 60 W light bulb has surface area of 120 cm 2 Itanium 2 die dissipates 130 W over 4 cm 2 –Chips have enormous power densities –Cooling is a serious challenge Advances in heat sinks, fans, packages have raised the practical limit for heat removal from about 8 W in 1985 to nearly 100 W today for affordable packaging. Package spreads heat to larger surface area –Heat sinks may increase surface area further –Fans increase airflow rate over surface area –Liquid cooling used in extreme cases ($$$)

Temperature difference between transistor junctions and the ambient air is,  T =  ja P –  T: temperature rise on chip –  ja : thermal resistance of chip junction to ambient – P: power dissipation on chip Thermal resistances combine like resistors –Series and parallel  ja =  jp +  pa –Series combination Thermal Resistance

Example Your chip has a heat sink with a thermal resistance to the package of 4.0° C/W. The resistance from chip to package is 1° C/W. The system box ambient temperature may reach 55° C. The chip temperature must not exceed 100° C. What is the maximum chip power dissipation? Solution is ( C) / (4 + 1 C/W) = 9 W

Power Distribution Power Distribution Network functions: –Carry current from pads to transistors on chip –Maintain stable voltage with low noise –Provide average and peak power demands –Provide current return paths for signals –Avoid electromigration & self-heating wearout –Consume little chip area and wire –Easy to lay out

Power Requirements V DD = V DDnominal – V droop Want V droop < +/- 10% of V DD L di/dt of bond wire and IR drop across on-chip wires are often a major source of supply noise Sources of V droop –IR drops –L di/dt noise I DD changes on many time scales

IR Drops: Resistance of power supply network includes: –resistance of the on-chip wires and vias, –resistance of bond wires or solder bumps to the package, –resistance of the package planes, –resistance of the PCB planes IR drops arise from both average and instantaneous current requirements. Ldi/dt Noise: Inductance of power supply dominated by the inductance of the bond wires Modern packages devote many of their pins to power and ground to minimize supply inductance Two sources of current transients are switching I/O signals and changes between idle and active mode in the chip core

On-chip Bypass Capacitance The bypass capacitance is distributed across the chip so that a local spike in current can be supplied from nearby bypass capacitance rather than through the resistance of the overall power grid. power distribution network doesn’t really need to carry all of the peak current. Much of the difference between peak and average current may be supplied by local, on-chip bypass capacitors. On-chip bypass capacitors can reduce the amount of metal needed for distribution. It also greatly reduces the di/dt drawn from the package.

Symbiotic Bypass Capacitors Where are the bypass capacitors in this picture? Gates that are not switching at a given instant in time act as symbiotic bypass capacitors If only one gate in 60 switches at a given instant, the bypass capacitance is 30 times the switched capacitance

Power System Model Power comes from regulator on system board: –Board and package add parasitic R and L –Bypass capacitors help stabilize supply voltage –But capacitors also have parasitic R and L Simulate system for time & frequency responses

Input / Output Input/Output System functions: –Communicate between chip and external world –Drive large capacitance off chip –Operate at compatible voltage levels –Provide adequate bandwidth –Limit slew rates to control di/dt noise –Protect chip against electrostatic discharge –Use small number of pins (low cost)

I/O Pad Design Pad Types: –V DD and GND –Output –Input –Bidirectional –Analog

V DD and GND Pads High-performance chips devote about half of their pins to power and ground. This large number of pins is required to carry the high current and to provide low supply inductance. Largest sources of noise in many chips is the ground bounce caused when the output pads switch. The pads must rapidly charge the large external capacitive loads, causing a big current spike and high Ldi/dt noise. The dirty power and ground lines serving the output pads are separated from the main power grid to reduce the coupling of I/O-related noise into the core.

Output Pads Drive large off-chip loads (2 – 50 pF) –With suitable rise/fall times –Requires chain of successively larger buffers Output transistors have gates longer than normal to prevent avalanche breakdown damage and over voltage is applied to the drains. Guard rings to protect against latchup –Noise below GND injects charge into substrate –Large nMOS output transistor –p+ inner guard ring –n+ outer guard ring In n-well

Input Pads Level conversion –Higher or lower off-chip V –May need thick oxide gates Noise filtering –Schmitt trigger –Hysteresis changes V IH, V IL Protection against electrostatic discharge

ESD Protection Static electricity builds up on your body –Shock delivered to a chip can fry thin gates –Must dissipate this energy in protection circuits before it reaches the gates ESD protection circuits –Current limiting resistor –Diode clamps ESD testing –Human body model –Views human as charged capacitor

Bidirectional Pads Need tristate driver on output: –Use enable signal to set direction –Optimized tristate avoids huge series transistors Improved tri-state buffer

Analog Pads Pass analog voltages directly in or out of chip: –No buffering –Protection circuits must not distort voltages

Clocking Synchronous systems use a clock to keep operations in sequence –Distinguish this from previous or next –Determine speed at which machine operates Clock must be distributed to all the sequencing elements –Flip-flops and latches Also distribute clock to other elements –Domino circuits and memories

Clock Distribution On a small chip, the clock distribution network is just a wire –And possibly an inverter for clkb On practical chips, the RC delay of the wire resistance and gate load is very long –Variations in this delay cause clock to get to different elements at different times –This is called clock skew Most chips use repeaters to buffer the clock and equalize the delay –Reduces but doesn’t eliminate skew

Review: Skew Impact Ideally full cycle is available for work Skew adds sequencing overhead Increases hold time too

Solutions Reduce clock skew –Careful clock distribution network design –Plenty of metal wiring resources Analyze clock skew –Only budget actual, not worst case skews –Local vs. global skew budgets Tolerate clock skew –Choose circuit structures insensitive to skew

Clock Skew Sources Clock Skew Sources are: –systematic, –random, –drift, and –jitter Note some engineers do not report jitter as part of the skew.

Example Skew comes from differences in gate and wire delay –With right buffer sizing, clk 1 and clk 2 could ideally arrive at the same time. –But power supply noise changes buffer delays –clk 2 and clk 3 will always see RC skew

Clock System Architecture Synchronous chip interface with PLL

Phase-locked loop block diagram

Global Clock Distribution Networks Grids H-trees Spines Ad-hoc Hybrid

Clock Grids A clock grid is a mesh of horizontal and vertical wires driven from the middle or edges. Use grid on two or more levels to carry clock. Make wires wide to reduce RC delay. Ensures low skew between nearby points. But possibly large skew across die. Grids compensate for random skew. Grids consume a large amount of metal resources and hence a high switching capacitance and power consumption.

Alpha Clock Grids

H-Trees Fractal structure –Gets clock arbitrarily close to any point –Matched delay along all paths Delay variations cause skew Buffers are added to serve as repeaters A and B might see big skew High random skew, drift and jitter between two nearby points

Itanium 2 H-Tree Four levels of buffering: –Primary driver –Repeater –Second-level clock buffer –Gater Route around obstructions

Ad-hoc The clock is routed haphazardly with some attempt to equalize wire lengths or add buffers to equalize delay. Have low systematic skews because the buffer sizes can be adjusted until nominal delays are nearly equal. Subject to random skew.

If loads are uniform, the spine avoids the systematic skew of the grid by matching the length of the clock wires. Save power by not switching certain wires. System with many clocked elements may require a large number of serpentine routes, leading to high area and capacitance for the clock network. Clock spines have large skews between nearby elements driven by different serpentines. Clock Spine

Hybrid Networks Use H-tree to distribute clock to many points. Tie these points together with a grid. Hybrid combination of H-tree and grid offers lower skew. Hybrid approach has lower systematic skew, less susceptible to skew from non-uniform load distribution. Hybrid approach is regular, making layout of well- controlled transmission line structures easier. Ex: IBM Power4, PowerPC –H-tree drives sector buffers –Buffers drive total of 1024 points –All points shorted together with grid

Clock Generation

Local Clock Gaters receives the global clock and produce the physical clocks required by clocked elements. Clock gaters are often used to stop or gate the clock to unused blocks of logic to save power. Different clock gaters are: –Enabled or Gated clock –Stretched clocks –Nonoverlapping clocks –Complementary clock –Delayed, Pulsed clocks –Clock Doubler –Clock Buffer Local Clock Gaters