A Cost-Driven Lithographic Correction Methodology Based on Off-the-Shelf Sizing Tools A Cost-Driven Lithographic Correction Methodology Based on Off-the-Shelf.

Slides:



Advertisements
Similar presentations
ITRS December 2003, Hsin-Chu Taiwan How Much Variability Can Designers Tolerate? Andrew B. Kahng ITRS Design ITWG December 1, 2003.
Advertisements

Hierarchical Dummy Fill for Process Uniformity Supported by Cadence Design Systems, Inc. NSF, and the Packard Foundation Y. Chen, A. B. Kahng, G. Robins,
Design Rule Generation for Interconnect Matching Andrew B. Kahng and Rasit Onur Topaloglu {abk | rtopalog University of California, San Diego.
Tunable Sensors for Process-Aware Voltage Scaling
-1- VLSI CAD Laboratory, UC San Diego Post-Routing BEOL Layout Optimization for Improved Time- Dependent Dielectric Breakdown (TDDB) Reliability Tuck-Boon.
On the Need for Statistical Timing Analysis Farid N. Najm University of Toronto
Timing Margin Recovery With Flexible Flip-Flop Timing Model
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 23: April 10, 2013 Statistical Static Timing Analysis.
Chop-SPICE: An Efficient SPICE Simulation Technique For Buffered RC Trees Myung-Chul Kim, Dong-Jin Lee and Igor L. Markov Dept. of EECS, University of.
Parameterized Timing Analysis with General Delay Models and Arbitrary Variation Sources Khaled R. Heloue and Farid N. Najm University of Toronto {khaled,
The Cost of Fixing Hold Time Violations in Sub-threshold Circuits Yanqing Zhang, Benton Calhoun University of Virginia Motivation and Background Power.
DARPA Assessing Parameter and Model Sensitivities of Cycle-Time Predictions Using GTX u Abstract The GTX (GSRC Technology Extrapolation) system serves.
1 A Lithography-friendly Structured ASIC Design Approach By: Salman Goplani* Rajesh Garg # Sunil P Khatri # Mosong Cheng # * National Instruments, Austin,
Dual Graph-Based Hot Spot Detection Andrew B. Kahng 1 Chul-Hong Park 2 Xu Xu 1 (1) Blaze DFM, Inc. (2) ECE, University of California at San Diego.
Design of Variable Input Delay Gates for Low Dynamic Power Circuits
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 23: April 22, 2009 Statistical Static Timing Analysis.
Puneet Sharma and Puneet Gupta Prof. Andrew B. Kahng Prof. Dennis Sylvester System-Level Living Roadmap Annual Review, Sept Basic Ideas Gate-length.
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
Statistical Crosstalk Aggressor Alignment Aware Interconnect Delay Calculation Supported by NSF & MARCO GSRC Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego.
Design Sensitivities to Variability: Extrapolations and Assessments in Nanometer VLSI Y. Kevin Cao *, Puneet Gupta +, Andrew Kahng +, Dennis Sylvester.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 14: March 19, 2008 Statistical Static Timing Analysis.
Architectural-Level Prediction of Interconnect Wirelength and Fanout Kwangok Jeong, Andrew B. Kahng and Kambiz Samadi UCSD VLSI CAD Laboratory
Enhanced Resist and Etch CD Control by Design Perturbation Abstract Etch dummy features are used to reduce CD skew between resist and etch processes and.
UC San Diego Computer Engineering. VLSI CAD Laboratory.. UC San Diego Computer EngineeringVLSI CAD Laboratory.. UC San Diego Computer EngineeringVLSI CAD.
Chung-Kuan Cheng†, Andrew B. Kahng†‡,
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 22: April 11, 2011 Statistical Static Timing Analysis.
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
1 UCSD VLSI CAD Laboratory ISQED-2009 Revisiting the Linear Programming Framework for Leakage Power vs. Performance Optimization Kwangok Jeong, Andrew.
Toward Performance-Driven Reduction of the Cost of RET-Based Lithography Control Dennis Sylvester Jie Yang (Univ. of Michigan,
Design Bright-Field AAPSM Conflict Detection and Correction C. Chiang, Synopsys A. Kahng, UC San Diego S. Sinha, Synopsys X. Xu, UC San Diego A. Zelikovsky,
Jan. 2007VLSI Design '071 Statistical Leakage and Timing Optimization for Submicron Process Variation Yuanlin Lu and Vishwani D. Agrawal ECE Dept. Auburn.
A Cost-Driven Lithographic Correction Methodology Based on Off-the-Shelf Sizing Tools.
Toward a Methodology for Manufacturability-Driven Design Rule Exploration Luigi Capodieci, Puneet Gupta, Andrew B. Kahng, Dennis Sylvester, and Jie Yang.
Topography-Aware OPC for Better DOF margin and CD control Puneet Gupta*, Andrew B. Kahng*†‡, Chul-Hong Park†, Kambiz Samadi†, and Xu Xu‡ * Blaze-DFM Inc.
Statistical Gate Delay Calculation with Crosstalk Alignment Consideration Andrew B. Kahng, Bao Liu, Xu Xu UC San Diego
UC San Diego Computer Engineering. VLSI CAD Laboratory.. UC San Diego Computer EngineeringVLSI CAD Laboratory.. UC San Diego Computer EngineeringVLSI CAD.
Selective Gate-Length Biasing for Cost-Effective Runtime Leakage Control Puneet Gupta 1 Andrew B. Kahng 1 Puneet Sharma 1 Dennis Sylvester 2 1 ECE Department,
Modern VLSI Design 2e: Chapter 4 Copyright  1998 Prentice Hall PTR Topics n Crosstalk. n Power optimization.
Timing Analysis and Optimization Implications of Bimodal CD Distribution in Double Patterning Lithography Kwangok Jeong and Andrew B. Kahng VLSI CAD LABORATORY.
UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD Laboratory UC San Diego Computer Engineering VLSI CAD.
DDRO: A Novel Performance Monitoring Methodology Based on Design-Dependent Ring Oscillators Tuck-Boon Chan †, Puneet Gupta §, Andrew B. Kahng †‡ and Liangzhen.
Enhanced Metamodeling Techniques for High-Dimensional IC Design Estimation Problems Andrew B. Kahng, Bill Lin and Siddhartha Nath VLSI CAD LABORATORY,
Dose Map and Placement Co-Optimization for Timing Yield Enhancement and Leakage Power Reduction Kwangok Jeong, Andrew B. Kahng, Chul-Hong Park, Hailong.
L. Karklin, S. Mazor, D.Joshi1, A. Balasinski2, and V. Axelrad3
Modern VLSI Design 4e: Chapter 4 Copyright  2008 Wayne Wolf Topics n Interconnect design. n Crosstalk. n Power optimization.
Accuracy-Configurable Adder for Approximate Arithmetic Designs
Modeling and Estimation of Full-Chip Leakage Current Considering Within- Die Correlations Khaled R. Heloue, Navid Azizi, Farid N. Najm University of Toronto.
-1- UC San Diego / VLSI CAD Laboratory Construction of Realistic Gate Sizing Benchmarks With Known Optimal Solutions Andrew B. Kahng, Seokhyeong Kang VLSI.
Introduction to CMOS VLSI Design Lecture 5: Logical Effort GRECO-CIn-UFPE Harvey Mudd College Spring 2004.
Process Variation Mohammad Sharifkhani. Reading Textbook, Chapter 6 A paper in the reference.
Modern VLSI Design 3e: Chapter 4 Copyright  1998, 2002 Prentice Hall PTR Topics n Interconnect design. n Crosstalk. n Power optimization.
Pattern Sensitive Placement For Manufacturability Shiyan Hu, Jiang Hu Department of Electrical and Computer Engineering Texas A&M University College Station,
Pattern Sensitive Placement For Manufacturability Shiyan Hu, Jiang Hu Department of Electrical and Computer Engineering Texas A&M University College Station,
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 24: April 22, 2015 Statistical Static Timing Analysis.
Physical Synthesis Buffer Insertion, Gate Sizing, Wire Sizing,
On the Assumption of Normality in Statistical Static Timing Analysis Halim Damerdji, Ali Dasdan, Santanu Kolay February 28, 2005 PrimeTime Synopsys, Inc.
REALITY STATISTICAL CHARACTERIZATION OF A HIGH-K METAL GATE 32NM ARM926 CORE UNDER PROCESS VARIABILITY IMPACT Paul Zuber Petr Dobrovolny Miguel Miranda.
Variation. 2 Sources of Variation 1.Process (manufacturing) (physical) variations:  Uncertainty in the parameters of fabricated devices and interconnects.
UC San Diego / VLSI CAD Laboratory Learning-Based Approximation of Interconnect Delay and Slew Modeling in Signoff Timing Tools Andrew B. Kahng, Seokhyeong.
Static Timing Analysis
Predicting the In-System Performance of the CMS Tracker Analog Readout Optical Links Stefanos Dris CERN & Imperial College, London.
Design For Manufacturability in Nanometer Era
-1- UC San Diego / VLSI CAD Laboratory Optimization of Overdrive Signoff Tuck-Boon Chan, Andrew B. Kahng, Jiajia Li and Siddhartha Nath Tuck-Boon Chan,
-1- Delay Uncertainty and Signal Criticality Driven Routing Channel Optimization for Advanced DRAM Products Samyoung Bang #, Kwangsoo Han ‡, Andrew B.
Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.
University of Michigan Advanced Computer Architecture Lab. 2 CAD Tools for Variation Tolerance David Blaauw and Kaviraj Chopra University of Michigan.
Motivation Process & Design trends
Circuit Design Techniques for Low Power DSPs
On the Improvement of Statistical Timing Analysis
VLSI Testing Lecture 7: Delay Test
Presentation transcript:

A Cost-Driven Lithographic Correction Methodology Based on Off-the-Shelf Sizing Tools A Cost-Driven Lithographic Correction Methodology Based on Off-the-Shelf Sizing Tools Puneet Gupta Andrew B. Kahng Univ. of California, San Diego Dennis Sylvester Jie Yang Univ. of Michigan, Ann Arbor Work partially supported by MARCO GSRC and SRC

OutlineOutline Trends in Mask Cost Design for Value MinCorr: The Cost of Correction Problem Generic Cost of Correction Methodology MinCorr: Parallels to Gate Sizing Experiments and Results Conclusions and Ongoing Work

Stringent CD Requirements ITRS predicts aggressive CD control as a critical issue Resolution Enhancement Techniques (RETs) such as Optical Proximity Correction (OPC) and Phase Shift Mask (PSM) used Year Technology Node MPU Gate Length Gate CD Control(3  ) 130nm90nm5.3nm90nm37nm3.0nm65nm25nm2.0nm

Mask Cost Components

Mask Data Volume Large data volume  Long mask write times  Increase in mask cost

Trends in Mask Cost RETs increase mask feature complexities and hence mask costs No. of line edges increase by 4-8X after OPC  (for vector scanning) increased mask write time “Million dollar mask set” in 90nm (Sematech, 2000) Average mask set produces only 570 wafers  amortization of mask cost is difficult Mask writers work equally hard to perfect critical and non- critical shapes Errors found in either during mask inspection will cause the mask to be discarded RET and mask write are function-oblivious!

OutlineOutline Trends in Mask Cost Trends in Mask Cost Design for Value MinCorr: The Cost of Correction Problem Generic Cost of Correction Methodology MinCorr: Parallels to Gate Sizing Experiments and Results Conclusions and Ongoing Work

Design for Value * Mask cost trends  Design for Value (DFV) methodologies Design for Value Problem: Design for Value Problem: Given Given Performance measure f Value function v(f) Selling points f i corresponding to various values of f Yield function y(f) Maximize Total Design Value =  i y(f i )*v(f i ) [or, Minimize Total Cost] Probabilistic optimization regime * See "Design Sensitivities to Variability: Extrapolation and Assessments in Nanometer VLSI", IEEE ASIC/SoC Conference, September 2002, pp

DFV At Process Level Inject concept of function into mask flow Selective OPC Various levels of OPC depending on timing and yield criticality of features Obtain desired level of parametric yield Printability: some min level of OPC is required No OPCMedium OPC Aggressive OPC * Figure courtesy: Kurt Wampler, ASML Mask Tools, Inc.

OutlineOutline Trends in Mask Cost Trends in Mask Cost Design for Value Design for Value MinCorr: The Cost of Correction Problem Generic Cost of Correction Methodology MinCorr: Parallels to Gate Sizing Experiments and Results Conclusions and Ongoing Work

Performance Measure = Delay Selling point delay = circuit delay which achieves desired level (say 99%) of parametric yield Goal: Achieve selling point delay with minimum cost of RET’s (OPC)

MinCorr: The Cost of Correction DFV Problem Given: Admissible levels of correction for each layout feature and the corresponding delay impact (mean and variance) Find: Level of correction for each layout feature such that a prescribed selling point delay is attained Objective: Minimize total cost of corrections

OutlineOutline Trends in Mask Cost Trends in Mask Cost Design for Value Design for Value MinCorr: The Cost of Correction Problem MinCorr: The Cost of Correction Problem Generic Cost of Correction Methodology MinCorr: Parallels to Gate Sizing Experiments and Results Conclusions and Ongoing Work

Statistical Timing Analysis Statistical STA (SSTA) provides PDFs of arrival times at all nodes A B Arrival time C Gate delay Arrival time Propagate arrival time distribution

Variation Aware Library Model Capacitance and delay values replaced by ( ,  ) pair Sample variation aware.lib pin(A) { pin(A) { direction : input; direction : input; capacitance : ( ,0.0003); capacitance : ( ,0.0003); } … timing() { related_pin : "A"; related_pin : "A"; timing_sense : positive_unate; timing_sense : positive_unate; cell_rise(delay_template_7x7) { cell_rise(delay_template_7x7) { index_1 ("0.028, 0.044, 0.076"); index_1 ("0.028, 0.044, 0.076"); index_2 (" , , "); index_2 (" , , "); values ( \ values ( \ “( ,0.001), ( ,0.0015), ( ,0.002)", “( ,0.001), ( ,0.0015), ( ,0.002)",….

Generic Cost of Correction Methodology Statistical STA (SSTA) provides PDFs of arrival times at all nodes Assume variation aware library models (for delay) are available SSTA Nominally Correct SP&R Netlist Min. Corrected Library Yield Target met ? EXIT Correction Algorithm SSTA All Correction Libraries All Correction Libraries Y N

Generic Cost of Correction Methodology Statistical STA (SSTA) provides PDFs of arrival times at all nodes Assume variation aware library models (for delay) are available Statistical STA currently has runtime and scalability issues SSTA Nominally Correct SP&R Netlist Min. Corrected Library Yield Target met ? EXIT Correction Algorithm SSTA All Correction Libraries All Correction Libraries Y N

OutlineOutline Trends in Mask Cost Trends in Mask Cost Design for Value Design for Value MinCorr: The Cost of Correction Problem MinCorr: The Cost of Correction Problem Generic Cost of Correction Methodology Generic Cost of Correction Methodology MinCorr: Parallels to Gate Sizing Experiments and Results Conclusions and Ongoing Work

MinCorr: Parallels to Gate Sizing Assume Gaussian-ness of distributions prevails  + 3  corresponds to 99% yield Perfect correlation of variation along all paths  Die-to-Die variation   1+2 =   1 +   2 Resulting linearity allows propagation of (  +3  ) or 99% (selling point) delay to primary outputs using standard Static Timing Analysis (STA) tools

MinCorr: Parallels to Gate Sizing Gate Sizing MinCorr Cell Area  Cost of correction Nominal Delay  Delay (  +k  ) Cycle Time  Selling point delay Die Area  Total cost of OPC Gate Sizing Problem: Given allowed areas and corresponding delays of each cell, minimize total die area subject to a cycle time constraint costs of correction delay (  +k  ) cost of OPC cost of OPC selling point delay MinCorr

Components of MinCorr Sizing A yield-aware library that captures Delay mean and variance for each library master for each level of correction Relative cost of OPC for each master corresponding to each level of correction Use standard off-the-shelf logic synthesis tool to perform sizing Can use well-tested sizing methods Practical runtimes Can handle interesting variants, e.g., cost-constrained selling point delay minimization

MinCorr: Yield Aware Library Characterization Mask cost is assumed proportional to number of layout features Monte-Carlo simulations, coupled with linear interpolation, are used to estimate delay variance given the CD variation We generate a library similar to Synopsys.lib with (  +3  ) delay values for various output loads Cost modeled by relative figure count multiplied by the number of transistors in the cell Gate input capacitance variation with CD considered

OutlineOutline Trends in Mask Cost Trends in Mask Cost Design for Value Design for Value MinCorr: The Cost of Correction Problem MinCorr: The Cost of Correction Problem Generic Cost of Correction Methodology Generic Cost of Correction Methodology MinCorr: Parallels to Gate Sizing MinCorr: Parallels to Gate Sizing Experiments and Results Conclusions and Ongoing Work

Experiments and Results Synopsys Design Compiler used as the synthesis tool to perform “gate sizing” Figure counts, critical dimension (CD) variations derived from Numerical Technologies OPC tool * Use a restricted TSMC 0.13  m library 7 cell masters: BUF, INV, NAND, NOR Approach tested on small combinational circuits alu128: 8064 cells c7552: 2081 cell ISCAS85 circuit c6288: 2769 cell ISCAS85 circuit * Courtesy Dipu Pramanik, NTI

Emulating An SSTA Tool 1.Generate 500 random delay values for each library master from the Gaussian distribution N( ,  ) 2.Generate 500 random input capacitance values perfectly correlated with corresponding random delay values 3.Generate 500.libs having these random delay and capacitance values 4.Monte Carlo Primetime: Run STA tool 500 times, each time with a different.lib to obtain a delay distribution

Comparison With SSTA SSTA emulated by running Synopsys Primetime 500 times with 500 randomly generated.libs Monte Carlo Primetime is run with independently varying library masters but all instances of same master perfectly correlated  Die-to-Die (DTD) with some component of Within-Die (WID) variation modeled Our (  +3  ) propagation approach is accurate for DTD variation but pessimistic in presence of WID variation

Comparison With SSTA Testcase OPC Level SSTA (  +3  ) (ps) Our Approach (  +3  ) (ps) alu128c7552c6288AggressiveMediumNoAggressiveMediumNoAggressiveMediumNo Note: pessimism and fidelity

Yield Library Generation Three levels of OPC considered Input slew dependence ignored Interconnect variation ignored Type of OPC Ldrawn (nm) 3  of Ldrawn Figure Count Delay ( , 3  ) for NAND2X2 Aggressive1305%5X (64.82, 2.14) Medium1306.5%4X (64.82, 2.80) No OPC 13010%1X (64.82, 4.33) Sample Result of Library Generation

Cost Savings with MinCorr Sizing Design Normalized Cost Normalized Selling Point Delay alu (Aggressive OPC) (Medium OPC) (No OPC) c c

Cost Savings with MinCorr Sizing Small (~5%) selling point delay variation between max- and min-corrected versions of design (5X difference in cost) Sizing-based optimization achieves 17-79% reduction in OPC cost without sacrificing parametric yield

OutlineOutline Trends in Mask Cost Trends in Mask Cost Design for Value Design for Value MinCorr: The Cost of Correction Problem MinCorr: The Cost of Correction Problem Generic Cost of Correction Methodology Generic Cost of Correction Methodology MinCorr: Parallels to Gate Sizing MinCorr: Parallels to Gate Sizing Experiments and Results Experiments and Results Conclusions and Ongoing Work

ConclusionsConclusions Function-aware OPC can reduce total cost of OPC while still meeting cycle time and yield constraints Can modify conventional performance optimization methods to solve the MinCorr problem; We use an off-the-shelf synthesis tool to achieve up to 79% cost reduction compared to aggressive OPC, without increasing selling point delay Small change in yield going from no to aggressive OPC suggests that OPC may be a manufacturability issue rather than yield issue

Ongoing Work SSTA based correction flow Apply selective OPC at granularities other than gate-level (incl. radius of influence effects) Alternative MinCorr solution approaches based on transistor sizing and cost-based delay budgeting methods Include interconnect variation in the analysis Make the yield library input slew time aware

Ongoing Work Current sizing approach models Die-to-Die variation accurately but ignores Within-Die (WID) component SSTA with WID Randomly perturbed SDF files Monte Carlo Primetime Ability to model arbitrary distributions of variations

A Cost-Driven Lithographic Correction Methodology Based on Off-the-Shelf Sizing Tools A Cost-Driven Lithographic Correction Methodology Based on Off-the-Shelf Sizing Tools Puneet Gupta Andrew B. Kahng Dennis Sylvester Jie Yang

Trends in Mask Cost “$1M mask set” in 100nm

OPC Overhead OPC increases figure count and mask complexity No. of line edges increase by 4-8X after OPC  (for vector scanning) increase in mask write time Rule Based OPC All rectangles identically corrected Model Based OPC Enhancement to a feature made based on its geometry and local environment OPC fracturing tools view layout as function-oblivious GDSII  overcorrection

Toward A Min Cost of Correction Methodology Many layout features not timing critical  they can tolerate more process variation Less-aggressive OPC  lower costs (reduced figure counts, shorter mask write times, higher yields) Printability  certain min level of OPC is required Selling point delay = circuit delay which achieves desired level (say 99%) of parametric yield

Generic Cost of Correction Methodology SSTA Nominally Correct SP&R Netlist Min. Corrected Library Yield Target met ? EXIT Correction Algorithm SSTA All Correction Libraries All Correction Libraries Y N