Layout-aware Scan-based Delay Fault Testing Puneet Gupta 1 Andrew B. Kahng 1 Ion Mandoiu 2 Puneet Sharma 1 1 ECE Department, University of California –

Slides:



Advertisements
Similar presentations
Porosity Aware Buffered Steiner Tree Construction C. Alpert G. Gandham S. Quay IBM Corp M. Hrkic Univ Illinois Chicago J. Hu Texas A&M Univ.
Advertisements

Retiming Scan Circuit To Eliminate Timing Penalty
OCV-Aware Top-Level Clock Tree Optimization
Weighted Random and Transition Density Patterns for Scan-BIST Farhana Rashid* Vishwani D. Agrawal Auburn University ECE Department, Auburn, Alabama
Courtesy RK Brayton (UCB) and A Kuehlmann (Cadence) 1 Logic Synthesis Sequential Synthesis.
Native-Conflict-Aware Wire Perturbation for Double Patterning Technology Szu-Yu Chen, Yao-Wen Chang ICCAD 2010.
1 A Random Access Scan Architecture to Reduce Hardware Overhead Anand S. Mudlapur Vishwani D. Agrawal Adit D. Singh Department of Electrical and Computer.
1 Advancing Supercomputer Performance Through Interconnection Topology Synthesis Yi Zhu, Michael Taylor, Scott B. Baden and Chung-Kuan Cheng Department.
NCHUCS1 Scan Chain Reorder Sying-Jyan Wang Department of Computer Science National Chung-Hsing University.
Improving Placement under the Constant Delay Model Kolja Sulimma 1, Ingmar Neumann 1, Lukas Van Ginneken 2, Wolfgang Kunz 1 1 EE and IT Department University.
F.F. Dragan (Kent State) A.B. Kahng (UCSD) I. Mandoiu (UCLA) S. Muddu (Sanera Systems) A. Zelikovsky (Georgia State) Provably Good Global Buffering by.
Design Flow Enhancements for DNA Arrays Andrew B. Kahng 1 Ion I. Mandoiu 2 Sherief Reda 1 Xu Xu 1 Alex Zelikovsky 3 (1) CSE Department, University of California.
FPGA Latency Optimization Using System-level Transformations and DFG Restructuring Daniel Gomez-Prado, Maciej Ciesielski, and Russell Tessier Department.
Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.
Background: Scan-Based Delay Fault Testing Sequentially apply initialization, launch test vector pairs that differ by 1-bit shift A vector pair induces.
Evaluation of Placement Techniques for DNA Probe Array Layout Andrew B. Kahng 1 Ion I. Mandoiu 2 Sherief Reda 1 Xu Xu 1 Alex Zelikovsky 3 (1) CSE Department,
5/1/2006VTS'061 Upper Bounding Fault Coverage by Structural Analysis and Signal Monitoring Vishwani D. Agrawal Auburn University, Dept. of ECE, Auburn,
: Grid graph :Draw two rays from each concave point Rays are divided into non-intersected ray-segments Conflict pair: two ray segments from the same point.
Power-Aware Placement
Practically Realizing Random Access Scan By Anand Mudlapur ECE Dept. Auburn University.
ELEN 468 Lecture 241 ELEN 468 Advanced Logic Design Lecture 24 Design for Testability.
Fill for Shallow Trench Isolation CMP
Architectural-Level Prediction of Interconnect Wirelength and Fanout Kwangok Jeong, Andrew B. Kahng and Kambiz Samadi UCSD VLSI CAD Laboratory
Supply Voltage Degradation Aware Analytical Placement Andrew B. Kahng, Bao Liu and Qinke Wang UCSD CSE Department {abk, bliu,
Study of Floating Fill Impact on Interconnect Capacitance Andrew B. Kahng Kambiz Samadi Puneet Sharma CSE and ECE Departments University of California,
Practically Realizing Random Access Scan Anand S. Mudlapur Department of Electrical and Computer Engineering Auburn University, AL USA.
Vishwani D. Agrawal James J. Danaher Professor
On Legalization of Row-Based Placements Andrew B. KahngSherief Reda CSE & ECE Departments University of CA, San Diego La Jolla, CA 92093
Yield- and Cost-Driven Fracturing for Variable Shaped-Beam Mask Writing Andrew B. Kahng CSE and ECE Departments, UCSD Xu Xu CSE Department, UCSD Alex Zelikovsky.
Fast and Area-Efficient Phase Conflict Detection and Correction in Standard-Cell Layouts Charles Chiang, Synopsys Andrew B. Kahng, UC San Diego Subarna.
Chung-Kuan Cheng†, Andrew B. Kahng†‡,
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
1 UCSD VLSI CAD Laboratory ISQED-2009 Revisiting the Linear Programming Framework for Leakage Power vs. Performance Optimization Kwangok Jeong, Andrew.
Fill for Shallow Trench Isolation CMP Andrew B. Kahng 1,2 Puneet Sharma 1 Alexander Zelikovsky 3 1 ECE Department, University of California – San Diego.
IC-SOC STEAC: An SOC Test Integration Platform Cheng-Wen Wu.
A Proposal for Routing-Based Timing-Driven Scan Chain Ordering Puneet Gupta 1 Andrew B. Kahng 1 Stefanus Mantik 2
Detailed Placement for Leakage Reduction Using Systematic Through-Pitch Variation Andrew B. Kahng †‡ Swamy Muddu ‡ Puneet Sharma ‡ CSE † and ECE ‡ Departments,
UC San Diego Computer Engineering. VLSI CAD Laboratory.. UC San Diego Computer EngineeringVLSI CAD Laboratory.. UC San Diego Computer EngineeringVLSI CAD.
Selective Gate-Length Biasing for Cost-Effective Runtime Leakage Control Puneet Gupta 1 Andrew B. Kahng 1 Puneet Sharma 1 Dennis Sylvester 2 1 ECE Department,
Class Design Project - Test Generation 1 Class Design Project Test Generation Hillary Grimes III ELEC Project Presentation April 26, 2007.
Routing-Aware Scan Chain Ordering Puneet Gupta and Andrew B. Kahng (Univ. of California at San Diego, La Jolla, CA, USA.), Stefanus Mantik (Cadence Design.
Dose Map and Placement Co-Optimization for Timing Yield Enhancement and Leakage Power Reduction Kwangok Jeong, Andrew B. Kahng, Chul-Hong Park, Hailong.
A Topology-based ECO Routing Methodology for Mask Cost Minimization Po-Hsun Wu, Shang-Ya Bai, and Tsung-Yi Ho Department of Computer Science and Information.
-1- UC San Diego / VLSI CAD Laboratory A Global-Local Optimization Framework for Simultaneous Multi-Mode Multi-Corner Clock Skew Variation Reduction Kwangsoo.
1 Fitting ATE Channels with Scan Chains: a Comparison between a Test Data Compression Technique and Serial Loading of Scan Chains LIRMM CNRS / University.
Low-Power Gated Bus Synthesis for 3D IC via Rectilinear Shortest-Path Steiner Graph Chung-Kuan Cheng, Peng Du, Andrew B. Kahng, and Shih-Hung Weng UC San.
August VLSI Testing and Verification Shmuel Wimer Bar Ilan University, School of Engineering.
UC San Diego / VLSI CAD Laboratory Incremental Multiple-Scan Chain Ordering for ECO Flip-Flop Insertion Andrew B. Kahng, Ilgweon Kang and Siddhartha Nath.
A Test-Per-Clock LFSR Reseeding Algorithm for Concurrent Reduction on Test Sequence Length and Test Data Volume Adviser :蔡亮宙 Student ;蔡政宏.
Thermal-aware Steiner Routing for 3D Stacked ICs M. Pathak and S.K. Lim Georgia Institute of Technology ICCAD 07.
Kwangsoo Han, Andrew B. Kahng, Hyein Lee and Lutong Wang
ECE 553: TESTING AND TESTABLE DESIGN OF DIGITAL SYSTEMS
A Graph Based Algorithm for Data Path Optimization in Custom Processors J. Trajkovic, M. Reshadi, B. Gorjiara, D. Gajski Center for Embedded Computer Systems.
A Robust Pulse-triggered Flip-Flop and Enhanced Scan Cell Design
ARCHER:A HISTORY-DRIVEN GLOBAL ROUTING ALGORITHM Muhammet Mustafa Ozdal, Martin D. F. Wong ICCAD ’ 07.
Reducing Test Application Time Through Test Data Mutation Encoding Sherief Reda and Alex Orailoglu Computer Science Engineering Dept. University of California,
Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.
Towards a More Fair and Robust Internet Backbone Year 1 Status Report Rene Cruz, Tara Javidi, Bill Lin Center for Networked Systems University of California,
Courtesy RK Brayton (UCB) and A Kuehlmann (Cadence) 1 Logic Synthesis Multi-Level Logic Synthesis.
Multi-Split-Row Threshold Decoding Implementations for LDPC Codes
A Test-Per-Clock LFSR Reseeding Algorithm for Concurrent Reduction on Test Sequence Length and Test Data Volume Wei-Cheng Lien 1, Kuen-Jong Lee 1 and Tong-Yu.
Outline Motivation and Contributions Related Works ILP Formulation
Implementing Tile-based Chip Multiprocessors with GALS Clocking Styles Zhiyi Yu, Bevan Baas VLSI Computation Lab, ECE Department University of California,
Test Generation for Designs with Multiple Clocks Xijiang LinSudhakar M. Reddy Mentor Graphics Corp SW Boeckman Rd. Wilsonville, OR ECE Department.
1 Double-Patterning Aware DSA Template Guided Cut Redistribution for Advanced 1-D Gridded Designs Zhi-Wen Lin and Yao-Wen Chang National Taiwan University.
Kun Young Chung*, Andrew B. Kahng+ and Jiajia Li+
Contents Introduction Bus Power Model Related Works Motivation
Kristof Blutman† , Hamed Fatemi† , Andrew B
Revisiting and Bounding the Benefit From 3D Integration
Multi-Commodity Flow Based Routing
Presentation transcript:

Layout-aware Scan-based Delay Fault Testing Puneet Gupta 1 Andrew B. Kahng 1 Ion Mandoiu 2 Puneet Sharma 1 1 ECE Department, University of California – San Diego 2 CSE Department, University of Connecticut, Storrs

Outline Introduction Introduction Problem formulations Problem formulations Multi-fragment greedy algorithm Multi-fragment greedy algorithm Experiments and results Experiments and results Future directions Future directions

Delay Fault Testing Delay fault: failure of a path to meet timing Delay fault: failure of a path to meet timing Initialization Vector High clock speeds + increasing variability High clock speeds + increasing variability  Delay fault testing important Pair of vectors required Pair of vectors required Initialization vector Initialization vector Launch vector Launch vector Launch Vector 1 0

Scan-based Delay Fault Testing Utilizes standard shift- scan architecture Utilizes standard shift- scan architecture Launch vector produced in two ways: Launch vector produced in two ways: From circuit logic From circuit logic  Functional justification  Functional justification From scan chain From scan chain  Scan justification

Functional Justification Launch vector generated by the circuit logic Launch vector generated by the circuit logic 1.Scan-in initialization vector No effect of scan order No effect of scan order 2. Give system CLK to generate launch vector 3.Give system CLK, capture result 4.Scan-out result Not all paths testable Not all paths testable Difficult to produce vector pairs Difficult to produce vector pairs

Scan Justification Launch vector generated by shifting initialization vector Launch vector generated by shifting initialization vector 1.Scan-in initialization vector 2.Give scan CLK to generate launch vector 3.Give system CLK, capture result 4.Scan-out result Given scan order, not all vector pairs usable Given scan order, not all vector pairs usable Coverage: %age of vector pairs usable Initialization: Launch:

Increasing Scan Coverage Scan order Scan order Initialization: Launch: Don’t cares Don’t cares Initialization: 1 0 x 1 1 Launch: 1 1 x Dummy flops Dummy flops Initialization: Launch:

Scan Order Objectives Coverage driven, layout oblivious Coverage driven, layout oblivious Gupta et al, Cheng et al, … Gupta et al, Cheng et al, … WL: mm, Cov: % WL: 1.22 mm, Cov: 56.80% Layout driven, coverage oblivious Layout driven, coverage oblivious QPlace, Boese et al., Kobayashi et al., … QPlace, Boese et al., Kobayashi et al., … WL: 1.46 mm, Cov: % Layout + Coverage driven Our approach

Outline Introduction Introduction Problem formulations Problem formulations Multi-fragment greedy algorithm Multi-fragment greedy algorithm Experiments and results Experiments and results Future directions Future directions

Scan Chain Ordering Modeled as TSP with flops as cities Modeled as TSP with flops as cities TSP objective TSP objective Minimize WL: MinWL (Boese et al.) Minimize WL: MinWL (Boese et al.) Minimize #dummy for 100% coverage: CompleteDFC (Gupta et al.) Minimize #dummy for 100% coverage: CompleteDFC (Gupta et al.) This paper: Minimize WL + Maximize coverage: MaxDFC This paper: Minimize WL + Maximize coverage: MaxDFC u(i)=0 & v(j)=1 where, u: Initialization vector or u(i)=1 & v(j)=0v: Launch vector Dummy insertion in an edge makes all vector pairs compatible with it Dummy insertion in an edge makes all vector pairs compatible with it Vector pair incompatible with an edge Vector pair incompatible with an edge A vector pair is incompatible with an edge e ij if placing flop j after flop i in the scan chain causes it to become unusable A vector pair is incompatible with an edge e ij if placing flop j after flop i in the scan chain causes it to become unusable Formally, vector pair (u, v) is incompatible with e ij if Formally, vector pair (u, v) is incompatible with e ij if

MinWL (e.g., Boese et al.) Given Given Set of n placed flip-flops F, Scan-in SI, Scan-out SO Find Scan chain ordering  of F  {SI, SO}, starting with SI ending with SO Such that Total scan chain length minimized Total scan chain length minimized

CompleteDFC (Gupta et al.) Given Given Set of n flip-flops F, Scan-in SI, Scan-out SO Set of n flip-flops F, Scan-in SI, Scan-out SO Set of m delay fault vector pairs  Set of m delay fault vector pairs  Find Find Scan chain ordering  of F  {SI, SO}, starting with SI ending with SO Scan chain ordering  of F  {SI, SO}, starting with SI ending with SO Such that Such that # dummy flops required for 100% coverage minimized # dummy flops required for 100% coverage minimized

MaxDFC Given Given Set of n placed flip-flops F, Scan-in SI, Scan-out SO Set of n placed flip-flops F, Scan-in SI, Scan-out SO Set of m delay fault vector pairs,  each with a weight w t, t   Set of m delay fault vector pairs,  each with a weight w t, t   Upped bound on #dummies, D Upped bound on #dummies, D Find Find Scan chain ordering  of F  {SI, SO}, starting with SI ending with SO Scan chain ordering  of F  {SI, SO}, starting with SI ending with SO Set of alive vector pairs, C Set of alive vector pairs, C Such that Such that Scan length minimized Scan length minimized Sum of weights of alive vector pairs maximized Sum of weights of alive vector pairs maximized Vectors pairs incompatible with at most D edges Vectors pairs incompatible with at most D edges

Outline Introduction Introduction Problem formulations Problem formulations Multi-fragment greedy algorithm Multi-fragment greedy algorithm Experiments and results Experiments and results Future directions Future directions

Three Phase MFG - Overview Phase I Phase I Produce D+1 short, high coverage scan chain fragments Produce D+1 short, high coverage scan chain fragments Based on multi-fragment algorithm for TSP Based on multi-fragment algorithm for TSP Data structures Data structures Edge-vector incompatibility matrix Edge-vector incompatibility matrix Edge buckets Edge buckets Phase II Phase II Stitch D+1 fragments using D dummies minimizing WL Stitch D+1 fragments using D dummies minimizing WL Phase III Phase III Further reduce scan chain WL Further reduce scan chain WL

Three Phase MFG – Phase I initialize edge-vector incompatibility matrix distribute edges in buckets based on #incompatible vector pairs while #fragments > #dummies+1 pop shortest edge e ij from first non-empty bucket if( eij is admissible in tour ) add e ij to tour remove vectors incompatible with e ij from edge-vector matrix promote edges with which removed vectors were incompatible 1212121 313131 323232 121212 131313 232323 3131313 23232323 + 5 vectors, 0 dummies edge-vector matrix 012 buckets 12121212 21212121 31313131 32323232

Three Phase MFG – Phase I Scalability Scalability Use small #edges, w(e) < T Use small #edges, w(e) < T If #frags < #dummies + 1, rerun with threshold=M  T If #frags < #dummies + 1, rerun with threshold=M  T Quite insensitive to T, M Quite insensitive to T, M

Three Phase MFG – Phase II Target: Stitch D+1 fragments, minimizing WL Target: Stitch D+1 fragments, minimizing WL Approach Approach ATSP with fragments as cities ATSP with fragments as cities D1 D2 w e  WL required to connect fragments w e  WL required to connect fragments Small #cities Small #cities  Quick even with high quality TSP solver

Three Phase MFG – Phase III Target: Further reduce scan WL Target: Further reduce scan WL Approach Approach Create TSP instance with flops as cities Create TSP instance with flops as cities Throw in edges compatible with all alive faults Throw in edges compatible with all alive faults w e  WL of edge e w e  WL of edge e D1 D D1 D D1 D2

Outline Introduction Introduction Problem formulations Problem formulations Multi-fragment greedy algorithm Multi-fragment greedy algorithm Experiments and results Experiments and results Future directions Future directions

Experimental Flows Comparison of three flows Comparison of three flows MinWL (Boese et al.) MinWL (Boese et al.)  Reference min WL tour CompleteDFC (Gupta et al.) CompleteDFC (Gupta et al.)  Reference full coverage tour MaxDFC MaxDFC

Testcases Testcase # cells # flops # paths Functional coverage s % s % s % AES % DES % Synthesis Design Compiler STAPrimeTimeATPGTetraMAX PlacementQPlace Source all vectors scanvectorsfloplocations paths func vectors 

Results: s38417 Wirelength Wirelength Coverage Coverage MinWLMaxDFC

Results: aes Wirelength Wirelength Coverage Coverage MinWLMaxDFC

MFG Scalability MFG Runtime Time(s)

Dummy-Coverage Tradeoff Proposed ILP formulation Proposed ILP formulation s38417

Outline Introduction Introduction Problem formulations Problem formulations Multi-fragment greedy algorithm Multi-fragment greedy algorithm Experiments and results Experiments and results Future directions Future directions

Conclusions We proposed an algorithm to simultaneously reduce WL and increase delay fault coverage We proposed an algorithm to simultaneously reduce WL and increase delay fault coverage Significant increase in coverage with 10-30% WL increase Significant increase in coverage with 10-30% WL increase Explored tradeoff b/w coverage and dummy insertion Explored tradeoff b/w coverage and dummy insertion

Future Directions Extension to multiple scan chains Extension to multiple scan chains Congestion aware scan ordering Congestion aware scan ordering Modifications to use compacted and/or redundant vectors Modifications to use compacted and/or redundant vectors

Thank You