Recent Challenges. 2 Soft Errors Scaling:  SEU (Single-event upset): −Ionizing radiation corrupts data stored  Cause: −Radioactive impurities in device.

Slides:



Advertisements
Similar presentations
ITRS December 2003, Hsin-Chu Taiwan How Much Variability Can Designers Tolerate? Andrew B. Kahng ITRS Design ITWG December 1, 2003.
Advertisements

DRAFT - NOT FOR PUBLICATION 14 July 2004 – ITRS Summer Conference ITRS FEP Challenges Continued scaling will require the introduction of new materials.
Barcelona Forum on Ph.D. Research in Communications, Electronics and Signal Processing 21st October 2010 Soft Errors Hardening Techniques in Nanometer.
University of Toronto Minimization of Delay Sensitivity to Process Induced Vth Variations Georges Nabaa Farid N. Najm University of Toronto
FPGA (Field Programmable Gate Array)
An International Technology Roadmap for Semiconductors
Tunable Sensors for Process-Aware Voltage Scaling
HPEC 2012 Scrubbing Optimization via Availability Prediction (SOAP) for Reconfigurable Space Computing Quinn Martin Alan George.
Microprocessor Reliability
1 Cleared for Open Publication July 30, S-2144 P148/MAPLD 2004 Rea MAPLD 148:"Is Scaling the Correct Approach for Radiation Hardened Conversions.
Power Reduction Techniques For Microprocessor Systems
On the Need for Statistical Timing Analysis Farid N. Najm University of Toronto
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 23: April 10, 2013 Statistical Static Timing Analysis.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Accelerating Productization. Functional Metrology TM Challenges of Semiconductor Productization Leading IDM’s Solution Novel Solution -> In-product Functional.
Architecture Design Methodology. 2 The effects of architecture design on metrics:  Area (cost)  Performance  Power Target market:  A set of application.
Defect Tolerance for Yield Enhancement of FPGA Interconnect Using Fine-grain and Coarse-grain Redundancy Anthony J. Yu August 15, 2005.
Yuanlin Lu Intel Corporation, Folsom, CA Vishwani D. Agrawal
Defect Tolerance for Yield Enhancement of FPGA Interconnect Using Fine-grain and Coarse-grain Redundancy Anthony J. Yu August 15, 2005.
 Device and architecture co-optimization – Large search space – Need fast yet accurate power and delay estimator for FPGAs  Trace-based power and delay.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 23: April 22, 2009 Statistical Static Timing Analysis.
1 Variability Characterization in FPGAs Brendan Hargreaves 10/05/2006.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 14: March 19, 2008 Statistical Static Timing Analysis.
Institute of Digital and Computer Systems 1 Fabio Garzia / Finding Peak Performance in a Process23/06/2015 Chapter 5 Finding Peak Performance in a Process.
Evolution of implementation technologies
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 22: April 11, 2011 Statistical Static Timing Analysis.
On-Line Adjustable Buffering for Runtime Power Reduction Andrew B. Kahng Ψ Sherief Reda † Puneet Sharma Ψ Ψ University of California, San Diego † Brown.
Stochastic Physical Synthesis for FPGAs with Pre-routing Interconnect Uncertainty and Process Variation Yan Lin and Lei He EE Department, UCLA
© 2005 Altera Corporation © 2006 Altera Corporation Placement and Timing for FPGAs Considering Variations Yan Lin 1, Mike Hutton 2 and Lei He 1 1 EE Department,
Jan. 2007VLSI Design '071 Statistical Leakage and Timing Optimization for Submicron Process Variation Yuanlin Lu and Vishwani D. Agrawal ECE Dept. Auburn.
Circuit Performance Variability Decomposition Michael Orshansky, Costas Spanos, and Chenming Hu Department of Electrical Engineering and Computer Sciences,
1 adaptive body bias for reducing process variations nuno alves 19 / october / 2006.
Temperature-Aware Design Presented by Mehul Shah 4/29/04.
Trace-Based Framework for Concurrent Development of Process and FPGA Architecture Considering Process Variation and Reliability 1 Lerong Cheng, 1 Yan Lin,
Integrated Regulation for Energy- Efficient Digital Circuits Elad Alon 1 and Mark Horowitz 2 1 UC Berkeley 2 Stanford University.
“ Near-Threshold Computing: Reclaiming Moore’s Law Through Energy Efficient Integrated Circuits ” By Ronald G. Dreslinski, Michael Wieckowski, David Blaauw,
Power Reduction for FPGA using Multiple Vdd/Vth
EGRE 427 Advanced Digital Design Figures from Application-Specific Integrated Circuits, Michael John Sebastian Smith, Addison Wesley, 1997 Chapter 4 Programmable.
Source: Lundstrom/Fossom/Yang/Neudeck, Purdue EE612 lecture notes.
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
1 5. Application Examples 5.1. Programmable compensation for analog circuits (Optimal tuning) 5.2. Programmable delays in high-speed digital circuits (Clock.
SiLab presentation on Reliable Computing Combinational Logic Soft Error Analysis and Protection Ali Ahmadi May 2008.
CSE 494: Electronic Design Automation Lecture 2 VLSI Design, Physical Design Automation, Design Styles.
J. Christiansen, CERN - EP/MIC
2/2/2009 Marina Artuso LHCb Electronics Upgrade Meeting1 Front-end FPGAs in the LHCb upgrade The issues What is known Work plan.
MAPLD 2005/202 Pratt1 Improving FPGA Design Robustness with Partial TMR Brian Pratt 1,2 Michael Caffrey, Paul Graham 2 Eric Johnson, Keith Morgan, Michael.
Robust Low Power VLSI ECE 7502 S2015 Minimum Supply Voltage and Very- Low-Voltage Testing ECE 7502 Class Discussion Elena Weinberg Thursday, April 16,
Process Variation Mohammad Sharifkhani. Reading Textbook, Chapter 6 A paper in the reference.
Configuration Bitstream Reduction for SRAM-based FPGAs by Enumerating LUT Input Permutations The University of British Columbia© 2011 Guy Lemieux Ameer.
LA-LRU: A Latency-Aware Replacement Policy for Variation Tolerant Caches Aarul Jain, Cambridge Silicon Radio, Phoenix Aviral Shrivastava, Arizona State.
Leakage reduction techniques Three major leakage current components 1. Gate leakage ; ~ Vdd 4 2. Subthreshold ; ~ Vdd 3 3. P/N junction.
Penn ESE535 Spring DeHon 1 ESE535: Electronic Design Automation Day 24: April 22, 2015 Statistical Static Timing Analysis.
STA with Variation 1. 2 Corner Analysis PRCA (Process Corner Analysis):  Takes 1.nominal values of process parameters 2.and a delta for each parameter.
Variation-Tolerant Circuits: Circuit Solutions and Techniques Jim Tschanz, Keith Bowman, and Vivek De Microprocessor Technology Lab Intel Corporation,
Variation. 2 Sources of Variation 1.Process (manufacturing) (physical) variations:  Uncertainty in the parameters of fabricated devices and interconnects.
Patricia Gonzalez Divya Akella VLSI Class Project.
EE201C : Stochastic Modeling of FinFET LER and Circuits Optimization based on Stochastic Modeling Shaodi Wang
A High-Speed & High-Capacity Single-Chip Copper Crossbar John Damiano, Bruce Duewer, Alan Glaser, Toby Schaffer, John Wilson, and Paul Franzon North Carolina.
In-Place Decomposition for Robustness in FPGA Ju-Yueh Lee, Zhe Feng, and Lei He Electrical Engineering Dept., UCLA Presented by Ju-Yueh Lee Address comments.
EE415 VLSI Design THE INVERTER [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
© PSU Variation Aware Placement in FPGAs Suresh Srinivasan and Vijaykrishnan Narayanan Pennsylvania State University, University Park.
Paper by F.L. Kastensmidt, G. Neuberger, L. Carro, R. Reis Talk by Nick Boyd 1.
University of Michigan Advanced Computer Architecture Lab. 2 CAD Tools for Variation Tolerance David Blaauw and Kaviraj Chopra University of Michigan.
Fault-Tolerant Resynthesis for Dual-Output LUTs Roy Lee 1, Yu Hu 1, Rupak Majumdar 2, Lei He 1 and Minming Li 3 1 Electrical Engineering Dept., UCLA 2.
Overview Modern chip designs have multiple IP components with different process, voltage, temperature sensitivities Optimizing mix to different customer.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Challenges in Nanoelectronics: Process Variability
Impact of Parameter Variations on Multi-core chips
FPGA Glitch Power Analysis and Reduction
Post-Silicon Calibration for Large-Volume Products
Presentation transcript:

Recent Challenges

2 Soft Errors Scaling:  SEU (Single-event upset): −Ionizing radiation corrupts data stored  Cause: −Radioactive impurities in device packages −Recently: cosmic radiation  Scaling worsens SEU: 1.Voltage scaling + reduced node capacitances −  lower the charge threshold necessary to corrupt the data 2.Greater level of integration −  increases the likelihood that soft errors will affect the device

3 SEU Sources:  Configuration memory  Flip-flops  Memory blocks  Combinational circuits (transient error  permanent)

4

5 SEU in Configuration Memory SEU in cinfiguration bits (SRAM-based):  In Virtex FPGAs, ~ 91% of sensitive bits to soft errors are configuration bits −flash- or antifuse-based do not suffer  Any change to the configuration memory may alter the functionality  Persist until FPGA is reprogrammed

6 SEU Mitigation Techniques Mitigation techniques: 1.Circuit and technology-level: −Addition of metal capacitors to nodes in the memory  increases the amount of charge necessary to cause SEU 2.System-level: −Ensures that the system can detect and recover. −Regularly verify their configuration memory by comparing the current values with the desired configuration state using cyclic redundancy checks (Altera Stratix III) 3.User-level: a)TMR (triple modular redundancy): −Replicating a design three times and voting among outputs −Reduce the sensitivity to soft errors in the design by careful selection of the resources used

7 Circuit Level [Ebrahimi]:  Reduce # SRAM cells in a switch box (6  5)( 6  4)

8 Circuit Level [Ebrahimi]:  Reduce # SRAM cells in a switch box (6  5)( 6  4)

9 User Design Level Care bits [Golshan07] :  Only a subset of configuration bits affect the design due to SEU. Resource A is used for net A  A-B SRAM is not a care bit if B is not used by other nets.  A-C SRAM bit is a care bit (change to ‘1’ hurts net A).  A-D SRAM bit is not a care bit (w.r.t. net A) if D not used.

10 User Design Level Soft Error Routing Problem [Golshan07]:  Given a routing graph and a set of multi-terminal nets, route each net with the least care-cost, where care- cost is the number of routing care bits. Experiments:  14% reduction in the number of care bits −~80% of soft errors in the FPGA: configuration memory [Kuon07]

Recent Challenges Process Variation

12 Process Variation Sources x Wafer X Wafer Y 2.0 [IBM, Intel and TSMC]

13 Variation Variations Variation of variation over years Variation from mean value −Gate oxides are so thin that a change of one atom can cause a 25 percent difference in substrate current. −EE Times (04/11/2006) ILD: inter-layer dielectric

14 Statistical Description  The combined set of underlying deterministic and random contributions are lumped into a combined “random” statistical description.  For devices on one wafer, the distribution (mean and variance) for L can be different from devices within a single die.

15 Inter-die vs. Intra-die Variations Figures are courtesy of IBM, Intel and TSMC Intra-die spatial Correlation Inter-die global Correlation L eff

16 Impact of Variation Importance of variation:  Timing violations −  Yield loss

17 Impact of Variation Process variations can cause  up to 2000% variation in leakage current and  30% variation in frequency in 180nm CMOS −Borkar, S., Karnik, T., Narenda, S., Tschanz, J., Keshavarzi, A., De, V. Parameter Variations and Impact on Circuits and Microarchitecture. In Proc. of DAC (2003),

18 Impact of Variation Die-to-die frequency variation

19 Variation in FPGA Binning:  Historically: most of variation between dies −  FPGA manufacturers test the speed of each FPGA after manufacturing and binning each device according to its speed. −Higher speeds: more expensive −Unacceptable leakage power: discard the device  More recently: significant within die variation −Cannot be leveraged in the same manner −Operating speeds must be reduced to maintain functionality −90nm: speed reduction of 5.7% −22nm: speed reduction of 22.4%

20 Solutions Architectural solution: 1.Select the logic block architecture parameters to minimize this variation −LUT size is particularly important [Wong05] −LUT size = 4 : highest leakage yield −LUT size = 7 : highest timing yield −LUT size = 5 : maximum combined leakage and timing yield. 2.Adaptively compensate for any variation through body- biasing [Nabaa06]: −Slow blocks: set to a body bias  decrease V t  increase block’s speed −Fast blocks: increase threshold voltage  reduce leakage power  Experiments: −  Area penalty: 1%–2% −  Delay variability reduction: 30% −  Leakage variability reduction: 78%

21 Solutions CAD-Level: 1.Statistical static timing analysis (SSTA) in FPGA CAD tools −  Improve delays by avoiding the margins that are necessary for traditional STA 2.Testing multiple logically equivalent configurations of the FPGA to find one that is functional at the desired speed [Sedcole07] 3.Generating critical paths that will be more robust in the face of variation [Matsumoto07]

22 Inter-die vs. Intra-die Variations P 0 = nominal design value ΔP intradie = intra-die variation (within a given chip) Δ P interdie = Inter-die variation (from one chip to another) Δ P e = remaining “random” or unexplained variation  P: a structural or electrical parameter e.g. −W, −tox, −Vth, −channel mobility, −coupling capacitances, −line resistances.

23 Corner Analysis PRCA (Process Corner Analysis):  Takes 1.nominal values of process parameters 2.and a delta for each parameter by which it varies.  Finds −performance as max and min values. Pros:  Simple Cons:  conservative  inaccurate

24 Corner Analysis PRCA shortcoming:  Process corners are believed to coincide with performance corners. −Fact: best-case corner may not depend on P min or P max for a particular interconnect parameter but on a value within that range.

25 SSTA

26 Solutions CAD-Level: 2.Testing multiple logically equivalent configurations of the FPGA to find one that is functional at the desired speed [Sedcole07]

27 References [Kuon07] Kuon, Tessier, “FPGA Architecture: Survey and Challenges,” Foundations and Trends in Electronic Design Automation, Vol. 2, No. 2 (2007) 135–253. [Lin07] Yan Lin and Lei He, Device and Architecture Concurrent Optimization for FPGA Transient Soft Error Rate, ICCAD 2007 [Golshan07] S. Golshan and E. Bozorgzadeh, “Single-event- upset (SEU) awareness in FPGA routing,” in DAC ’07: [Xilinx] [Altera] [Wong05] H.-Y.Wong, L. Cheng, Y. Lin, and L. He, “FPGA device and architecture evaluation considering process variations,” in ICCAD, [Nabaa06] G. Nabaa, N. Azizi, and F. N. Najm, “An adaptive FPGA architecture with process variation compensation and reduced leakage,” DAC, 2006.

28 References [Sedcole07] P. Sedcole and P. Y. K. Cheung, “Parametric yield in FPGAs due to within-die delay variations: A quantitative analysis,” in FPGA, 2007.

29 References [Matsumoto07] Y. Matsumoto, M. Hioki, T. Kawanami, T. Tsutsumi, T. Nakagawa, T. Sekigawa, and H. Koike, “Performance and yield enhancement of FPGAs with within-die variation using multiple configurations,” in FPGA 2007.