Automotive-semiconductors Functional Safety

Slides:



Advertisements
Similar presentations
IHP Im Technologiepark Frankfurt (Oder) Germany IHP Im Technologiepark Frankfurt (Oder) Germany ©
Advertisements

Test Yaodong Bi.
Module – 3 Data protection – raid
10/14/2005Caltech1 Reliable State Machines Dr. Gary R Burke California Institute of Technology Jet Propulsion Laboratory.
A reconfigurable system featuring dynamically extensible embedded microprocessor, FPGA, and customizable I/O Borgatti, M. Lertora, F. Foret, B. Cali, L.
SLAM: SLice And Merge – Effective Test Generation for Large Systems ICCAD’13 Review Reviewer: Chien-Yen Kuo.
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
The Future of Formal: Academic, IC, EDA, and Software Perspectives Ziyad Hanna VP of Research and Chief Architect Jasper Design Automation Ziyad Hanna.
Extensible Processors. 2 ASIP Gain performance by:  Specialized hardware for the whole application (ASIC). −  Almost no flexibility. −High cost.  Use.
The Design Process Outline Goal Reading Design Domain Design Flow
Computer ArchitectureFall 2008 © November 12, 2007 Nael Abu-Ghazaleh Lecture 24 Disk IO.
COMPUTER ARCHITECTURE & OPERATIONS I Instructor: Hao Ji.
Test and Verification Solutions116 th April 2010 Silicon South West, “Testing Times” The Economics of Verification Mike Bartley, TVS.
Evaluating the Error Resilience of Parallel Programs Bo Fang, Karthik Pattabiraman, Matei Ripeanu, The University of British Columbia Sudhanva Gurumurthi.
EEL-4746 Microprocessor-based System Design Fall 2004 Semester Dr. Michael P. Frank.
Testimise projekteerimine: Labor 2 BIST Optimization
Alec Stanculescu, Fintronic USA Alex Zamfirescu, ASC MAPLD 2004 September 8-10, Design Verification Method for.
1 3-General Purpose Processors: Altera Nios II 2 Altera Nios II processor A 32-bit soft core processor from Altera Comes in three cores: Fast, Standard,
ECO Methodology for Very High Frequency Microprocessor Sumit Goswami, Srivatsa Srinath, Anoop V, Ravi Sekhar Intel Technology, Bangalore, India Introduction.
EGRE 427 Advanced Digital Design Figures from Application-Specific Integrated Circuits, Michael John Sebastian Smith, Addison Wesley, 1997 Chapter 4 Programmable.
INT-Evry (Masters IT– Soft Eng)IntegrationTesting.1 (OO) Integration Testing What: Integration testing is a phase of software testing in which.
Testing Basics of Testing Presented by: Vijay.C.G – Glister Tech.
Presenter : Ching-Hua Huang 2013/7/15 A Unified Methodology for Pre-Silicon Verification and Post-Silicon Validation Citation : 15 Adir, A., Copty, S.
SiLab presentation on Reliable Computing Combinational Logic Soft Error Analysis and Protection Ali Ahmadi May 2008.
FORMAL VERIFICATION OF ADVANCED SYNTHESIS OPTIMIZATIONS Anant Kumar Jain Pradish Mathews Mike Mahar.
Lach1MAPLD 2005/241 Accessible Formal Verification for Safety-Critical FPGA Design John Lach, Scott Bingham, Carl Elks, Travis Lenhart Charles L. Brown.
Lecture 16: Reconfigurable Computing Applications November 3, 2004 ECE 697F Reconfigurable Computing Lecture 16 Reconfigurable Computing Applications.
- 1 - ©2009 Jasper Design Automation ©2009 Jasper Design Automation JasperGold for Targeted ROI JasperGold solutions portfolio delivers competitive.
Verification of FT System Using Simulation Petr Grillinger.
Using Memory to Cope with Simultaneous Transient Faults Authors: Universidade Federal do Rio Grande do Sul Programa de Pós-Graduação em Engenharia Elétrica.
Silicon Programming--Testing1 Completing a successful project (introduction) Design for testability.
Introduction to Hardware Verification ECE 598 SV Prof. Shobha Vasudevan.
C++ for Engineers and Scientists, Second Edition 1 Problem Solution and Software Development Software development procedure: method for solving problems.
What is a software? Computer Software, or just Software, is the collection of computer programs and related data that provide the instructions telling.
CS223: Software Engineering Lecture 25: Software Testing.
Lecture 5: Design for Testability. CMOS VLSI DesignCMOS VLSI Design 4th Ed. 12: Design for Testability2 Outline  Testing –Logic Verification –Silicon.
This has been created by QA InfoTech. Choose QA InfoTech as your Automated testing partner. Visit for more information.
MAPLD 2005/213Kakarla & Katkoori Partial Evaluation Based Redundancy for SEU Mitigation in Combinational Circuits MAPLD 2005 Sujana Kakarla Srinivas Katkoori.
1 Introduction to Engineering Fall 2006 Lecture 17: Digital Tools 1.
Benefits of a Virtual SIL
Programmable Logic Devices
Automated Software Testing
Rad (radiation) Hard Devices used in Space, Military Applications, Nuclear Power in-situ Instrumentation Savanna Krassau 4/21/2017 Abstract: Environments.
Introduction Edited by Enas Naffar using the following textbooks: - A concise introduction to Software Engineering - Software Engineering for students-
A scalable approach for Test Automation in Vector CAST/Manage with
Soft Error Analysis of FPGA under ISO Standard
Complexity Time: 2 Hours.
nZDC: A compiler technique for near-Zero silent Data Corruption
VLSI Testing Lecture 6: Fault Simulation
runtime verification Brief Overview Grigore Rosu
Maintaining Data Integrity in Programmable Logic in Atmospheric Environments through Error Detection Joel Seely Technical Marketing Manager Military &
Prepared by Stephen M. Thebaut, Ph.D. University of Florida
Introduction Edited by Enas Naffar using the following textbooks: - A concise introduction to Software Engineering - Software Engineering for students-
Introduction to Software Testing
NVIDIA Fermi Architecture
T Computer Architecture, Autumn 2005
Design for Testability
Sequential circuits and Digital System Reliability
Design of a ‘Single Event Effect’ Mitigation Technique for Reconfigurable Architectures SAJID BALOCH Prof. Dr. T. Arslan1,2 Dr.Adrian Stoica3.
Analytical Approach for Soft Error Rate Estimation of SRAM-Based FPGAs
TECHNICAL SEMINAR PRESENTATION
The performance requirements for DSP applications continue to grow and the traditional solutions do not adequately address this new challenge Paradigm.
Chapter 1 Introduction(1.1)
Chapter 1 Introduction.
Avidan Efody, Mentor Graphics Corp.
Software Engineering Group, Motorola India Electronics Pvt. Ltd.,
Functional Safety Solutions for Automotive
Seminar on Enterprise Software
Presentation transcript:

Jamil R. Mazzawi jamil@optima-da.com www.optima-da.com Automotive-semiconductors Functional Safety A practical chip design solution for functional safety in vehicles Introduction to ISO-26262 challenges for IC’s Jamil R. Mazzawi jamil@optima-da.com www.optima-da.com

New challenges for design and verification engineers & Functional Safety New challenges for design and verification engineers What does it mean? What are the requirements? What are we protecting from? How can we protect? How can we measure our work, and improve it? How can we get certified?

Today considered very hard to achieve in one slide 5 Levels of Safety Low-Risk High-Risk QM ASIL-A ASIL-B ASIL-C ASIL-D *QM – Quality Management **ASIL – Automotive Safety Integrity Level Today considered very hard to achieve QM – No safety requirements beyond basic quality ASIL-A (least requirements) ASIL-B ASIL-C ASIL-D (highest requirements) Required Level Determined by Exposure (probability) Severity (potential harm) Controllability (driver ability to avoid)

Hard to get to ASIL-C and ASIL-D No clear methodologies ISO-26262 challenges Hard to get to ASIL-C and ASIL-D No clear methodologies Immense amount of fault-simulations needed Current EDA tools running out of steam Hard to get to ASIL-C and ASIL-D Using today’s tools, ASIL-D is considered very hard to achieve Especially on big and complex chips And specially when involving a multi-threaded complex CPU The challenges are: No clear methodology or automated tool for measuring and reducing soft-error FIT rate No methodology at all for creating ASIL-D multi-threaded CPU’s Each of the existing Safety-Mechanism methodologies for permanent-fault detection has weakness Immense amount of fault-simulations needed: All the FuSa steps involve immense amount of fault simulations The size of this computational task can reach hundreds and thousands of years of CPU time

ISO-26262 requirements for IC’s (intro)

3 types of safety concerns (faults): Systemic faults Failure due to errors in implementation (“bugs”) This is the domain of functional validation Random faults Failure due to the environment impacting a specific chip Transient (soft-error) or permanent (hard-error) Safety Of The Intended Functionality (SOTIF) Absence of unreasonable risk of the intended functions Optima hosted SOTIF meeting in Nazareth Oct 2017 The Working Group have separated the SOTIF from a Part in ISO-262626 into a new standard. Our focus

Transient-faults (Soft-errors/SEU/SET): What are they? Bit-flips caused mostly by cosmic-rays (radiation coming from the Sun)

Transient-faults (Soft-errors/SEU/SET) Where do they hit? Memory bits: Single or multiple bits Gates: Combinatorial logic SET – Single-Event-Transient Flip-flops: Bit-flip in a single flop In FPGA: Also on configuration memory Protecting against them Memory: ECC and bit dealignment Gates: Low-probability, not considered an issue by most experts Flops: Next slides

Existing solutions and challenges Transient faults

Protecting against Transient-faults at the flops: Unit-level Lockstep mechanism (cost: 70% more silicon) Hardening all flops (cost: 30% more silicon) Selective flip-flop hardening (cost: 1-5% more silicon) Design/RTL level mechanisms: Parity, encoding etc. Silicon level: Using Rad-Hard or OLD nodes (180 nm...)

Selective hardening process: Measure derated-FIT rate Decide is hardening needed? Perform hardening on selected flops Calculate post-hardening FIT rate A B C D Does your derated-FIT rate meet your requirements? Hardening means: replace the flop with hardened flop, with lower or close-to-0 FIT rate Many project have 2 or more kinds of flops in their library: regular flop, hardened-flop, extra-hardened-flop In most cases, hardening less than 5% of the flops will lower the FIT to close to 0 Hence meeting ASIL-D requirements with minimal silicon cost Optima-SE performs this step 10,000 to 100,000 times faster than regular RTL simulators

Permanent-faults or Hard-errors What are they? Permanent damage to a transistor Fault models: Stuck-at-0 Stuck-at-1 Bridging-Fault Etc.

Hard-errors: ISO-26262 requirement (simplified) Chip/IP needs to have “Safety Mechanisms” (SM) The SM needs to detect HE’s Detection needs to happen while the chip is working (on-the-fly) Detection needs to be within the budgeted time interval (for example 0.25ms to 100ms) from the time they happen SM needs to meet Coverage requirement The SM need to be able to detect no less than N% of the possible faults Different ASIL levels have different N For example: ASIL-D: N=99%

Existing solutions and challenges Permanent faults

Permanent-faults Safety Mechanisms: Lockstep – unit level STL – Software Test Library Logic-BIST Many other methodologies…

Lockstep methodology: (simplified) Cache-Unit (master) Unit Inputs Phase shift flop Unit outputs Cache-Unit (shadow) Phase shift flop Compare outputs Fault_detected

Lockstep methodology Does not always achieve “99%” coverage This was proven on number of designed examined by Optima Are you duplicating internal memories or not? Comparing internal memories I/O? Important to “verify” the Lock-step mechanism for Correctness Measure detection coverage using fault-simulations Using regular simulators: can be 100’s of years computational task

STL – Software test library A Software that run on the chip/IP/unit (usually only for CPUs) Test the unit for stuck-at hard errors Usually it is: Can not achieve high coverage It is labor intensive to improve the coverage Advantage: Low silicon cost

Permenant-faults: Measuring SM Coverage Measuring and improving SM coverage is needed: For all SM methodologies (STL, Lockstep, etc…) To make sure we meet our ASIL targets To prove to our customers and auditors Need to perform fault-simulation on all gates Measure if the SM can detect this fault or not Run all needed fault models Stuck-at-0 Stuck-at-1 Bridging-fault Tristate-fault Etc. Need to be done on gate-level The compute task is immense: Number of gates X 2 X time-per-fault-gl-simulation 100M gates * 2 faults * 2 min = 400 M minutes = 761 years

Permanent-faults: Measuring SM Coverage To meet ASIL target To prove to our customers and auditors Needed for all type of SM’s Done on gate-level Multiple fault simulations per gate Need to perform fault-simulation on all gates Stuck-at-0, Stuck-at-1 Bridging-fault Etc. Run on multiple fault models Number of gates * 2 * time-per-fault-sim 100M gates * 2 faults * 2 min = 400 M minutes = 761 years The compute task is immense:

Development Process: for STL/Lockstep Write STL or impl. Lockstep Run Optima-HE Examine Coverage Results: Meeting req? Examine Coverage Booster outputs Fine-tune STL based on CB iteration A B C D E No Yes Done Optima-HE does this step over 1,000 times faster than our competitor Reducing this step from weeks to hours Note: The same process is used for all types of SM’s for HE detection STL has the most iterations...

Optima Automotive Safety Platform for ISO-26262 ASIL-D Optima-SE™ Complete Soft Error Solution Soft Error simulation Selective flip flop hardening Reduce your FIT rate to ASIL-D level with low silicon cost Both Pre-silicon and Post-Silicon Applications Optima-HE™ Hard Error Coverage measurement & Boosting Hard Error safety mechanism coverage Coverage Booster Automate the converge raising effort Other offering Functional Safety services Integration with ANSYS Medini Safety platform More tools and details at our booth or under NDA All based on Optima’s Fault Injection Engine Over 100,000 faster than RTL simulators Over 1,000 faster than all other fault-simulators

Automotive-semiconductors Functional Automation See you at our booth!!! the sweetest giveaway at ChipEx -> www.optima-da.com info@optima-da.com