Spring 2008 CSE 591 Compilers for Embedded Systems Aviral Shrivastava Department of Computer Science and Engineering Arizona State University.

Slides:



Advertisements
Similar presentations
IHP Im Technologiepark Frankfurt (Oder) Germany IHP Im Technologiepark Frankfurt (Oder) Germany ©
Advertisements

January 4-8, 2008VLSI Design Single Event Upset An Embedded Tutorial Fan Wang Vishwani D. Agrawal Department of Electrical and Computer Engineering.
MURI Neutron-Induced Multiple-Bit Upset Alan D. Tipton 1, Jonathan A. Pellish 1, Patrick R. Fleming 1, Ronald D. Schrimpf.
Sp09 CMPEN 411 L16 S.1 CMPEN 411 VLSI Digital Circuits Spring 2009 Lecture 16: Introduction to Soft Errors [Adapted from Rabaey’s Digital Integrated Circuits,
April 30, Cost efficient soft-error protection for ASICs Tuvia Liran; Ramon Chips Ltd.
Microprocessor Reliability
ACTIVITY ON THE SUN: Prominences Sunspots Solar Flares CME’s – Coronal Mass Ejections Solar Wind Space Weather.
Future beyond CTF3: ESA/Electron testing January 27 th 2015 CLIC Workshop 2015: Future tests beyond CTF3 January 27 th 2015 Radiation Testing with CALIFES.
® 1 Shubu Mukherjee, FACT Group Cache Scrubbing in Microprocessors: Myth or Necessity? Practical Experience Report Shubu Mukherjee Joel Emer, Tryggve Fossum,
Space Weather Causes and Consequences An introduction to Space Weather What is it? Where does it come from? Who is impacted? Rodney Viereck NOAA Space.
Cosmic rays in solar system By: Tiva Sharifi. Cosmic ray The earth atmosphere is bombarded with the energetic particles originating from the outer space.
May 14-16, 2008 NATW' Probabilistic Soft Error Rate Estimation from Statistical SEU Parameters Fan Wang* Vishwani D. Agrawal Department of Electrical.
Working Group 2 - Ion acceleration and interactions.
March 16-18, 2008SSST'20081 Soft Error Rate Determination for Nanometer CMOS VLSI Circuits Fan Wang Vishwani D. Agrawal Department of Electrical and Computer.
Danish Space Research Institute Danish Small Satellite Programme FH Space_Environment.ppt Slide # 1 Flemming Hansen MScEE, PhD Technology Manager.
The Sun The Sun in X-rays over several years The Sun is a star: a shining ball of gas powered by nuclear fusion. Luminosity of Sun = 4 x erg/s =
Radiation Effects in Microelectronics EE-698a Course Seminar by Aashish Agrawal.
Unreliable Silicon: Myth or Reality? Shubu Mukherjee Principal Engineer Director, SPEARS Group (SPEARS = Simulation & Pathfinding of Efficient And Reliable.
Spring 07, Apr 17, 19 ELEC 7770: Advanced VLSI Design (Agrawal) 1 ELEC 7770 Advanced VLSI Design Spring 2007 Soft Errors and Fault-Tolerant Design Vishwani.
Henry L. Clark, Ph D Accelerator Physicist / SEE Line Project Manager / Upgrade Project Manager Cyclotron Institute, Texas A&M University.
Learning Targets I can name the force that holds the atom’s nucleus together I can describe the two reasons why some isotopes are radioactive. I can describe.
1 Dependability Benchmarking of VLSI Circuits Cristian Constantinescu Intel Corporation.
1 Efficient Analytical Determination of the SEU- induced Pulse Shape Rajesh Garg Sunil P. Khatri Department of ECE Texas A&M University College Station,
The Dangers of Solar Storms and Solar Cycles.  For every 1 million atoms of hydrogen in the entire sun  98,000 atoms of helium  850 of oxygen  360.
Structure & Function. Our Nearest Star  Core  Comprises about 25% of sun’s interior  site of nuclear fusion  Radiative Zone  Energy produced in.
The Sun Our Nearest Star. The Source of the Sun’s Energy The Source of the Sun’s Energy Fusion of light elements into heavier elements. Hydrogen converts.
Chapter 21 Nuclear Chemistry John D. Bookstaver St. Charles Community College Cottleville, MO Lecture Presentation © 2012 Pearson Education, Inc.
Chapter 21 Nuclear Chemistry
CML CSE 591: Advances in Reliable Computing Aviral Shrivastava.
Copyright © 2008 UCI ACES Laboratory Kyoungwoo Lee 1, Aviral Shrivastava 2, Nikil Dutt 1, and Nalini Venkatasubramanian 1.
NASA/NSTA Web Seminar: Radiation – Can’t Live With It, Can’t Live Without It LIVE INTERACTIVE YOUR.
Important Missions for Solar System Exploration An overview from early ’50’s to today… Part I.
Single Event Effects in microelectronic circuits Author: Klemen Koselj Advisor: Prof. Dr. Peter Križan.
Space Radiation and Fox Satellites 2011 Space Symposium AMSAT Fox.
Anatomy of an Atom Parts of an Atom Nucleus (positive, mass of 1 amu) Neutron (, mass of 1 amu) Electron Cloud Electrons (, negligible mass)
SiLab presentation on Reliable Computing Combinational Logic Soft Error Analysis and Protection Ali Ahmadi May 2008.
Soft errors in adder circuits Rajaraman Ramanarayanan, Mary Jane Irwin, Vijaykrishnan Narayanan, Yuan Xie Penn State University Kerry Bernstein IBM.
What is it to be Radioactive? Radioactive decay is the process in which an unstable atomic nucleus loses energy by emitting radiation in the form of particles.
What is radiation? NUCLEAR DECAY HALF-LIFE SOURCES OF RADIATION IONIZING RADIATION NON- IONIZING RADIATON Final Jeopardy.
Seattle June 24-26, 2004 NASA/DoD IEEE Conference on Evolvable Hardware Self-Repairing Embryonic Memory Arrays Lucian Prodan Mihai Udrescu Mircea Vladutiu.
Chapter 21 Nuclear Chemistry. The Nucleus Remember that the nucleus is comprised of the two nucleons, protons and neutrons. The number of protons is the.
1.3-1 Types of Radioactivity.  By the end of this section you will be able to: ◦ Observe nuclear changes and explain how they change an element. ◦ Express.
Radioactivity and you! ….and through you! …..and all around you!
The Dangers of Solar Storms and Solar Cycles.  Radius = 696,000 km  Mass = 2E30 kg  Luminosity = 3.8E26 W  Rotation Rate  25 days at the equator.
Cosmic Rays GNEP Teacher Workshop Steve Shropshire, July 2007.
Space Environment SSE-120 Please type in your questions and raise your hand so we can answer it during class.
Effects of Surrounding Materials on Proton-Induced Energy Deposition in Large Silicon Diode Arrays Christina L. Howe 1, Robert A. Weller 1, Robert A. Reed.
Chapter 25 Nuclear Chemistry
Copyright 2005, M. Tahoori1 Soft Error Modeling and Mitigation Mehdi B. Tahoori Northeastern University
SEU Hardening Incorporating Extreme Low Power Bitcell Design (SHIELD)
What is radioactivity? lecture 9.1 Gamma ray bursts from a star collapsing into a Black Hole.
Energy Unit Learning Goal 3: Examine how changes in the nucleus of an atom result in emissions of radioactivity.
1. What is radioactivity? Radioactivity is the process in which an unstable atomic nucleus emits charged particles and energy. 2. What is a radioisotope?
Spring 2008 CSE 591 Compilers for Embedded Systems Aviral Shrivastava Department of Computer Science and Engineering Arizona State University.
A Novel, Highly SEU Tolerant Digital Circuit Design Approach By: Rajesh Garg Sunil P. Khatri Department of Electrical and Computer Engineering, Texas A&M.
CS203 – Advanced Computer Architecture Dependability & Reliability.
Radiation Applications Application Cards Find the type of radiation used for your application. Note: If more than one type – choose one Group according.
Unstable Nuclei and Radioactive Decay. Radioactivity (Radioactive decay) The process by which some substances spontaneously emit radiation. Radioactive.
Rad (radiation) Hard Devices used in Space, Military Applications, Nuclear Power in-situ Instrumentation Savanna Krassau 4/21/2017 Abstract: Environments.
SE-Aware HPC Extension : Selective Data Protection for reducing failures due to soft errors 7/20/2006 Kyoungwoo Lee.
Geant4 and Microelectronics – Recent Successes, Looming Concerns
Spacecraft Anomaly Analysis and Prediction System – SAAPS
Towards the design of tomorrow’s Reliable Computing Systems
Soft Error Rates with Inertial and Logical Masking
and its effect on various processes
R.W. Mann and N. George ECE632 Dec. 2, 2008
The Centre of the Solar System Earth Science 11
Presentation transcript:

Spring 2008 CSE 591 Compilers for Embedded Systems Aviral Shrivastava Department of Computer Science and Engineering Arizona State University

Lecture 2: Soft Errors

Beginings.. □ nuclear tests □Electronic monitoring equipment failure □could not identify the reason!! □Worked fine after rebooting □no hardware fault, no permanent fault □ Wallmark and Marcus □Surmised that cosmic rays can cause failures in electronic systems □Minimum Size and Maximum Packing Density of Non-Redundant Semiconductor Devices □1978 – May and Woods of Intel □Reported alpha particle induced soft errors in the 2107-series 16- KB DRAMs. □1979 – Ziegler and Lanford of IBM □presented solid evidence that, the electronic sensitivity to radiation-induced soft errors could become a nightmare for the future technologies.

First Space Casualty □Telestar □First communication satellite □ATT Bell Telephone, NASA, British GPO, and French PTT □Launched July 10, 1962 □July 23 - live transatlantic television signal □Supposed to telecast speech from President John. F. Kennedy □Instead telecasted major league baseball □Telstar ushered in a new age of the benevolent use of technology □July 9, 1962 □United States tested a high-altitude nuclear device (called Starfish Prime) which super-energized the Earth's Van Allen Belt where Telstar took orbitStarfish Prime Van Allen Belt □100X increase in radiation □Out of service in December, repaired, but unusable after February

Saving Galileo □1978 – Galileo commissioned for Jupiter exploration □1980 – Design and Architecture decided □Use of AT 2901 for attitude control □1982 – Voyager reaches Jupiter □Intermittent Resets □Sulfur ions from Jupiter’s volcanic moon, Io, were being whipped up to high energy by the Jovian gravity. □After extensive testing of Galileo, chief engineer decided “not worth flying if soft error problem not solved” □Overheads □5 years, 5 million dollars □Sandia National Laboratories was subcontracted to custom-make radiation hardened 2901

Recent – Hubble Space Telescope □Intermittent resets after 1996 upgrade of software on Hubble Space Telescope □South Atlantic Anomaly

Sun-earth Interactions □11 year solar cycle of sun-spots □10 9 kg/s of material lost by the Sun as ejected solar wind. □Protons (~70%), electrons, ionized helium, less than 0.5% minor ions. □2x10 10 protons/cm 2 □Loose of satellites

Copyright 2005, M. Tahoori 8 Impact on Earth-bound Electronics □Documented strikes in large servers found in error logs □Normand, “Single Event Upset at Ground Level,” IEEE Transactions on Nuclear Science, Vol. 43, No. 6, December □Sun Microsystems, 2000 (R. Baumann, 2002 IRPS Workshop talk) □Cosmic ray strikes on L2 cache with defective error protection □caused Sun’s flagship servers to suddenly and mysteriously crash! □Companies affected □Baby Bell (Atlanta), America Online, Ebay, & dozens of other corporations □Verisign moved to IBM Unix servers (for the most part) □Cisco line cards may reset after single event upset (SEU) failures

Copyright 2005, M. Tahoori 9 Reactions from Companies □Fujitsu SPARC in 130 nm technology □80% of 200k latches protected with parity □compare with very few latches protected in Mckinley □ISSCC, 2003 □IBM declared 1000 years system MTBF as product goal □for Power4 line □very hard to achieve this goal in a cost-effective way □Bossen, 2002 IRPS Workshop Talk

Evolution of a Product’s Team’s Psyche □Shock □“SER is the crabgrass in the lawn of computer design” □Denial □“We will do the SER work two months before tapeout” □Anger □“Our reliability target is too ambitious” □Acceptance □“You can deny physics only for so long”

Growing Problem □Is going to become a everyday problem (omnipresent) for every devices (ubiquitous) □Soft Errors in Embedded Systems □Not only a space phenomenon anymore!

Phenomenon of Soft Error □Transient Faults □Random and spontaneous bit-changes in system □Can be caused by □Circuit noise □Cross-talk □More than 50% due to radiation strike

Causes of Soft Errors □Alpha particles emitted by traces of uranium, thorium, or lead impurities in packaging materials □Alpha particles emitted by decaying radioactive impurities in packaging and interconnect materials. (plastic packages is the worst. Ceramic,HyperBGA, Flip-chip PBGA) □High-energy ( > 1 MeV) neutrons from cosmic radiation can induce soft errors in semiconductor devices via secondary ions produced by the neutron reaction with silicon nuclei □Less than 1% of the primary flux reaches ground level □Secondary radiation induced from the interaction of low-energy neutrons and boron □Boron-10 in BPSG (Borophosphosilicate glass) □New process technologies use highly refined packaging and no boron □2 nd effect is the most important □Shielding is effective only for low-energy neutrons □High energy neutrons can pass through 6 feet of concrete

LET Spectrum □Linear energy transfer □Measure of energy deposition □MeV per mg/cm 2, MeV/μ or pC/μ

Metrics □FIT: Failure in Time □No. of failures in 1 billion hours of operation □MTTF: Mean Time To Failure □1000 FITs => MTTF of 114 years □1 GByte of 500 FIT/Mbit can expect an error every two weeks □ECC reduces failure rate by 2 orders of magnitude □hypothetical Terabyte system would experience a soft error every few minutes

Trends □DRAM □System error rate of DRAMs is fairly constant □SRAM □Increasing exponentially □Logic □Increasing exponentially

Masking Effects □Logic Masking □Electrical Masking □Latching Window Masking □Microarchitectural Masking □Software Masking