WHY THEY FAILED AND LESSONS TO BE DRAWN Samuel Franklin G53QAT: Quality Assurance and Testing Famous Software Failures.

Slides:



Advertisements
Similar presentations
©Ian Sommerville 2004Software Engineering Case Studies Slide 1 The London Ambulance fiasco l The London Ambulance Service (LAS) Computer Aided Despatch.
Advertisements

The Failure of the London Ambulance Service Michael McDougall CIS 573 November 16 th, 1999.
CML CML CS 230: Computer Organization and Assembly Language Aviral Shrivastava Department of Computer Science and Engineering School of Computing and Informatics.
OHT 2.1 Galin, SQA from theory to implementation © Pearson Education Limited 2004 Software Quality assurance (SQA) SWE 333 Dr Khalid Alnafjan
1. Software in our lives, then and now  Medical (processing and analysis, Computer Aided Surgery, other various equipment)  Financial and business (banking,
Syllabus Case Histories WW III Almost Medical Killing Machine
Chapter 3: Programming Web Services Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
Patriot Missile Failure
CSC 4250 Computer Architectures September 12, 2006 Appendix H. Computer Arithmetic.
Software Engineering Disasters
Scuds and Patriots CSE 200. History 1 st Gulf war 1991 Iraq attacking US troops using Scud missiles US defense: Patriot missiles.
CML CSE 520: Advanced Computer Architecture: Reliability Aviral Shrivastava.
Chapter 8 Representing Information Digitally.
Who were the two superpowers during the Cold War? The United States and the USSR (The Union of Soviet Socialist Republics)/ Russia. After the end of World.
Department of Computer Science City College of New York City College of New York Spring 2006 Copyright © 2006 by Abbe Mowshowitz CSc 375 SOCIAL ISSUES.
SWE Introduction to Software Engineering
1 Software Testing and Quality Assurance Lecture 37 – Software Quality Assurance.
Topic 10Summer London Ambulance System Some of the slides created by Sommerville.
Toward A Reasonable Programmer Standard Responsibility and Negligence in Software Design.
The London Ambulance fiasco
EC4019PA Intrusion & Access Control Technology (IACT) Chapter 4- CAMS Prepared by Sandy Tay.
Software Errors Who is to blame?. Almost everything in our daily lives is controlled by CPU’s and software… Does Embedded Software = Embedded Disasters?
Assessment of BMD Global capabilities Missile Defence as a Factor in Establishing a New Security Environment International Conference Moscow, 3-4 May 2012.
USS Yorktown (1998) A crew member of the guided-missile cruiser USS Yorktown mistakenly entered a zero for a data value, which resulted in a division by.
The Unintended Consequences of a career in Engineering Or How to end up a mass murderer without even trying.
2.2 Errors. Why Study Errors First? Nearly all our modeling is done on digital computers (aside: what would a non-digital analog computer look like?)
9/12/2015Dr Andy Brooks1 Lecture 3 London Ambulance System (LAS) 26 & 27 October and November FOR0383 Software Quality Assurance.
An Overview of Professional Services and NonStop Migration BPC and HP NonStop seminar Athens, 12 th October,2011 Tokhir Abdukadyrov, Deputy COO.
Information System Security and Control
FORESEC Academy FORESEC Academy Security Essentials (II)
SE513 Software Quality Control Lecture01: Introduction to Software Quality Assurance Galin, SQA from Theory to Education Limited.
Software is:  Computer programs, procedures, and possibly associated documentation and data relates to the operation of a computer system. [IEEE_Std_ ]
Introduction to Software Quality Assurance
By Shubham GOEL ECE-III year xyz. Introduction Automatic Vehicle Locator (AVL) is a computer – based vehicle tracking system The actual real –time position.
5.2 Errrors. Why Study Errors First? Nearly all our modeling is done on digital computers (aside: what would a non-digital analog computer look like?)
WAR.
1 CS 501 Spring 2002 CS 501: Software Engineering Lecture 24 Delivering the System.
© ABB Ltd - Page 1 JB Control IT AC800 M/C OPC Server.
CS 5150 Software Engineering Lecture 20 Reliability 1 (and a little Privacy)
The Cold War and the Space Race  At the conclusion of World War 2 both the United States and Russia set themselves up to be super powers  This rivalry.
“I am not in the office at the moment. Send any work to be translated.”
TWO EXAMPLES AS MOTIVATION FOR THE STUDY OF COMPUTER ERRORS
Dallas Fire-Rescue Communications Division Dispatch Operations September 14, 2015.
1 Software Quality Assurance COMP 4004 Notes Adapted from S. Som é, A. Williams.
Software Integrity and Cyber Security NAMEPA: Managing Change in a Changing World Jim Watson Division President & COO, Americas Division Management New.
The Unintended Consequences of a career in Engineering Or How to end up a mass murderer without even trying.
Unintended Consequences of a career in Engineering.
29 March Software Quality and Testing. Why do we care? Therac-25 (1985) Multiple space fiascos (1990s) Ariane V exploded after 40 seconds (conversion)
Update of SAM Implementation ALICE TF Meeting 18/10/07.
MAJOR SOFTWARE FAILURES, WHY THEY FAILED AND LESSONS LEARNED BY AKPABIO UWANA.
Today we will identify and describe the events of September 11 th, By discussing what happened that day and how it impacted our nation. To understand.
Safety Critical Systems
 Software reliability is the probability that software will work properly in a specified environment and for a given amount of time. Using the following.
Managed IT Services JND Consulting Group LLC
SOFTWARE FAILURES.
School of Business Administration
The Space Race.
Patriot Missile Failure
Objectives Explain how the Cold War turned into an arms race.
The Accident On October 26th 1992 the London Ambulance System failed.
The Space Race.
Amendment Invoice Task Force Progress Report
Amendment Invoice Task Force Progress Report
Global Cold War Through 1962
Reliability and Safety
Sources of Error Major: All Engineering Majors
Finish circling the vowel sounds and slashing the words apart.
Software Engineering Disasters
The London Ambulance fiasco
Presentation transcript:

WHY THEY FAILED AND LESSONS TO BE DRAWN Samuel Franklin G53QAT: Quality Assurance and Testing Famous Software Failures

Overview  Three Software Failures  Patriot Missile  Russian Satellite Missile Detection  London Ambulance Service  Summary of Findings  Questions

The Patriot Missile Failure  Feb 1991 – Gulf War  Failed to intercept Scud missile from Iraq  28 dead  100 injured  Error from storing value in fixed point register The Patriot FailingThe Patriot in action

Why it went wrong HoursSecondsCalculated Time (sec)Inaccuracy (sec) Approx. shift in Range Gate (meters) (a) (b)  The system had been running for 100 hours  The calculations were out by 0.34 seconds  Missed the Scud by over 600 meters WOULD MISS AFTER 20 HOURS

What American learnt from this  USA knew of the fault from Israeli Military  American’s did not reboot regularly enough  Software update arrived day after the death of the soldiers

Russian Satellite Missile Detection System  Put in place to detect threats from America during cold war  Stanislav Petrov monitored system on 26 th September 1983  Oko alerted Petrov that 5 missiles were heading towards Russia.  Petrov had to choose:  Declare it a false alarm  Start a counterstrike and probably a Nuclear war

Stanislav Petrov The Man Who Saved the World

What Russia learnt from this  The Russians dissected the Oko System  Found the software full of bugs  Launched the SPRN-2 Prognoz to supplement the Oko system  Cost of this failure could have been: World War III

London Ambulance Fiasco  London Ambulance Service (LAS) introuduced a Computer Aided Dispatch System (CAD) on 26 th October 1992  LAS: Carry over 5000 patients per day Receive approx 2500 calls per day 65% of calls are emergency  New system needed to have near 100% accuracy and full cooperation from all LAS to succeed

26 th October 1992  The new CAD system could not handle the volume of call – regular use  Response time became several hours  Communications between ambulance and LAS lost  System had:  Poor interface between crews and the system  Number of technical problems: Failed to identify duplicate calls Did not prioritise exception messages

What London learnt from this  Do not use direct conversion  Implement in step-by-step fashion  Full consultation  Quality assurance and testing  User training

Conclusion  Testing is essential  All critical systems  Rush to get system in place is bad  Training  Value of humans in the process

Any questions? Questions and Discussion