University of Toronto Department of Computer Science CSC444 Lec02 1 Lecture 2: Examples of Poor Engineering “Software Forensics” Case Studies Mars Pathfinder.

Slides:



Advertisements
Similar presentations
CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems Resource Access Control Protocols.
Advertisements

Interprocess Communication
Space Exploration Mars Rovers, Apollo program, Voyager satellites, and SETI All Presented by the Peter C Period: 2 (two) As in 1+1=2 Or 2x1=2 ®
Mars Climate Orbiter Team Magna Corp: Tim Toba Mohamed Sahil Nyema Johnson Abner Yemaneab University of Minnesota.
Mars Rovers. Rover Details A rover is a machine that scientists use to explore the planets in our Solar System. They also use rovers to collect minerals.
Prepare for boarding MARS Navigation GPS Mars at close orbit … is 35 million miles away………
WHAT WE KNOW ABOUT MARS. 1.What’s the name of the dead volcano on Mars? 2.When do dust storms occur on Mars? 3.How much would you weigh on Mars? DO THE.
Modern Exploration Global Surveyor.  Objectives:  High resolution imaging of the surface  Study the topography and gravity  Study the role of water.
CURIOSITY: Big Mars Rover for Big Mars Science! Artist’s Concept. NASA/JPL-Caltech.
IV&V of Critical Behavior September, 2012 Shirley Savarino, TASC.
Christy Edwards-Stewart Solar System Ambassador Lockheed Martin Aerospace Engineer Christine Kryscio Lockheed Martin Materials Engineer Lunar and Planetary.
Australian Centre for Space Photonics Andrew McGrath Anglo-Australian Observatory.
Astronomy190 - Topics in Astronomy Astronomy and Astrobiology Lecture 15 : Ancient Mars Ty Robinson.
ARIANE 5 FAILURE ► BACKGROUND:- ► European space agency’s re-useable launch vehicle. ► Ariane-4 was a major success ► Ariane -5 was developed for the larger.
MARS EXPLORATION PROGRAM NSTA/ITEA/NASA-NES ROBOTICS INSTITUTE Sheri Klug Mars Space Flight Facility Arizona State University Jet Propulsion Laboratory.
Mars Exploration By Jacob Stinar. Water on Mars.
Satellite communications and the environment of space Images: NASA.
Mysteries of Earth and Mars Mars Facts and Exploration.
University of Toronto Department of Computer Science © Steve Easterbrook. This presentation is available free for non-commercial use with attribution.
LSU 10/09/2007System Design1 Project Management Unit #2.
Modern Exploration Mars Pathfinder  “NASA’s Mars Pathfinder mission – the first spacecraft to land on Mars in more than 20 years and the first ever to.
Lunar CRater Observation and Sensing Satellite Project LCROSS Site Selection Workshop Oct , 2006 NASA/ARC, Mountain View, California LCROSS Orbital.
University of Toronto Department of Computer Science CSC444 Lec04- 1 Lecture 4: Software Lifecycles The Software Process Waterfall model Rapid Prototyping.
Mars By Chelsea Hoyle. Mars Exploration Mars is mainly explored by rovers. The rovers are called Spirit, curiosity and Opportunity. They were put on mars.
System Software Integration Testing Mars Polar Lander Steven Ford SYSM /05/12.
Today’s APODAPOD  Read NASA website:  solarsystem.nasa.gov solarsystem.nasa.gov  IN-CLASS QUIZ THIS FRIDAY!!  Solar Lab, Kirkwood, Rooftop this week.
High Altitude Scientific Ballooning (Near Space Engineering)
Engineering Means having to deal with failure Missions to Mars as an example of try, try, try again…
SPACE 1900-a scientist named Tsiolkovsky started testing rockets, Russia was responsible for his project. He is known as the Farther of Russian astronautics.
THE GREAT SPACE RACE TIMELINE a scientist named Tsiolkovsky started testing rockets he didn't construct a single rocket but designed hundreds.
In The News Genesis Mission We really need to know what the initial composition of the Solar System was and we'll get that by measuring the oxygen that.
NASA’s Goddard Space Flight Center LRO Operations Concept Richard Saylor Jr. HTSI/Code 444 August 16-17, 2005.
Filling Mars Human Exploration Strategic Knowledge Gaps with Next Generation Meteorological Instrumentation. S. Rafkin, Southwest Research Institute
1960: October 10 – Marsnik 1 (also known as Korabl 4 and Mars 1960A) Soviet Union's first attempt at a planetary probe. Launch failure. 1960: October.
ASTRONOMY 340 FALL October 2007 Class #9. Salient Martian Features  R Mars = 3396 km (R Earth = 6378 km)  Higher surface area to mass ratio 
The Solar System Chapter 6 COPY DOWN THE LEARNING GOALS ON PG SKIP 5 LINES BETWEEN EACH!
Early Spacecraft Exploration Viking  “The scientific goal of the Viking missions is to ‘increase our knowledge of the planet Mars with an emphasis on.
Mars Exploration Rovers (MER) Entry, Descent, Landing, and Deployment.
Mars Rovers By Taylor Sokolovic.
System Integration Testing Requirements Mars Polar Lander Steven Ford SYSM /11/12.
NASA’s Mars Climate Orbiter Mishap. References Official investigation report IEEE Spectrum investigation report Official report on project management.
Unit 5 Lesson 2 Technology for Space Exploration
What happened to the Mars Climate Explorer in September 1999? By Leslie Palomino.
Mars Exploration Rovers Entry, Descent, Landing and Deployment.
Florian Nichitiu a,*, James R. Drummond a, Jiansheng Zou a, Robert Deschambault b a) Department of Physics, University of Toronto, Canada,60 St. George.
Mars - The Red Planet Image Courtesy of NASA/JPL-Caltech.
Image Courtesy of NASA/JPL-Caltech.  Known to Babylonians 3,600 years ago as “Star that Wandered”  The Greeks referred to it as “Ares” the god of War.
Aquarius Mission Simulation A realistic simulation is essential for mission readiness preparations This requires the ability to produce realistic data,
AOS 100: Weather and Climate Instructor: Nick Bassill Class TA: Courtney Obergfell.
Interlude  Viking mission operations ended in the early 1980s  Viking missions gave scientists the most complete picture of Mars to date. What does this.
Interplanetary Lasers Joss Hawthorn, Jeremy Bailey, Andrew McGrath Anglo-Australian Observatory Free space optical communications.
Spacecraft Dimensions 1.06 meters (3.5 feet) tall by 3.6 meters (12 feet) wide. Spacecraft Weight Total: 576 kg (1,270 pounds) Lander: 290 kg (639 pounds)
WHAT ARE SATELLITES AND SPACE PROBES? Objective: Explain how artificial satellites and space probes are used to explore space. Key Terms: Satellite orbit.
1 Mars Exploration Rovers Spirit SOL-18 Anomaly: NASA IV&V Involvement April 2004.
Missions to Mars Julie A. Rathbun Lowell Observatory.
C.M. Rodrigue, 2016 Geography, CSULB Mars: History of Exploration IV Geography 441/541 S/16 Dr. Christine M. Rodrigue.
CSCI 3428: Software Engineering Tami Meredith Chapter 11 Maintaining the System.
Lunar CRater Observation and Sensing Satellite Project LCROSS Astronomer Workshop Feb. 29, 2008 NASA/ARC, Mountain View, California Mission Design & Observation.
Goal: To understand what we learned about Mars from the Viking probes Objectives: 1) To see what the Viking probes were able to see 2) To examine why they.
Mars - The Red Planet Image Courtesy of NASA/JPL-Caltech.
Failure of Process / Failure of Mission
RTOS Scheduling 2.0 Problems - Solutions
Failure of Process / Failure of Mission
Mars Rover By Benn Davis.
Unit 5 Lesson 2 Technology for Space Exploration
After School Astronomy Clubs Dorian Janney
After School Astronomy Clubs Dorian Janney
Curious about Curiosity?
Scale of things : Big and Small
Failure in the PATHFINDER Mission
Presentation transcript:

University of Toronto Department of Computer Science CSC444 Lec02 1 Lecture 2: Examples of Poor Engineering “Software Forensics” Case Studies Mars Pathfinder Mars Climate Observer Mars Polar Lander Deep Space 2 Some conclusions Reliable software has very little to do with writing good programs Humans make mistakes, but good engineering practice catches them! © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 2 NASA JPL’s Mars Program MissionLaunch DateArrival DateOutcome Viking I Viking II 20 Aug Sept 1975 Landed 20 Jul 1976 Landed 3 Sept 1976 Operated until 1982 Operated until 1980 Mars Observer 25 Sept 1992Last contact: 22 Aug 1993 Contact lost just before orbit insertion Pathfinder 4 Dec 1996Landed 4 July 1997 Operated until 27 Sept 1997 Global Surveyor 7 Nov 1996Orbit attained 12 Sept 1997 Still operational Climate Orbiter 11 Dec 1998Last contact: 23 Sept 1999 Contact lost just before orbit insertion Polar Lander 3 Jan 1999Last contact: 3 Dec 1999 Contact lost before descent Deep Space 2 3 Jan 1999Last contact: 3 Jan 1999 No data was ever retrieved Mars Odyssey 7 Apr 2001Arrived in orbit: Oct Still operating © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 3 Mars Pathfinder Mission Demonstrate new landing techniques parachute and airbags Take pictures Analyze soil samples Demonstrate mobile robot technology Sojourner Major success on all fronts Returned 2.3 billion bits of information 16,500 images from the Lander 550 images from the Rover 15 chemical analyses of rocks & soil Lots of weather data Both Lander and Rover outlived their design life Broke all records for number of hits on a website!!! © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 4 Remember these pictures? © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 5 Pathfinder had Software Errors Symptoms: software did total systems resents and some data was lost each time Symptoms noticed soon after Pathfinder started collecting meteorological data Cause 3 Process threads, with bus access via mutual exclusion locks (mutexs): High priority: Information Bus Manager Medium priority: Communications Task Low priority: Meteorological Data Gathering Task Priority Inversion: Low priority task gets mutex to transfer data to the bus High priority task blocked until mutex is released Medium priority task pre-empts low priority task Eventually a watchdog timer notices Bus Manager hasn’t run for some time… Factors Very hard to diagnose and hard to reproduce Need full tracing switched on to analyze what happened Was experienced a couple of times in pre-flight testing Never reproduced or explained, hence testers assumed it was a hardware glitch © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 6 Mars Climate Orbiter Launched 11 Dec 1998 Mission interplanetary weather satellite communications relay for Mars Polar Lander Fate Arrived 23 Sept 1999 No signal received after initial orbit insertion Cause Faulty navigation data caused by failure to convert imperial to metric units © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 7 Locus of error Ground software file called “Small Forces” gives thruster performance data This data used to process telemetry from the spacecraft Spacecraft signals each Angular Momentum Desaturation (AMD) maneuver Small Forces data used to compute effect on trajectory Software underestimated effect by factor of 4.45 Cause of error Small Forces Data given in Pounds-seconds (lbf-s) The specification called for Newton-seconds (N-s) Result of error As spacecraft approaches orbit insertion, trajectory is corrected Aimed for periapse of 226km on first orbit Estimates were adjusted as the spacecraft approached orbit insertion: 1 week prior: first periapse estimated at km 1 hour prior: this was down to 110km Minimum periapse considered survivable is 80km MCO entered Mars occultation 49 seconds earlier than predicted Signal was never regained after the predicted 21 minute occultation Subsequent analysis estimates first periapse of 57km Small Forces... © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 8 Mars To Earth TCM-4 Larger AMD ΔV’s Driving trajectory down relative to ecliptic plane Estimated trajectory and AMD ΔV’s Actual trajectory and AMD ΔV’s 226km 57km MCO Navigation Error Periapse © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 9 Contributing Factors For 4 months, AMD data not used due to file format errors Navigators calculated data by hand File format fixed by April 1999 Anomalies in trajectory became apparent almost immediately Limited ability to investigate: Thrust effects measured along line of sight using doppler shift AMD thrusts are mainly perpendicular to Earth- spacecraft line of sight Poor communication between teams: E.g. Issue tracking system not properly used by navigation team Anomalies not properly investigated Inadequate staffing Operations team monitoring three missions simultaneously (MGS, MCO and MPL) Operations Navigation team unfamiliar with spacecraft Different team from development & test Did not fully understand the significance of the anomalies Familiarity with previous mission (MGS) assumed sufficient: but AMD was performed times more often on MCO as it has asymmetric solar panels. Inadequate Testing Software Interface Spec not used during unit testing of small forces s/w End-to-end test of ground software never completed Ground software was not considered “mission critical” so less rigorous V&V Inadequate Reviews Key personnel missing from critical design reviews © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 10 Mars Polar Lander Launched 3 Jan 1999 Mission Land near South Pole Dig for water ice with a robotic arm Fate: Arrived 3 Dec 1999 No signal received after initial phase of descent Cause: Several candidate causes Most likely is premature engine shutdown due to noise on leg sensors © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 11 What happened? © 2001, Steve Easterbrook Lack of data hampers investigation spacecraft not designed to send telemetry during descent This decision severely criticized by review boards Possible causes: Lander failed to separate from cruise stage (plausible but unlikely) Landing site was too steep (plausible) Heatshield failed (plausible) Loss of control due to dynamic effects (plausible) Loss of control due to center-of-mass shift (plausible) Premature Shutdown of Descent Engines (most likely!) Parachute drapes over lander (plausible) Backshell hits lander (plausible but unlikely)

University of Toronto Department of Computer Science CSC444 Lec02 12 Cause of error Magnetic sensor on each leg senses touchdown Legs unfold at 1500m above surface transient signals on touchdown sensors during unfolding software accepts touchdown signals if they persist for 2 timeframes transient signals likely to be long enough on at least one leg Factors System requirement to ignore the transient signals But the software requirements did not describe the effect s/w designers didn’t understand the effect, so didn’t implement the requirement Engineers present at code inspection didn’t understand the effect Not caught in testing because: Unit testing didn’t include the transients Sensors improperly wired during integration tests (no touchdown detected!) Full test not repeated after re-wiring Result of error Engines shut down before spacecraft has landed When engine shutdown s/w enabled, flags indicated touchdown already occurred estimated at 40m above surface, travelling at 13 m/s estimated impact velocity 22m/s (spacecraft would not survive this) nominal touchdown velocity 2.4m/s Premature Shutdown Scenario © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 13 Deep Space 2 Launched 3 Jan 1999 Mission 2 small probes piggybacked on Mars Polar Lander Demonstration of new technology Separate from MPL 5 minutes before atmosphere entry Bury themselves in Martian Soil Return data on soil analysis and look for water Fate: No signals were received after launch Cause: Unknown (System was not ready for launch) © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 14 Mars Odyssey  Sept 12 and earlier  Original Caption Released with Image: Color differences in this daytime infrared image... represent differences in the mineral composition of the rocks, sediments and dust on the surface.

University of Toronto Department of Computer Science CSC444 Lec02 15 what went wrong?

University of Toronto Department of Computer Science CSC444 Lec02 16

University of Toronto Department of Computer Science CSC444 Lec02 17 Summary Failures can usually be traced to a single root cause But good engineering practice should prevent these causing system failure The real problems are failures of: testing and inspection process problem reporting and tracking lack of expertise inadequate resources, etc… In most cases, it takes a failure of both engineering practice and of management Reliable software depends not on flawless programs but on how good we are at: Communication (sharing information between teams) Management (of Resources and Risk) Verification and Validation Risk Identification and tracking Questioning assumptions © 2001, Steve Easterbrook

University of Toronto Department of Computer Science CSC444 Lec02 18 Mars Observer Project summary Brief summary of possible causes Mars Pathfinder Project info: Report on the priority inversion problem: Mars Climate Orbiter Project Info: Investigation Report: ftp://ftp.hq.nasa.gov/pub/pao/reports/2000/MCO_MIB_Report.pdf Mars Polar Lander & Deep Space 2 Project info: Investigation Reports: General Resources JPL’s list of missions (past, present and future) Basics of Space Flight: Resource List © 2001, Steve Easterbrook