System Safety: A systematic processes

Slides:



Advertisements
Similar presentations
Elements of an Effective Safety and Health Program
Advertisements

Risk Management Introduction Risk Management Fundamentals
Operational Risk Management (ORM)
OPERATIONAL RISK MANAGEMENT
RISK ANALYSIS.  Almost all of the things that we do involve risk of some kind, but it can sometimes be challenging to identify risk, let alone to prepare.
Action 1: Mission/task analysis Action 2: List Hazards Action 3: List Causes STEP 1 IDENTIFY THE HAZARD STEP 2 ASSESS THE RISK Action 1: Assess hazard.
Naval Safety Center School Of Aviation Safety. The process of detecting hazards and assessing associated risks The process of detecting hazards and assessing.
RISK MANAGEMENT FORCE XXI
Unit 6 – Risk Management and safety management system
Safety and Health Programs
The Australian/New Zealand Standard on Risk Management
Operational Risk Management for Airport Emergency Planning
Operational Risk Management
Lucas Phillips Anurag Nanajipuram FAILURE MODE AND EFFECT ANALYSIS.
Annex I: Methods & Tools prepared by some members of the ICH Q9 EWG for example only; not an official policy/guidance July 2006, slide 1 ICH Q9 QUALITY.
OPERATIONAL RISK MANAGEMENT (ORM)
Safety and Health Programs
EMPLOY THE RISK MANAGEMENT PROCESS DURING JOB PLANNING and EXECUTION
Risk Management at a Glance. Terms Hazard Hazard Risk Risk Probability Probability Severity Severity Estimating Estimating Exposure Exposure Risk Assessment.
Risk Analysis for Engineering Design J. M. McCarthy Fall 2003 Definitions Hazard Analysis Hazard Analysis Report Example for Mini Baja Nationally Recognized.
Risk Assessment – all risks approach
As of: 07 Apr 051 Using MIL-STD-882D: Approach for Identification and Elimination of Environmental Hazards or Reduction of Risks Associated with Environmental.
Chapter 11: Project Risk Management
Unit #4 Establishing Committee Expectations – Safety & Health Programs 1.
Risk Management - the process of identifying and controlling hazards to protect the force.  It’s five steps represent a logical thought process from.
Guidance Notes on the Investigation of Marine Incidents
WHAT IS SYSTEM SAFETY? The field of safety analysis in which systems are evaluated using a number of different techniques to improve safety. There are.
Safety and Health Programs 1. Benefits of Effective Safety and Health Programs Reduce work related injuries and illnesses Improve morale and productivity.
OSHAX.org - The Unofficial Guide To the OSHA1. Benefits of Effective Safety and Health Programs Reduce work related injuries and illnesses Improve morale.
DESIGNING FOR SAFETY CHAPTER 9. IMPORTANCE OF DESIGNING FOR SAFETY  In the near future, the level of safety that companies and industries achieve will.
OPERATIONAL RISK MANAGEMENT. The Benefits of Risk Management Reduction in Material and Property Damage. Effective Mission Accomplishment. Reduction in.
Safety Risk Assessment/ Acceptance Air Force Materiel Command Chuck Dorney, P.E., C.S.P. Chief, HQ AFMC/SES DSN I n t e g r i t y - S e r v i.
Operational Risk Management. CAP Approach  Top-down leader backing  Decentralized implementation  Moderate implementation tempo  Safety lead role.
OPERATIONAL RISK MANAGEMENT
Liability Issues for TRIO Programs Managing Your Project’s Risk.
Risk Benefit Risk Management SSG MACK.
OPERATIONAL RISK MANAGEMENT
1 Safety - definitions Accident - an unanticipated loss of life, injury, or other cost beyond a pre-determined threshhold.  If you expect it, it’s not.
Col. Francis S. Gabreski Squadron Col. Francis S. Gabreski Squadron Operational Risk Management.
Jacques Vanier ICAO EUR/NAT Regional Officer Almaty, 5 to 9 September 2005 SAFETY MANAGEMENT SYSTEMS RISK VERSUS SAFETY.
Rules for Supporting Part 803 and Part 806 Decision Making Page 1 Establishing Rules for: Medical Device Reports (803) & Correction and Removal Reports.
SOFTWARE PROJECT MANAGEMENT
System Safety: A systematic processes
How to Supervise Evaluate
Operational Risk Management Annual Briefing January 20, 2015 SWR-TX-435 David Lee "Tex" Hill Composite Squadron.
RISK MANAGEMENT YULVI. Introduction Time Quality Cost Project Constraints Success Introduction.
Information Technology Project Management Managing IT Project Risk.
National Corporate Training Pty Ltd0. Topics Follow safe work practices Maintain personal safety standards Assess risks Follow emergency procedures National.
CATASTROPHIC (I) Loss of ability to accomplish the mission or mission failure. Death or permanent total disability (accident risk). Loss of major or mission-critical.
Development, Validation, Implementation and Enhancement for a Voluntary Protection Programs Center of Excellence (VPP CX) Capability for the Department.
Department of Defense Voluntary Protection Programs Center of Excellence Development, Validation, Implementation and Enhancement for a Voluntary Protection.
Department of Defense Voluntary Protection Programs Center of Excellence Development, Validation, Implementation and Enhancement for a Voluntary Protection.
Department of Defense Voluntary Protection Programs Center of Excellence Development, Validation, Implementation and Enhancement for a Voluntary Protection.
Failure Modes, Effects and Criticality Analysis
Safety Management Systems Session Two Safety Risk Management APTA Webinar April 28, 2016.
SYSTEM SAFETY AND THE TECHNICAL AUTHOR
Civil Air Patrol BASIC LEVEL OPERATIONAL RISK MANAGEMENT
Air Carrier Continuing Analysis and Surveillance System (CASS)
GE 6757 TOTAL QUALITY MANAGEMENT
Risk Assessment Matrix
Operational Risk Management (ORM) and the Driving Task
Introduction to Risk Management.
Elements of an Effective Safety and Health Program
RISK MANAGEMENT Overview
Elements of an Effective Safety and Health Program
A New Concept for Laboratory Quality Management Systems
SELDOM (D) Remotely possible; could occur at some time
Presentation transcript:

System Safety: A systematic processes

Risk Assessment An evaluation of threats in terms of severity and probability 1. Identify the Hazards 4. Make Control Decisions 3. Analyze Risk Control Measures 6. Supervise and Review 2. Assess the Risks 5. Implement Risk Controls 6-1 1

MISSION FOCUS (HAZARD VERSUS RISK) Identifying and analyzing an existing or potential condition that can impair mission accomplishment (No discussion of mission significance) HAZARD ID & Analysis A hazard for which we have estimated the severity, probability, and scope with which it can impact our mission and accepted it RISK Assessment & Mgmt 3-14

Hazard Identification and Analysis during the Life Cycle of a system Concept Definition Development Production Deployment Termination

Threat assessment process ID Hazardous Condition Complete Risk Assessment Q/Q Assess Severity Q/Q Assess Probability 3-13

THE RISK ASSESSMENT MATRIX Probability Frequent Likely Occasional Seldom Unlikely A B C D E S E V R I T Y I Extremely Catastrophic High High Critical II High Medium III Moderate Low Negligible IV Risk Levels 3-20

A thorough risk assessment process might help you better understand a hazard you have been exposed to many times before without incident* * No beavers were assaulted in production of this slide

Hazard Severity What impact will this threat have on people? What impact on environment, equipment or facilities? What impact on mission? 3-16

Severity Categories A key factor in establishing a common understanding of a safety programs goal MIL-STD 882 uses four categories Cat 1: Catastrophic Cat 2: Critical Cat 3: Marginal Cat 4: Negligible

Severity Qualified CATASTROPHIC - Complete mission failure, death, or loss of system CRITICAL - Major mission degradation, severe injury, occupational illness, or major system damage MODERATE - Minor mission degradation, injury, minor occupational illness, or minor system damage NEGLIGIBLE - Less than minor mission degradation, injury, occupational illness or minor system damage 6-3 3

Severity Quantified CATASTROPHIC - Complete mission failure, death, or loss of system and/or costs exceeding $1B CRITICAL - Major mission degradation, severe injury, occupational illness, or major system damage and/ or costs exceeding $1M MODERATE - Minor mission degradation, injury, minor occupational illness, or minor system damage and/or costs exceeding $100,000 NEGLIGIBLE - Less than minor mission degradation, injury, occupational illness or minor system damage and/or costs exceeding $10,000 6-3 3

Probability Expressed in terms of time, occurrence, proximity, etc Use data to substantiate your assessment Use descriptive or quantitative terms Use the cumulative probability of all factors Examine experientially derived or anecdotal information from operators Acknowledge uncertainty – There are no guarantees

THE RISK ASSESSMENT MATRIX Probability Frequent Likely Occasional Seldom Unlikely A B C D E S E V R I T Y I Extremely Catastrophic High High Critical II High Medium III Moderate Low Negligible IV Risk Levels 3-20

Qualified Probability Categories FREQUENT Individual piece of equipment - Occurs often in the life of the system Individual - Occurs often in career Fleet or inventory - Continuously experienced All Personnel exposed - continuously experienced LIKELY Individual piece of equipment - Occurs several times in the life of the system Individual - Occurs several times in a career Fleet or Inventory - Occurs often All Personnel exposed - Occurs often OCCASIONAL Individual piece of equipment - Will occur in the life of the system Individual - Will occur in a career Fleet or Inventory - Occurs several times in the life of the system All Personnel exposed - Occurs sporadically 6-4 4

Qualified Probability (cont) SELDOM Individual piece of equipment - Could occur in the life of the system Individual person - Could occur in a career Fleet or Inventory - Can be expected to occur in the life of the system All Personnel exposed - Seldom occurs UNLIKELY Individual piece of equipment - You assume it will not occur in the system lifecycle Individual person - So unlikely you assume it will not occur in a career Fleet or Inventory - Unlikely but could occur in the life of the system All Personnel exposed - Occurs very rarely 6-5 5

Probabilities Quantified (In terms of failure or exposure rates) Unlikely: 1 failure in 1,000,000,000 events instead of assuming it will not occur Seldom: 1 failure in 500 million exposures instead of it could occur Occasional: 1 failure in 1 million exposures instead of it will occur Likely: 1 failure in 500,000 exposures instead of it occurs several times Frequent:1 failure in 100,000 events instead of it occurs often

Qualitative Assessment AC 25.1309-1A Design Appraisal Installation Appraisal Failure Modes and Effects Analysis Fault Tree Analysis Probability Assessment

Quantitative Assessment AC 25.1309-1A Probability Analysis (PRA) Quantitative Probability Terms (QRA)

FAA Fail-Safe Design Concept AC 25.1309-1A The fail-safe design concept considers the effects of failures and combinations of failures in defining a safe design The following basic objectives apply: In any system or subsystem, the failure of any single element, component, or connection during any one flight should be assumed. Such single failure should not prevent continued safe flight and landing Subsequent failures during the same flight, whether detected or latent, should also be assumed unless their joint probability with the first failure is demonstrated to be extremely improbable

Fail-Safe Design Concept Fail-Safe designs use the following design principals – A combination of two or more are usually needed to provide a fail-safe design Redundant or backup systems Isolation of systems, components and elements Demonstrated reliability / Periodic inspection Failure warning and indication Flight crew procedures Designed failure effect limits Designed failure path Increased margins or factors of safety Error-tolerant design

Operational and Maintenance Considerations AC 25.1309-1A Flight crew action Ground crew action Certification check requirements Flight with inoperative equipment

Quantifying or Qualifying Risk? Remember Murphy’s Law for Management: “Technology is dominated by those who manage what they don’t understand”

Risk Acceptance Codes RAC 1 – Unacceptable RAC 2 – Undesirable RAC 3 – Acceptable with controls RAC 4 - Acceptable

Risk Assessment Shortcomings Deficiencies in RACs represent one of the major problems facing the system safety effort Quantitative severity and probabilities scales in most RAC matrices are too subjective The RAC is a main driver of system safety efforts This code prioritizes the management emphasis given to a particular problem

THE “ENHANCED” RISK ASSESSMENT MATRIX - Numeric Code is used to prioritize hazards and determine their acceptability using a quantitative methodology Probability Frequent Likely Occasional Seldom Unlikely A B C D E S E V R I T Y I 1 2 6 8 12 3 4 7 11 15 5 9 10 14 16 13 17 18 19 20 Catastrophic II Critical III Moderate Negligible IV Risk Levels 6-7 7

THE RISK PRIORITY LIST Highest Risk By ranking the hazards, we address them on a “worst-first” basis Safety dedicated resources are always limited and should be directed at the highest risk Lowest Risk Warranting action 3-21

ASSESSMENT CHALLENGES Over optimism Over pessimism Misrepresentation/Misunderstanding Alarmism / “Accident du Jour” Indiscrimination Bias Inaccuracy

Total Risk Exposure Codes Expanded scale Probability expressed in Exposure Severity expressed in Cost Combined determination expressed in quantifiable terms $$$$* (Now you are talking a language the bean counters understand)

Verification & Validation Quality of data establishes process credibility Avoid GIGO syndrome Verify and Validate initial estimates with updated data Failure rates Exposure rates Project lifecycle changes Number of units in the system

THE PRIORITY LIST What does it accomplish? Traditional Risk Management - Personnel can’t name or prioritize hazards -- can only identify general threats ORM - Personnel can name and prioritize RISKS that impact them and their mission In a mature “NORMal” world, every individual personally benefits by adapting the knowledge of prioritized hazards that exist in their life -- (Due diligence is demonstrated when managers see that their subordinates possess this knowledge) 3-22

System Safety Precedence A systematic approach to Hazard ID – Risk Assess and Control Design to minimize hazards Robust & Redundant systems, assemblies, components, etc Install physical barriers Isolate known threatening conditions or environments Use Warning devices Alerts to prevent or reduce unwanted event Develop Procedures and Training Most commonly used & abused hazard control High quality energy sources Low MTBF hardware Failures that do occur are not catastrophic No single point critical items allowed Physically isolate harmful environments or conditions Passive barriers tend to be more effective than active ones Airbag versus seatbelt example Warning devices are not positive protection Alert may be incremental, i.e., time to prevent damage versus time to escape disaster Lowest level of control relies on training and SOPs Quickest and cheapest to implement Least effective Most commonly used control

Risk Analysis 6. Supervise 1. Identify and Review the Hazards 2. Assess the Risks 3. Analyze Risk Control Measures 4. Make Control Decisions 5. Implement Risk Controls 6. Supervise and Review

Assessing Risk Controls Identify control options Determine control effects Prioritize risk control measures

2 Major Risk Control Approaches Employ Macro Risk Control Option(s) Reject – Avoid – Delay –Transfer –Spread – Compensate – Reduce Implement System Safety Precedence Control Option(s) Engineer – Guard – Improve Design – Limit Exposure – Personnel Selection – Train – Warn – Motivate – Reduce Effect - Rehabilitate

“Swiss Cheese” Model of Defenses Hazards The ideal The reality Potential losses (people and assets) James Reason: “Managing the Risks of Organizational Accidents”

“Swiss Cheese” Model of Defenses Some ‘holes’ due to active failures Defenses in depth Other ‘holes’ due to latent conditions James Reason: “Managing the Risks of Organizational Accidents”

Macro Options REJECT AVOID DELAY TRANSFER Risk outweighs benefit AVOID Go around the risk, do it in a different way DELAY Maybe the problem will be resolved by time If delay is an acceptable option consider if operation is needed at all TRANSFER Better qualified system, i.e.,“Pro’s From Dover”

Macro Options (cont) SPREAD COMPENSATE REDUCE Modular or separate Hazardous Operations COMPENSATE Design parallel and redundant systems REDUCE Design for minimum risk Incorporate Safety Devices Provide Warning Devices Develop SOPs & Train

The Risk Control Macro Option List Reject Avoid Delay Transfer Spread Compensate Reduce QUESTION: Why isn’t eliminate on this list?

Determine Risk Control Effects How will this effect probability? How will this effect severity? How will this impact other sub-systems? Some controls support other sub-systems Some controls may hinder other sub-systems What are the costs vs. benefits? Direct Costs Indirect Costs

Direct vs. Indirect Costs “As a rule of thumb, it is generally acceptable to calculate indirect costs of a mishap to be 7 times greater than those costs which can directly be accounted for in the incident or accident”

Risk Control ROT’s Use the System Safety Precedence order Choose the most mission supportive combinations Use Integrated Product Teams Look for synergistic enhancements Man – Machine – Medium – Mission - Management

Use the 5 M model as you look for systemic issues Man: Doesn’t know Doesn’t care Can’t physically accomplish Machine: Poor design Faulty maintenance SOP’s

5 M systemic issues (cont) Medium Weak design considerations Lack of provisions for natural “phenomena” Management: Inadequate procedures Inadequate policy Inadequate standards & controls Mission: Poorly thought out Poorly executed Weak understanding Incompatibilities

Providing Management Risk Control Options Program Manager looking for optimum combinations Mission supportive Some Risk Controls are incompatible Evaluate full cost versus full benefit Be prepared for numbers game Some Controls reinforce one another Win-Win option Redundancy = Robustness Is it needed? Can you afford it? i.e., $$$, #’s, real estate

Aid to Decision Making Be prepared to assist decisions at the right time Don’t rush – Make them as late as possible without negative impact on timeline Insure decisions are made at the right level It should be establish who makes the tough calls Use RAC or TREC to quantify who, what, when Provide Mission supportive options Use the Macro Option list as a starting point Be prepared to offer sound advice

Don’t be one who says, “ …data or information was not available and our department could not prove it was unsafe to allow the operation.”