Disaster Tolerant Computing and Communications Systems Mitch Thornton Steve Szygenda.

Slides:



Advertisements
Similar presentations
Trust Management of Services in Cloud Environments:
Advertisements

PLOTTING PHASE PORTRAITS WITH MATLAB:
Marzieh Parandehgheibi
COE 444 – Internetwork Design & Management Dr. Marwan Abu-Amara Computer Engineering Department King Fahd University of Petroleum and Minerals.
11. Practical fault-tolerant system design Reliable System Design 2005 by: Amir M. Rahmani.
Normal Accidents: Living with High-Risk Technologies Minho Jeung Trinity Team 12/06/2005.
Optimal redundancy allocation for information technology disaster recovery in the network economy Benjamin B.M. Shao IEEE Transaction on Dependable and.
Software Fault Injection for Survivability Jeffrey M. Voas & Anup K. Ghosh Presented by Alison Teoh.
Chap 1: Overview Concepts of CIA: confidentiality, integrity, and availability Confidentiality: concealment of information –The need arises from sensitive.
Cascading failures in interdependent networks and financial systems -- Departmental Seminar Xuqing Huang Advisor: Prof. H. Eugene Stanley Collaborators:
Dynamic adaptation of parallel codes Toward self-adaptable components for the Grid Françoise André, Jérémy Buisson & Jean-Louis Pazat IRISA / INSA de Rennes.
Analysing Systems Failures (1) Main Principles: systems thinking.
Client/Server Databases and the Oracle 10g Relational Database
Adaptive Infrastructures EPRI/DoD Initiative on Complex Interactive Networks/Systems Joint innovative research ·EPRI and ·Office of the Director of Defense.
Structural Stability, Catastrophe Theory, and Applied Mathematics
Fault Tolerant Control F Crusca and M Aldeen. Outline Definition of problem Modelling Fault detection filters Fault tolerant control systems Example.
CE 579: STRUCTRAL STABILITY AND DESIGN
A Concept of Environmental Forecasting and Variational Organization of Modeling Technology Vladimir Penenko Institute of Computational Mathematics and.
Contributed Talk at NetSci 2007
ABCSG - Dependable Systems - 01/06/ ABCSG Dependable Systems.
SENG521 (Fall SENG 521 Software Reliability & Testing Defining Necessary Reliability (Part 3b) Department of Electrical & Computer.
RISK IDENTIFICATION TOOL FOR ICT IN INTERNATIONAL DEVELOPMENT CO-OPERATION PROJECTS JOY PAMELA AZONOBI Tallinn 2014.
John Graham – STRATEGIC Information Group Steve Lamb - QAD Disaster Recovery Planning MMUG Spring 2013 March 19, 2013 Cleveland, OH 03/19/2013MMUG Cleveland.
Software Dependability CIS 376 Bruce R. Maxim UM-Dearborn.
Software faults & reliability Presented by: Presented by: Pooja Jain Pooja Jain.
Whitacre College of Engineering Panel Interdisciplinary Cybersecurity Education Texas Tech University NSF-SFS Workshop on Educational Initiatives in Cybersecurity.
The Computer for the 21 st Century Mark Weiser – XEROX PARC Presented By: Mihail Ionescu.
Distributed Control of FACTS Devices Using a Transportation Model Bruce McMillin Computer Science Mariesa Crow Electrical and Computer Engineering University.
Chapter 1 Introduction to Simulation
Cognitive Task Analysis and its Application to Restoring System Security by Robin Podmore, IncSys Frank Greitzer, PNNL.
Secure Systems Research Group - FAU 1 A survey of dependability patterns Ingrid Buckley and Eduardo B. Fernandez Dept. of Computer Science and Engineering.
Xiao Liu CS3 -- Centre for Complex Software Systems and Services Swinburne University of Technology, Australia Key Research Issues in.
“To Colo or not to Colo” Choosing the Right Solution a Critical Facilities Round Table presentation October 17,
Quality by Design (QbD) Myth : An expensive development tool ! Fact : A tool that makes product development and commercial scale manufacturing simple !
Element 1 Factors affecting quality and reliability of electronic products 1.1Factors affecting quality are identified, their causes described, and methods.
Secure Systems Research Group - FAU 1 Active Replication Pattern Ingrid Buckley Dept. of Computer Science and Engineering Florida Atlantic University Boca.
Adaptive control and process systems. Design and methods and control strategies 1.
1 Computer Networking Dr. Mohammad Alhihi Communication and Electronic Engineering Department Philadelphia University Faculty of Engineering.
CprE 458/558: Real-Time Systems
5 May CmpE 516 Fault Tolerant Scheduling in Multiprocessor Systems Betül Demiröz.
April 28, 2003 Early Fault Detection and Failure Prediction in Large Software Systems Felix Salfner and Miroslaw Malek Department of Computer Science Humboldt.
CS 505: Thu D. Nguyen Rutgers University, Spring CS 505: Computer Structures Fault Tolerance Thu D. Nguyen Spring 2005 Computer Science Rutgers.
Biocomplexity Teacher Workshop May 31 – June 2, 2008 University of Puerto Rico.
Fault Tolerance Benchmarking. 2 Owerview What is Benchmarking? What is Dependability? What is Dependability Benchmarking? What is the relation between.
A Fault Tolerant Control Approach to Three Dimensional Magnetic Levitation By James Ballard.
Maximizing Lifetime per Unit Cost in Wireless Sensor Networks
1 Fault-Tolerant Computing Systems #1 Introduction Pattara Leelaprute Computer Engineering Department Kasetsart University
1 INTRUSION TOLERANT SYSTEMS WORKSHOP Phoenix, AZ 4 August 1999 Jaynarayan H. Lala ITS Program Manager.
EEC 688/788 Secure and Dependable Computing Lecture 6 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Strategies and Rubrics for Teaching Chaos and Complex Systems Theories as Elaborating, Self-Organizing, and Fractionating Evolutionary Systems Fichter,
On Hierarchical Design of Computer Systems for Critical Applications Peter Gabriel Neumann Presented by Bo Cui.
AUTOMATIC CONTROL THEORY II Slovak University of Technology Faculty of Material Science and Technology in Trnava.
Science and Engineering Practices K–2 Condensed Practices3–5 Condensed Practices6–8 Condensed Practices9–12 Condensed Practices Developing and Using Models.
Topic: Reliability and Integrity. Reliability refers to the operation of hardware, the design of software, the accuracy of data or the correspondence.
Investigate Plan Design Create Evaluate (Test it to objective evaluation at each stage of the design cycle) state – describe - explain the problem some.
Introduction to emulators Tony O’Hagan University of Sheffield.
References: Supply Chain Saves the World. Boston, MA: AMR Research (2006); Designing and Managing the Supply Chain – Concepts, Strategies and Case Studies;
ON “SOFTWARE ENGINEERING” SUBJECT TOPIC “RISK ANALYSIS AND MANAGEMENT” MASTER OF COMPUTER APPLICATION (5th Semester) Presented by: ANOOP GANGWAR SRMSCET,
SEC 480 assist Expect Success/sec480assistdotcom FOR MORE CLASSES VISIT
Dr. Gerry Firmansyah CID Business Continuity and Disaster Recovery Planning for IT (W-I)
SEMINAR PRESENATATION ON WIDEAREA BLACKOUT (AN ELECTRICAL DISASTER) BY:Madhusmita Mohanty Electrical Engineering 7TH Semester Regd No
Risk Assessment.
Large Distributed Systems
Fault Tolerance & Reliability CDA 5140 Spring 2006
Fault Injection: A Method for Validating Fault-tolerant System
Computational Elements of Robust Civil Infrastructure
Fault Tolerance Distributed Web-based Systems
EEC 688/788 Secure and Dependable Computing
Overview of Control System
An Original Model of Infrastructure System Resilience
Presentation transcript:

Disaster Tolerant Computing and Communications Systems Mitch Thornton Steve Szygenda

Outline Definitions Motivation Modeling Approaches Conclusion/Future Work

Motivation Many Systems Vulnerable to Disasters Cannot Use Principle of Redundancy Fault Tolerance Models May Not be Applicable to Disasters Disaster Tolerance is Crucial for Security and Infrastructure Robustness

Disaster Definition Disaster: an event that can cause a system-wide malfunction as a result of one or more failures within a system. Disasters may occur due to a single-point failure or by a plurality of single-point failures that occur either simultaneously, or nearly simultaneously in a temporal sense and may be caused from either a man-made or natural event.

Catastrophe Definition Catastrophe: an event that can happen as the result of the occurrence of a disaster and cause a system to function improperly. Catastrophe avoidance is the goal of disaster tolerance.

Related Concepts Disaster Recovery –Ability to Resume Normal Operation After Occurrence of Disaster Disaster Avoidance –System Incorporates Techniques that Prevent System-wide Failure Fault Tolerance –Mechanisms/Techniques in a System to Allow Functionality in the Presence of a Fault –Fault Model Used to Determine Fault Tolerant Mechanisms Disaster Tolerance –Mechanisms/Techniques in a System to Allow Functionality in the Presence of a Disaster –Disaster Model Used to Determine Disaster Tolerant Mechanisms

Disaster Models Chaos Theory –Applicable to Large Complex Non-linear Dynamic Systems with Feedback –Can Model Sudden Transitions in Dynamic Systems –A Disaster is a Sudden Transition in a Dynamic System –Successfully Used to Model Dynamic Interactions of Power Generators on Electric Grid in 1982

Two Generator Fractal* *James Thorp, Cornell University Each Point: Phase Angle wrt Reference Gen. Light Blue: Grid Stability Other Colors: Grid Vulnerable or Unstable Small Changes: Can Cause Sys. Instability (Disaster)

Disaster Models Catastrophe Theory –Theory in Vogue in 50’s (Rene Thom) –System is Modeled as Multidimensional Smooth Surface Affected by Classes of Singularities –A Disaster is Modeled as a Singularity that Occurs in Normal System Operation –Example is the Use of Catastrophe Theory to Predict the Formation of a Planet when Two Stars Stray Close Together and Gas is Pulled from One to the Other

Conclusions Disasters, Catastrophes Defined as Events Disaster Models Needed to Provide Disaster Tolerance Candidate Mathematical Tools Defined for Disaster Models

Proposed Future Work Choose Candidate Large System with Disaster Data Formulate and Calibrate Disaster Model Apply Disaster Model to Other Large Systems to Identify Critical Components Model Systems with More Robust Critical Components (Redundancy?) and Apply Disaster Model