Application of Fault Injection to Globus Grid Middleware Nik Looker & Jie Xu University of Leeds, Leeds. LS2 9JT, UK Tianyu Wo & Jinpeng Huai Beihang University,

Presentation transcript:

Application of Fault Injection to Globus Grid Middleware
Nik Looker & Jie Xu, University of Leeds, Leeds, LS2 9JT, UK
Tianyu Wo & Jinpeng Huai, Beihang University, Beijing, PRC
School of Computing, Faculty of Engineering

A Historical Perspective

Dependability & Security
To understand dependability it is important to understand the three main concepts that it utilises:
  Attributes: measurements of how dependable and secure a system is
  Threats: things that may affect the dependability and security of a system
  Means: ways of increasing the dependability and security of a system

Attributes
  Availability: the probability that a service is present and ready for use
  Reliability: the capability of maintaining the service and service quality
  Safety: the absence of catastrophic consequences
  Confidentiality: information is accessible only to those authorised to use it
  Integrity: the absence of improper system alterations
  Maintainability: the ability to undergo modifications and repairs

Threats
  Fault: a fault is a defect in a system
  Error: an error is a discrepancy between the behaviour of a system and its specified behaviour within the system boundary, i.e. it enters an unspecified state
  Failure: a failure is an instance in time when a system displays behaviour that is contrary to its specification at the system boundary

Fault-Error-Failure Chains
As a general rule:
  A fault, when activated, can lead to an error
  An error is an invalid state
  An invalid state generated by an error may lead to either another error or a failure
  A generated error can be treated as another fault
  A failure is an observable deviation from the specified behaviour at the system boundary
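The chain above can be illustrated with a small sketch. Everything in it is hypothetical and not taken from the slides: the `Counter` class, its `LIMIT`, and the assumed specification that `read()` must never return a value above `LIMIT`. The defect is dormant for the first three calls, is activated by the fourth, and then either surfaces as a failure at the boundary or is caught by a simple check.

```python
class Counter:
    """Hypothetical component. Assumed specification: read() never exceeds LIMIT."""

    LIMIT = 3

    def __init__(self):
        self.count = 0

    def increment(self):
        # FAULT: the comparison should be `<`. The defect stays dormant
        # until count reaches LIMIT, then lets the state go one too high.
        if self.count <= self.LIMIT:
            self.count += 1

    def in_error(self):
        # ERROR: an invalid internal state, not yet visible from outside.
        return self.count > self.LIMIT

    def read(self, check=False):
        # With check=True the invalid state is detected and repaired
        # before it reaches the system boundary (simple fault tolerance).
        if check and self.in_error():
            self.count = self.LIMIT
        # Without the check, an invalid state becomes a FAILURE here.
        return self.count


unchecked = Counter()
for _ in range(4):
    unchecked.increment()

checked = Counter()
for _ in range(4):
    checked.increment()

print(unchecked.in_error())      # the fault was activated and produced an error
print(unchecked.read())          # the error is observable at the boundary
print(checked.read(check=True))  # the error is masked before the boundary
```

The same fault gives two different outcomes depending on whether anything breaks the chain between the error and the boundary, which is what the dependability means aim to do.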

Means
Dependability means are ways of breaking fault-error-failure chains.
Four main classifications:
  Fault Prevention
  Fault Removal
  Fault Forecasting
  Fault Tolerance

Fault Injection
MTBF (mean time between failures) may be very large
Attempt to speed up this process by injecting faults
Cause the execution of seldom used control pathways within a system
Either:
  A failure may occur
  System’s fault tolerance mechanism will handle the fault
  or the failure will go undetected and uncorrected :-(
Network Level Fault Injection
  Corrupt
  Drop
  Reorder
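The three network level operations listed above can be sketched on an in-memory list of packets. This is a minimal illustration only: the function names, the byte-flipping corruption, and the sample `stream` are assumptions, and a real injector would act on live traffic rather than a list.

```python
import random


def corrupt(packets, index, rng):
    """Return a copy of packets with one byte of packets[index] inverted."""
    out = list(packets)
    data = bytearray(out[index])
    position = rng.randrange(len(data))
    data[position] ^= 0xFF  # inverting every bit always changes the byte
    out[index] = bytes(data)
    return out


def drop(packets, index):
    """Return a copy of packets with packets[index] removed."""
    return packets[:index] + packets[index + 1:]


def reorder(packets, i, j):
    """Return a copy of packets with positions i and j swapped."""
    out = list(packets)
    out[i], out[j] = out[j], out[i]
    return out


rng = random.Random(0)  # seeded so that a run can be repeated
stream = [b"HEAD", b"BODY", b"TAIL"]  # hypothetical packet payloads

corrupted = corrupt(stream, 1, rng)
dropped = drop(stream, 1)
reordered = reorder(stream, 0, 2)
```

Each function returns a new list and leaves `stream` untouched, so the faulty sequence can be compared against the original when checking how the system under test reacted.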

Network Level Fault Injection

Modified Network Level Fault Injection
This allows a fault injector to intercept an entire middleware message, and thus we can decode it and modify specific parts of it.
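A rough sketch of the decode-and-modify step follows, assuming the intercepted message is a SOAP envelope. The message content, the `getTemperatureResponse` and `value` element names, and the `inject_fault` helper are all hypothetical; this is not the WS-FIT or Grid-FIT implementation, only an illustration of targeting one field instead of corrupting raw bytes.

```python
import xml.etree.ElementTree as ET

# Hypothetical intercepted message; the payload element names are invented.
MESSAGE = (
    '<soap:Envelope xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/">'
    "<soap:Body><getTemperatureResponse><value>21.5</value>"
    "</getTemperatureResponse></soap:Body></soap:Envelope>"
)


def inject_fault(message, tag, new_text):
    """Decode the whole message, overwrite every element named tag, re-encode."""
    root = ET.fromstring(message)
    for element in root.iter(tag):
        element.text = new_text
    return ET.tostring(root, encoding="unicode")


faulty = inject_fault(MESSAGE, "value", "-999.0")
print(faulty)
```

Because the message is decoded first, the result is still well-formed XML and only the chosen field carries the fault, so the receiver gets past parsing and has to deal with the bad value itself.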

Grid-FIT

Injecting Faults in a Production Environment

System Model

Extended Fault Model

Extended Failure Model

Failure Detection

Application to Globus
Initial experiments were based around Web Services
  This resulted in the WS-FIT tool (Web Service - Fault Injection Technology)
Ultimate aim was to apply this method to Grids
  This has resulted in the Grid-FIT tool
Modifications and initial experiments have been conducted
  Modified hooks to work with Globus
  Adapted FIT decoding to Globus message structure
  Repeated an earlier set of experiments rewritten for Globus 4

Test Case

Results

Future Work
Apply Grid-FIT to complex systems
CoLaB
  Short for Collaboration of Leeds and Beihang, a joint laboratory founded by Beihang University, PRC, and the University of Leeds, UK, in [...].
  The primary mission of CoLaB is research in Software and Security, each linked through a common objective: to support the needs of the next generation of Internet computing.
CROWN
  Short for China Research and Development environment Over Wide-area Network, a grid test bed to facilitate scientific activities in different disciplines.
We are currently working on integrating Grid-FIT with CROWN
  This will give Grid-FIT a large test bed to refine its method and models
  This will give CROWN a native dependability assessment method
  Part of the integration will be to integrate Grid-FIT as an Eclipse plug-in

Demonstrations & Workshop
Demonstrations
  Venue: White Rose Grid Stall
  Wednesday 20th September 13:45 – 14:30
  Thursday 21st September 10: :45
  CROWN: Tianyu Wo
  FT-Grid: Paul
  Grid-FIT: Nik
Mini-Workshop on UK-China e-Science Collaborations
  Venue: Conference Room 1
  Wednesday 20th September 17: :00