Adaptive Systems – Graceful Degrading System
Paul Li

Fundamental Concept of Dependability
Failure is an event that occurs when delivered service deviates from correct service. (Quality of service?)
Design faults? Physical faults? Or interaction faults?

Fundamental Concept of Dependability
[Diagram labels: Natural, Accidental (Acc), Permanent (Perm), Transient (Trans); Environment/System Flux]

System-wide Graceful Degradation (Implicit Adaptation?)
[Diagram: software components, each with Detection/Monitoring; System Variables; I/M; O/A; Updated System Vars; System Updates]
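The slide only names the boxes in the diagram, so the following is a minimal sketch of one possible reading, not the presenter's design: each component's monitor writes its health into shared system variables, and a system-wide policy maps those variables to a service level. The names `Component`, `probe`, `service_level`, and `run_cycle`, and the three service levels, are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Component:
    """A software component with its own detection/monitoring hook."""
    name: str
    probe: Callable[[], bool]  # hypothetical health check; True means healthy

    def monitor(self, system_vars: Dict[str, bool]) -> None:
        # Publish this component's health into the shared system variables.
        system_vars[self.name] = self.probe()


def service_level(system_vars: Dict[str, bool]) -> str:
    """Map the shared system variables to a (hypothetical) service level."""
    total = len(system_vars)
    healthy = sum(system_vars.values())
    if total == 0 or healthy == 0:
        return "offline"
    if healthy == total:
        return "full"
    # Some components failed: keep serving with reduced quality.
    return "degraded"


def run_cycle(components: List[Component], system_vars: Dict[str, bool]) -> str:
    """One monitoring pass: update the system variables, then pick a level."""
    for component in components:
        component.monitor(system_vars)
    return service_level(system_vars)
```

In this sketch the adaptation is implicit in the sense that no component is told to degrade; the service level simply follows from the updated system variables.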

Scalable System-wide Graceful Degradation
[Diagram: software components, each with Detection/Monitoring; System Variables; I/M; O/A; Sub-System]

Self Healing Problem Space
Rainbow
Permanent/Transient
Inadequate System Configuration
Stress on the System
Configuration in Distributed Systems
Random, arbitrary, pre-defined
Gauges
Less than desired quality of service
Perform associated repair
On-line Reconfiguration
Long time between occurrences
Quality of service
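The entries above describe a loop in which gauges detect less-than-desired quality of service and an associated repair is performed by on-line reconfiguration. A minimal sketch of that loop follows; it is an illustration under stated assumptions and does not reproduce Rainbow's actual interfaces. The function `check_and_repair` and its three dictionaries are hypothetical names.

```python
from typing import Callable, Dict, List


def check_and_repair(
    gauges: Dict[str, Callable[[], float]],   # hypothetical: name -> QoS reading
    desired: Dict[str, float],                # hypothetical: name -> minimum QoS
    repairs: Dict[str, Callable[[], None]],   # hypothetical: name -> repair action
) -> List[str]:
    """Run one monitoring pass and return the gauges whose repair fired."""
    fired = []
    for name, read in gauges.items():
        # A reading below the desired level counts as a QoS violation.
        if read() < desired[name]:
            repairs[name]()  # perform the repair associated with this gauge
            fired.append(name)
    return fired
```

For example, a gauge could report throughput as a function of the number of servers, and the associated repair could add a server to the running configuration; repeated passes would then reconfigure the system until the reading meets the desired level.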

Self Healing Problem Space
Open Systems with defined interfaces
Assumed to be complete
System knows component presence
Data rates
System fluctuates during operation
Elements of distributed systems
Heterogeneous components and resources
Dynamic behavior within expectations
Semi-automated
System quality of service
Scalability assumes linearity
Many clients and servers in distributed systems