WPDRTS ’05 1 Workshop on Parallel and Distributed Real-Time Systems 2005 April 4th and 5th, 2005, Denver, Colorado Challenge Problem Session Detection.

Slides:

Advertisements

Similar presentations

Motorola General Business Use MOTOROLA and the Stylized M Logo are registered in the US Patent & Trademark Office. All other product or service names are.

Advertisements

IBM SMB Software Group ® ibm.com/software/smb Maintain Hardware Platform Health An IT Services Management Infrastructure Solution.

Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.

OSI Model OSI MODEL.

CS 795 – Spring  “Software Systems are increasingly Situated in dynamic, mission critical settings ◦ Operational profile is dynamic, and depends.

Business Continuity and DR, A Practical Implementation Mich Talebzadeh, Consultant, Deutsche Bank

Chapter 13 Embedded Systems

Service Layer Session Management Group Name: WG2-ARC Source: IDCC, LGE, ZTE Meeting Date: TP16 Agenda Item:

Ashish Gupta Under Guidance of Prof. B.N. Jain Department of Computer Science and Engineering Advanced Networking Laboratory.

11/14/05ELEC Fall Multi-processor SoCs Yijing Chen.

Software Engineering and Middleware: a Roadmap by Wolfgang Emmerich Ebru Dincel Sahitya Gupta.

Chapter 13 Embedded Systems

REAL-TIME SOFTWARE SYSTEMS DEVELOPMENT Instructor: Dr. Hany H. Ammar Dept. of Computer Science and Electrical Engineering, WVU.

Managing Agent Platforms with the Simple Network Management Protocol Brian Remick Thesis Defense June 26, 2015.

Real-Time Kernels and Operating Systems. Operating System: Software that coordinates multiple tasks in processor, including peripheral interfacing Types.

By Group: Ghassan Abdo Rayyashi Anas to’meh Supervised by Dr. Lo’ai Tawalbeh.

CprE 458/558: Real-Time Systems

Misconceptions About Real-time Computing : A Serious Problem for Next-generation Systems J. A. Stankovic, Misconceptions about Real-Time Computing: A Serious.

EMBEDDED SOFTWARE Team victorious Team Victorious.

Diameter Agent Overload IETF 88 - Vancouver 1. Goal Get consensus from the working group that Agent overload needs to be addressed If so, get guidance.

IBM Research – Thomas J Watson Research Center | March 2006 © 2006 IBM Corporation Events and workflow – BPM Systems Event Application symposium Parallel.

Large Enterprise with Multiple Locations Source: Avaya.

Computer System Architectures Computer System Software

Chapter 6 System Engineering - Computer-based system - System engineering process - “Business process” engineering - Product engineering (Source: Pressman,

REAL-TIME SOFTWARE SYSTEMS DEVELOPMENT Instructor: Dr. Hany H. Ammar Dept. of Computer Science and Electrical Engineering, WVU.

GrIDS -- A Graph Based Intrusion Detection System For Large Networks Paper by S. Staniford-Chen et. al.

Tufts Wireless Laboratory School Of Engineering Tufts University “Network QoS Management in Cyber-Physical Systems” Nicole Ng 9/16/20151 by Feng Xia, Longhua.

Version 4.0. Objectives Describe how networks impact our daily lives. Describe the role of data networking in the human network. Identify the key components.

Multicore In Real-Time Systems – Temporal Isolation Challenges Due To Shared Resources Ondřej Kotaba, Jan Nowotsch, Michael Paulitsch, Stefan.

1 06/00 Questions 10/6/2015 QoS in DOS ECOOP 2000John Zinky BBN Technologies ECOOP 2000 Workshop on Quality of Service in Distributed Object Systems

Introduction to Telecommunications Babak Esfandiari (with material by Roger Impey and Qusay Mahmoud)

LOGO Service and network administration Storage Virtualization.

CS4730 Real-Time Systems and Modeling Fall 2010 José M. Garrido Department of Computer Science & Information Systems Kennesaw State University.

© Oxford University Press 2011 DISTRIBUTED COMPUTING Sunita Mahajan Sunita Mahajan, Principal, Institute of Computer Science, MET League of Colleges, Mumbai.

Architectures of distributed systems Fundamental Models

©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 10Slide 1 Architectural Design l Establishing the overall structure of a software system.

Distributed Systems and Algorithms Sukumar Ghosh University of Iowa Spring 2011.

Real-Time CORBA By Christopher Bolduc. What is Real-Time? Real-time computing is the study of hardware and software systems that are subject to a “real-

Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.

REAL-TIME SOFTWARE SYSTEMS DEVELOPMENT Instructor: Dr. Hany H. Ammar Dept. of Computer Science and Electrical Engineering, WVU.

Investigating Survivability Strategies for Ultra-Large Scale (ULS) Systems Vanderbilt University Nashville, Tennessee Institute for Software Integrated.

Reconsidering Internet Mobility Alex C. Snoeren, Hari Balakrishnan, M. Frans Kaashoek MIT Laboratory for Computer Science.

Conformance Test Experiments for Distributed Real-Time Systems Rachel Cardell-Oliver Complex Systems Group Department of Computer Science & Software Engineering.

Service-oriented Resource Broker for QoS-Guaranteed in Grid Computing System Yichao Yang, Jin Wu, Lei Lang, Yanbo Zhou and Zhili Sun Centre for communication.

CS4730 Real-Time Systems and Modeling Fall 2010 José M. Garrido Department of Computer Science & Information Systems Kennesaw State University.

A QoS Policy Modeling Language for Publish/Subscribe Middleware Platforms A QoS Policy Modeling Language for Publish/Subscribe Middleware Platforms Joe.

Adaptive Resource Management Architecture for DRE Systems Nishanth Shankaran

1 BBN Technologies Quality Objects (QuO): Adaptive Management and Control Middleware for End-to-End QoS Craig Rodrigues, Joseph P. Loyall, Richard E. Schantz.

CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems Distributed Real-Time Systems.

SelfCon Foil no 1 Variability in Self-Adaptive Systems.

Self-Organizing Architectures SOAR 2010 International Conference on Autonomic Computing and Communication, ICAC Washington DC, USA June 7, 2010.

Message routing in multi-segment FTT networks: the isochronous approach Paulo Pedreiras, Luís Almeida Workshop on Parallel and.

© Drexel University Software Engineering Research Group (SERG) 1 The OASIS SOA Reference Model Brian Mitchell.

Manish Kumar,MSRITSoftware Architecture1 Remote procedure call Client/server architecture.

Technical Reading Report Virtual Power: Coordinated Power Management in Virtualized Enterprise Environment Paper by: Ripal Nathuji & Karsten Schwan from.

Chapter 1 : Computer Networks. Lecture 2. Computer Networks Classification: 1- Depend on the geographical area. 2- Depend on functional relationship.

Ben Miller.   A distributed algorithm is a type of parallel algorithm  They are designed to run on multiple interconnected processors  Separate parts.

FLARe: a Fault-tolerant Lightweight Adaptive Real-time Middleware for Distributed Real-time and Embedded Systems Dr. Aniruddha S. Gokhale

IHP Im Technologiepark Frankfurt (Oder) Germany IHP Im Technologiepark Frankfurt (Oder) Germany ©

Page 1 2P13 Week 1. Page 2 Page 3 Page 4 Page 5.

Introduction to Networks. When Personal Computers first appeared in business, software programs were designed for a single user. However as computers.

Computer Engineering and Networks, College of Engineering, Majmaah University Protocols OSI reference MODEL TCp /ip model Mohammed Saleem Bhat

OSI Model OSI MODEL. Communication Architecture Strategy for connecting host computers and other communicating equipment. Defines necessary elements for.

Software Architecture

Wayne Wolf Dept. of EE Princeton University

QNX Technology Overview

Presentation transcript:

WPDRTS ’05 1 Workshop on Parallel and Distributed Real-Time Systems 2005 April 4th and 5th, 2005, Denver, Colorado Challenge Problem Session Detection and Reaction to Unplanned Operational Events in Large Scale Distributed Real-Time Embedded Systems

WPDRTS ’05 2 Challenge Problem Context More real-time and embedded systems are becoming Quality of Service enabled thus allowing for the management of resources in a more dynamic policy based manner The mechanisms for defining and operating on this policy are still maturing These systems are also moving towards more peer-to-peer implementation of resource allocation for managing large-scale distributed networks of mixed hard and soft real-time subsystems The computing devices, consisting of multiple blade processors, numbering in the hundreds and are connected via combination of LANs, WANs, and wireless communications.

WPDRTS ’05 3 Challenge Problem One of the challenges in the management of resources (e.g., processors, memory, networks, communications, power) is the detection and reaction to operational events that were unplanned or unanticipated but shouldn’t cause failures (unexpected behavior). An example of this is receipt of a larger number of requests for service than specified by the requirements or anticipated by the system designers for a capability that if it fails would have a significant impact, e.g., cause the loss of a great deal of money. What approaches, methods, architectural features, and mechanisms exist, are under development, or are the subject of research to deal with these sorts of situations?

WPDRTS ’05 4 Discussion Points (1 of 3) In many large-scale real-time systems there are both periodic and aperiodic processes driven by data exchanges (messages) that affect the system performance. In QoS enabled systems, end-to-end deadlines may be specified for a set of applications that make up an operation’s capability. The policy for responding to certain events may also be specified. The occurrence of unplanned operational events may or may not cause resource exhaustion. The detection of and remediation action for unanticipated operational events may be specified by a function that defines a set of thresholds (e.g., upper and lower bounds) and the action(s) to be taken when these thresholds are exceeded.

WPDRTS ’05 5 Discussion Points (2 of 3) Is it better to have separate detection/reaction models for fault detection and handling and for unplanned operational events. Or does this make for a more complicated solution? Given the nature of distributed systems, what might be the issues with implementing peer-to-peer mechanisms for event detection and correlation, policy management, and policy enactment? There are some existing standards (e.g., the IETF SNMP and Distributed Management Task Force (DMTF) Common Information Model (CIM) that have been used by some of the enterprise level system management products (e.g., CA Unicenter, IBM Tivoli) but these don’t really address real-time QoS based resource management. How can these be extended to support the DRE space for this type of problem?

WPDRTS ’05 6 What are the issues within both systems and software engineering disciplines to the development of solutions to these challenge problems (e.g., what are some changes in processes and culture within these disciplines that are necessary to support the development of robust solutions that can exceed specified requirements, but don’t “break the budget” during the project development life-cycle)? Discussion Points (3 of 3)