Architecture and Real Time Systems Lab University of Massachusetts, Amherst An Application Driven Reliability Measures and Evaluation Tool for Fault Tolerant.

Slides:



Advertisements
Similar presentations
Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
Advertisements

U of Houston – Clear Lake
DETAILED DESIGN, IMPLEMENTATIONA AND TESTING Instructor: Dr. Hany H. Ammar Dept. of Computer Science and Electrical Engineering, WVU.
1 Fault-Tolerant Computing Systems #6 Network Reliability Pattara Leelaprute Computer Engineering Department Kasetsart University
Hallway Traffic Simulator Peter Riggs Computer Systems Lab
SOCELLBOT: A New Botnet Design to Infect Smartphones via Online Social Networking th IEEE Canadian Conference on Electrical and Computer Engineering(CCECE)
1 Advancing Supercomputer Performance Through Interconnection Topology Synthesis Yi Zhu, Michael Taylor, Scott B. Baden and Chung-Kuan Cheng Department.
Ashish Gupta Under Guidance of Prof. B.N. Jain Department of Computer Science and Engineering Advanced Networking Laboratory.
More routing protocols Alec Woo June 18 th, 2002.
Low Overhead Fault Tolerant Networking (in Myrinet)
1 Fall 2005 Internetworking: Concepts, Architecture and TCP/IP Layering Qutaibah Malluhi CSE Department Qatar University.
ENGIN Introduction to Computer Engineering.
Toward Energy-Aware Software-Based Fault Tolerance in Real-Time Systems Osman S. Unsal, Israel Koren, C. Mani Krishna Architecture and Real-Time Systems.
Fault-tolerant Adaptive Divisible Load Scheduling Xuan Lin, Sumanth J. V. Acknowledge: a few slides of DLT are from Thomas Robertazzi ’ s presentation.
Student Projects in Computer Networking: Simulation versus Coding Leann M. Christianson Kevin A. Brown Cal State East Bay.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
1 Evgeny Bolotin – ICECS 2004 Automatic Hardware-Efficient SoC Integration by QoS Network on Chip Electrical Engineering Department, Technion, Haifa, Israel.
1 Scheduling Mapping of tasks to time slots  Computation  Communication Mapping of power usage to time slots  Mechanical devices  Thermal subsystems.
Architecture and Real Time Systems Lab University of Massachusetts, Amherst I Koren and C M Krishna Electrical and Computer Engineering University of Massachusetts.
TOSSIM: Visualizing the Real World Philip Levis, Nelson Lee, Dennis Chi and David Culler UC Berkeley NEST Retreat, January 2003.
SBSE Course 4. Overview: Design Translate requirements into a representation of software Focuses on –Data structures –Architecture –Interfaces –Algorithmic.
REAL-TIME SOFTWARE SYSTEMS DEVELOPMENT Instructor: Dr. Hany H. Ammar Dept. of Computer Science and Electrical Engineering, WVU.
A brief overview about Distributed Systems Group A4 Chris Sun Bryan Maden Min Fang.
IMPROUVEMENT OF COMPUTER NETWORKS SECURITY BY USING FAULT TOLERANT CLUSTERS Prof. S ERB AUREL Ph. D. Prof. PATRICIU VICTOR-VALERIU Ph. D. Military Technical.
Cluster Reliability Project ISIS Vanderbilt University.
Distributed Computation in MANets Robot swarm developed by James Rice University.
Planning and Analysis Tools to Evaluate Distribution Automation Implementation and Benefits Anil Pahwa Kansas State University Power Systems Conference.
© Oxford University Press 2011 DISTRIBUTED COMPUTING Sunita Mahajan Sunita Mahajan, Principal, Institute of Computer Science, MET League of Colleges, Mumbai.
1 Software Reliability Assurance for Real-time Systems Joel Henry, Ph.D. University of Montana NASA Software Assurance Symposium September 4, 2002.
V. Tsaoussidis, DUTH – Greece
Distributed Systems and Algorithms Sukumar Ghosh University of Iowa Spring 2011.
NMS Case Study HP OpenView Network Node Manager Hong-taek Ju DP&NM Lab. Dept. of Computer Science and Engineering POSTECH, Pohang Korea Tel:
Deeply Embedded Large Scale Networks Specify and Control Emerging Behavior.
Part.1.1 In The Name of GOD Welcome to Babol (Nooshirvani) University of Technology Electrical & Computer Engineering Department.
Advanced Principles of Operating Systems (CE-403).
Chapter 8-2 : Multicomputers Multiprocessors vs multicomputers Multiprocessors vs multicomputers Interconnection topologies Interconnection topologies.
REAL-TIME SOFTWARE SYSTEMS DEVELOPMENT Instructor: Dr. Hany H. Ammar Dept. of Computer Science and Electrical Engineering, WVU.
Framework for MDO Studies Amitay Isaacs Center for Aerospace System Design and Engineering IIT Bombay.
Summary :-Distributed Process Scheduling Prepared By:- Monika Patel.
Interconnect simulation. Different levels for Evaluating an architecture Numerical models – Mathematic formulations to obtain performance characteristics.
Interconnect simulation. Different levels for Evaluating an architecture Numerical models – Mathematic formulations to obtain performance characteristics.
1 Software Reliability Analysis Tools Joel Henry, Ph.D. University of Montana.
Copyright 2004 Koren & Krishna ECE655/Koren Part.8.1 UNIVERSITY OF MASSACHUSETTS Dept. of Electrical & Computer Engineering Fault Tolerant Computing ECE.
1 Putchong Uthayopas, Thara Angsakul, Jullawadee Maneesilp Parallel Research Group, Computer and Network System Research Laboratory Department of Computer.
1 CS145 Lecture 26 What’s next?. 2 What software questions do we study? Where is software headed?
Basic Linear Algebra Subroutines (BLAS) – 3 levels of operations Memory hierarchy efficiently exploited by higher level BLAS BLASMemor y Refs. FlopsFlops/
Copyright © Clifford Neuman - UNIVERSITY OF SOUTHERN CALIFORNIA - INFORMATION SCIENCES INSTITUTE Advanced Operating Systems Lecture notes Dr.
Tolerating Communication and Processor Failures in Distributed Real-Time Systems Hamoudi Kalla, Alain Girault and Yves Sorel Grenoble, November 13, 2003.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
CS 351/ IT 351 Modeling and Simulation Technologies HPC Architectures Dr. Jim Holten.
Self-stabilizing energy-efficient multicast for MANETs.
Voice Over Internet Protocol (VoIP) Copyright © 2006 Heathkit Company, Inc. All Rights Reserved Presentation 5 – VoIP and the OSI Model.
Wireless Network Management SANDEEP. Network Management Network management is a service that employs a variety of tools, applications, and devices to.
Bringing together leading research institutions to advance electric ship concepts. Power Interconnect Tool Angela Card Mississippi State.
Network Systems Lab. Korea Advanced Institute of Science and Technology No.1 Ch. 1 Introduction EE692 Parallel and Distribution Computation | Prof. Song.
Copyright 2007 Koren & Krishna, Morgan-Kaufman Part.1.1 FAULT TOLERANT SYSTEMS Fault tolerant Measures.
Copyright 2007 Koren & Krishna, Morgan-Kaufman Part.12.1 FAULT TOLERANT SYSTEMS Part 12 - Networks.
Seminar On Rain Technology
PERFORMANCE MANAGEMENT IMPROVING PERFORMANCE TECHNIQUES Network management system 1.
SEMINAR TOPIC ON “RAIN TECHNOLOGY”
Interaction and Animation on Geolocalization Based Network Topology by Engin Arslan.
TrueTime.
Application Level Fault Tolerance and Detection
Mobicom ‘99 Per Johansson, Tony Larsson, Nicklas Hedman
Configuration of Cisco Routers in GNS3
Application Level Fault Tolerance and Detection
The RAPIDS Project Israel Koren C. Mani Krishna ARTS
Modeling and Simulation of TTEthernet
Wide Area Workload Management Work Package DATAGRID project
Distributed Systems and Algorithms
Presentation transcript:

Architecture and Real Time Systems Lab University of Massachusetts, Amherst An Application Driven Reliability Measures and Evaluation Tool for Fault Tolerant Real Time Systems I Koren and C M Krishna Electrical and Computer Engineering University of Massachusetts Amherst, MA Sponsored by Space and Naval Warfare Systems Command & Advanced Research Projects Agency ARPA order B855 under SPAWAR contract N C-0165 (Please view the slides only through Power Point Slide Show. The show is automated)

Architecture and Real Time Systems Lab University of Massachusetts, Amherst * Provides Network Measures which quantify how network parameters including topology and routing techniques effect the reliability of a given system * Provides Computer Measures which quantify how parameters such as task allocation, scheduling, check pointing and fault recovery algorithms affect the system’s reliability * GUI part of simulator allows user to draw the topology and specify the task sets * Output of the simulator is presented in the form of graph between various parameters TRIDENT

Architecture and Real Time Systems Lab University of Massachusetts, Amherst Calculating Computer Measures for Robotics Application Loading environment file for robotics application Simulator is running now Graphs indicating computer measures are produced

Architecture and Real Time Systems Lab University of Massachusetts, Amherst Computer Measures for Robotics Application Minimum Deadline vs. Surge size for various number of processors Recovery Time vs. Surge size for various number of processors

Architecture and Real Time Systems Lab University of Massachusetts, Amherst Computer Measures for Robotics Application System Recovery Time vs. Surge Orientation Processor for various number of processors Processor Recovery Time vs. Processor ID for various number of processors

Architecture and Real Time Systems Lab University of Massachusetts, Amherst Calculating Network and Computer Measures for Avionics Application Loading environment file for Avionics application Simulator is running now Graphs indicating network measures are produced Simulator is running now to get computer measures Graphs indicating computer measures are produced

Architecture and Real Time Systems Lab University of Massachusetts, Amherst Network Measures for Avionics Application Diameter vs. Link Failure Probability for various networks Probability of Disconnection vs. Link Failure Probability for various networks

Architecture and Real Time Systems Lab University of Massachusetts, Amherst Network Measures for Avionics Application Node Pair Distance vs. Link Failure Probability for various networks Frequency of failure and component size vs. Link Failure Probability for various networks

Architecture and Real Time Systems Lab University of Massachusetts, Amherst Minimum Deadline vs. Surge size for various Tasks sets Recovery Time vs. Surge size for various Tasks sets Computer Measures for Avionics Application

Architecture and Real Time Systems Lab University of Massachusetts, Amherst System Recovery Time vs. Surge Origination Processor for various Tasks sets Processor Recovery Time vs. Processor ID for various Tasks sets Computer Measures for Avionics Application

Architecture and Real Time Systems Lab University of Massachusetts, Amherst Summary The new measures can be used to evaluate the effect of * Task Allocation algorithms * Scheduling algorithms * Interconnection topologies * Communication protocols * Failure recovery mechanisms on real-time dependability Our tool, TRIDENT, calculates the new dependability measures and presents in graphical form