TECHNICAL SEMINAR On. introduction  Cloud support for real time system is really important because, today we found a lot of real time systems around.

Slides:



Advertisements
Similar presentations
Tolerating Timing faults TSW November 2009 Anders P. Ravn Aalborg University.
Advertisements

Christian Delbe1 Christian Delbé OASIS Team INRIA -- CNRS - I3S -- Univ. of Nice Sophia-Antipolis November Automatic Fault Tolerance in ProActive.
Towards Self-Testing in Autonomic Computing Systems Tariq M. King, Djuradj Babich, Jonatan Alava, and Peter J. Clarke Software Testing Research Group Florida.
First Step Towards Automatic Correction of Firewall Policy Faults Fei Chen Alex X. Liu Computer Science and Engineering Michigan State University JeeHyun.
Software Testing and Quality Attributes Software Testing Module ( ) Dr. Samer Hanna.
Making Services Fault Tolerant
1 Building Reliable Web Services: Methodology, Composition, Modeling and Experiment Pat. P. W. Chan Department of Computer Science and Engineering The.
A 100,000 Ways to Fa Al Geist Computer Science and Mathematics Division Oak Ridge National Laboratory July 9, 2002 Fast-OS Workshop Advanced Scientific.
Ashish Gupta Under Guidance of Prof. B.N. Jain Department of Computer Science and Engineering Advanced Networking Laboratory.
Software Fault Tolerance – The big Picture RTS April 2008 Anders P. Ravn Aalborg University.
Fault Tolerance: Basic Mechanisms mMIC-SFT September 2003 Anders P. Ravn Aalborg University.
Modified from Sommerville’s originals Software Engineering, 7th edition. Chapter 20 Slide 1 Critical systems development.
Design of Fault Tolerant Data Flow in Ptolemy II Mark McKelvin EE290 N, Fall 2004 Final Project.
Dependability ITV Real-Time Systems Anders P. Ravn Aalborg University February 2006.
Developing Dependable Systems CIS 376 Bruce R. Maxim UM-Dearborn.
Strategic Directions in Real- Time & Embedded Systems Aatash Patel 18 th September, 2001.
1 Making Services Fault Tolerant Pat Chan, Michael R. Lyu Department of Computer Science and Engineering The Chinese University of Hong Kong Miroslaw Malek.
Undergraduate Poster Presentation Match 31, 2015 Department of CSE, BUET, Dhaka, Bangladesh Wireless Sensor Network Integretion With Cloud Computing H.M.A.
ATIF MEHMOOD MALIK KASHIF SIDDIQUE Improving dependability of Cloud Computing with Fault Tolerance and High Availability.
Failure Spread in Redundant UMTS Core Network n Author: Tuomas Erke, Helsinki University of Technology n Supervisor: Timo Korhonen, Professor of Telecommunication.
6.4 Data And File Replication Presenter : Jing He Instructor: Dr. Yanqing Zhang.
An Approach to Test Autonomic Containers Ronald Stevens (IEEE Computer Society & ACM Student Member) August 1, 2006 REU Sponsored by NSF.
Automatic Load Balancing of ProActive Applications... with Fault Tolerance Javier Bustos, José Piquer, Universidad de Chile, Santiago, Chile Sebastien.
A Cooperative Diversity- Based Robust MAC Protocol in wireless Ad Hoc Networks Sangman Moh, Chansu Yu Chosun University, Cleveland State University Korea,
Copyright © Clifford Neuman and Dongho Kim - UNIVERSITY OF SOUTHERN CALIFORNIA - INFORMATION SCIENCES INSTITUTE Advanced Operating Systems Lecture.
Yaoshen Yuan Tufts University Virtual Machine Usage in Cloud Computing in Google EE-126.
TRIGON BASED AUTHENTICATION, AUTHORIZATION AND DISTRIBUTION OF ENCRYPTED KEYS WITH GLOBUS MIDDLEWARE Anitha Kumari K 08MW01 II ME – Software Engineering.
Can Cloud Computing be Used for Planning? An Initial Study Authors: Qiang Lu*, You Xu†, Ruoyun Huang†, Yixin Chen† and Guoliang Chen* from *University.
COST289 14th MCM Towards Cognitive Communications 13 April Towards Cognitive Communications A COST Action Proposal Mehmet Safak.
1 Digitally Controlled Converter with Dynamic Change of Control Law and Power Throughput Carsten Nesgaard Michael A. E. Andersen Nils Nielsen Technical.
N. GSU Slide 1 Chapter 05 Clustered Systems for Massive Parallelism N. Xiong Georgia State University.
Integrating Scale Out and Fault Tolerance in Stream Processing using Operator State Management Author: Raul Castro Fernandez, Matteo Migliavacca, et al.
SENG521 (Fall SENG 521 Software Reliability & Testing Fault Tolerant Software Systems: Techniques (Part 4b) Department of Electrical.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 20 Slide 1 Critical systems development 3.
Survey of Adding Fault Tolerance to Service Oriented Architecture Ingrid Buckley 03/26/09.
Fault-Tolerant Parallel and Distributed Computing for Software Engineering Undergraduates Ali Ebnenasir and Jean Mayo {aebnenas, Department.
Fault Tolerance Mechanisms ITV Model-based Analysis and Design of Embedded Software Techniques and methods for Critical Software Anders P. Ravn Aalborg.
CprE 458/558: Real-Time Systems
Software Testing Definition Software Testing Module ( ) Dr. Samer Odeh Hanna.
April 28, 2003 Early Fault Detection and Failure Prediction in Large Software Systems Felix Salfner and Miroslaw Malek Department of Computer Science Humboldt.
CS 505: Thu D. Nguyen Rutgers University, Spring CS 505: Computer Structures Fault Tolerance Thu D. Nguyen Spring 2005 Computer Science Rutgers.
Online monitoring and filtering Graham July 2009 Graham July 2009.
Intelligent Systems Software Assurance Symposium 2004 Bojan Cukic & Yan Liu, Robyn Lutz & Stacy Nelson, Chris Rouff, Johann Schumann, Margaret Smith July.
Research of P2P Architecture based on Cloud Computing Speaker : 吳靖緯 MA0G0101.
WS-DREAM: A Distributed Reliability Assessment Mechanism for Web Services Zibin Zheng, Michael R. Lyu Department of Computer Science & Engineering The.
Presented by Reliability, Availability, and Serviceability (RAS) for High-Performance Computing Stephen L. Scott Christian Engelmann Computer Science Research.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
Fault Management in Mobile Ad-Hoc Networks by Tridib Mukherjee.
Copyright © Clifford Neuman - UNIVERSITY OF SOUTHERN CALIFORNIA - INFORMATION SCIENCES INSTITUTE Advanced Operating Systems Lecture notes Dr.
Mixed Criticality Systems: Beyond Transient Faults Abhilash Thekkilakattil, Alan Burns, Radu Dobrin and Sasikumar Punnekkat.
NASA Software Assurance Symposium 2001 Metrics for Fault-Tolerant Real-Time Software Afzel Noore Computer Science and Electrical Engineering West Virginia.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
1 Developing Aerospace Applications with a Reliable Web Services Paradigm Pat. P. W. Chan and Michael R. Lyu Department of Computer Science and Engineering.
Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,
An Algorithm for Automatically Obtaining Distributed and Fault Tolerant Static Schedules Alain Girault - Hamoudi Kalla - Yves Sorel - Mihaela Sighireanu.
Denis Caromel1 OASIS Team INRIA -- CNRS - I3S -- Univ. of Nice Sophia-Antipolis -- IUF IPDPS 2003 Nice Sophia Antipolis, April Overview: 1. What.
18/05/2006 Fault Tolerant Computing Based on Diversity by Seda Demirağ
Week#3 Software Quality Engineering.
Software Quality Assurance
On Reliable Modular Testing with Vulnerable Test Access Mechanisms Lin Huang, Feng Yuan and Qiang Xu.
Reliability and Performance of the SNS Machine Protection System Doug Curry 2013.
Presented by: Saurav Kumar Bengani
The consensus problem in distributed systems
Sindhusha Doddapaneni
Application Level Fault Tolerance and Detection
Supporting Fault-Tolerance in Streaming Grid Applications
Fault Tolerance Distributed Web-based Systems
by Xiang Mao and Qin Chen
CSC3050 – Computer Architecture
MapReduce: Simplified Data Processing on Large Clusters
Presentation transcript:

TECHNICAL SEMINAR On

introduction  Cloud support for real time system is really important because, today we found a lot of real time systems around us.  Use of cloud infrastructure for real time applications increases the chances of errors.  There is an increased need to tolerate the fault.  Here is the model for the fault tolerance of real time applications running at cloud infrastructure.

Literature survey Present System  X. Kong et. al. [1, 5] presented a model for virtual infrastructure performance and fault tolerance. But it is not well suited for the fault tolerance of real time cloud applications.  For the non-cloud applications, a baseline model for distributed RTS is, distributed recovery block [11] proposed by K. H. Kim which is very basic in nature.  Another model is “time stamped fault tolerance of distributed RTS” [9], which is proposed by S.Malik and M. J. Rehman.  All of these models were defined for the real time systems based on standard computing architecture. No one has introduced the fault tolerance on the basis of reliability of node and reliability assessment of node.

Proposed System Model Input Buffer Algo Acceptance Test TC WDT Application Application result Valid result Validity exception All nodes fail In time / time overrun RA R algo Valid result In time results Reliability level DM Final output Failur e signal RC pass checkpoint Valid result

Fault Tolerance Mechanism Reliability Assessment Algorithm: Begin Initially reliability:=1, n :=1 Input from configuration RF, maxReliability,minReliability Input nodestatus if nodeStatus =Pass then reliability := reliability + (reliability * RF) if n > 1 then n := n-1; else if nodeStatus = Fail then reliability := reliability – (reliability * RF * n) n := n+1; if reliability >= maxReliability then reliability := maxReliability if reliability < minReliability then nodeStatus :=dead call_proc: remove_this_node call_proc: add_new_node End

Contnd….. Decision Mechanism Algorithm Begin Initially reliability:=1, n :=1 Input from RA nodeReliability, numCandNodes Input from configuration SRL bestReliability := find_reliability of node with highest reliability if bestReliability >= SRL status := success else perform_backward_recovery call_proc: remove_node_minReliability call_proc: add_new_node End

Reliability Assessment Impact Analysis

Contd…………  Here we can see that convergence towards decrement in reliability is much higher.

Experiments and Results

Advantages:  It has a dynamic behaviour of reliability configuration.  The scheme is highly fault tolerant.  The reason behind adaptive reliability is that the scheme can take advantage of dynamic scalability of cloud infrastructure.   This system takes the full advantage of using diverse software.  Probability of failure is very less in our devised scheme.  The system assures the reliability by providing the backward recovery.

Conclusion  The proposed scheme is a good option to be used as a fault tolerance mechanism for real time computing on cloud infrastructure.  The reason behind adaptive reliability is that the scheme can take advantage of dynamic scalability of cloud infrastructure.  Probability of failure is very less in our devised scheme.  It does not suffer from domino effect as check pointing is made in the end when all the nodes have produced the result.

References:  [1] X. Kong, J. Huang, C. Lin, P. D. Ungsunan, “Performance, Fault-tolerance and Scalability Analysis of Virtual Infrastructure Management System”, 2009 IEEE International Symposium on Parallel and Distributed Processing with Applications, Chengdu, China, August 9-12, 2009  [2] J.Coenen, J. Hooman, “A Formal Approach to Fault Tolerance in Distributed Real-Time Systems”, Department o f Mathematics and Computing Science, Eindhoven Universityof Technology, Nether land  [3] S. Malik, M. J. Rehman, “Time Stamped Fault Tolerance in Distributed Real Time Systems”; IEEE International Multitopic Conference, Karachi, Pakistan, 2005  [4] K. H. Kim, “Structuring DRB Computing Stations in Highly Decentralized Systems,: Proceedings International Symposium on Autonomous Decentralized Systems, Kawasaki, 1993, pp  [7] “ProActive Service and Advanced Feature”, INRIA – University of Nice Sophia Antipolis 

Query????????????