Fault Detection Manager FDA Fault Recovery Manager Host VM OCCO InfoBroker CloudHandler ServiceComposer Application Create VM Deploy application Deploy.

Slides:



Advertisements
Similar presentations
Support for Fault Tolerance (Dynamic Process Control) Rich Graham Oak Ridge National Laboratory.
Advertisements

High Availability Deep Dive What’s New in vSphere 5 David Lane, Virtualization Engineer High Point Solutions.
Use Cases for Fault Tolerance Support in MPI Rich Graham Oak Ridge National Laboratory.
L-Root: Expanding Distribution in Africa. 2 One of 13 root name servers containing Internet Protocol addresses Operated by ICANN using anycast technology.
MapReduce Online Created by: Rajesh Gadipuuri Modified by: Ying Lu.
Arrow color indicates specific subset of Security Service Desk Common Backplane API. is DC Backplane API impledmented by the Backplane Services. Devices.
Time Series Data Repository (TSDR)
DataGrid is a project funded by the European Union 22 September 2003 – n° 1 EDG WP4 Fabric Management: Fabric Monitoring and Fault Tolerance
A Case for a Fault-Tolerant Virtual Machine Andrey Ermolinskiy Mohit Chawla.
Fault Tolerance -Example TSW November 2009 Anders P. Ravn Aalborg University.
Components for high performance grid programming in the GRID.it project 1 Workshop on Component Models and Systems for Grid Applications - St.Malo 26 june.
6th Biennial Ptolemy Miniconference Berkeley, CA May 12, 2005 Distributed Computing in Kepler Ilkay Altintas Lead, Scientific Workflow Automation Technologies.
1 ITC242 – Introduction to Data Communications Week 12 Topic 18 Chapter 19 Network Management.
Asynchronous Ad-hoc Leader Election in Complete Networks Nolan Irving.
8. Fault Tolerance in Software
Legion Worldwide virtual computer. About Legion Made in University of Virginia Object-based metasystems software project middleware that connects computer.
Towards an Evaluation Framework for Availability Solutions in the Cloud Proceedings of ISSRE 2014 Proceedings of ISSRE 2014 Majid Hormati, Ferhat Khendek.
Evaluate container lifecycle support in TOSCA TOSCA – 174 Adhoc TC.
SOA Implementation & Federation SOA General Concepts SOA Implementation, System landscape and Processes – wM 8.2 Federation of Heterogeneous SOA environments.
User-Perceived Performance Measurement on the Internet Bill Tice Thomas Hildebrandt CS 6255 November 6, 2003.
SensIT PI Meeting, January 15-17, Self-Organizing Sensor Networks: Efficient Distributed Mechanisms Alvin S. Lim Computer Science and Software Engineering.
An approach to Intelligent Information Fusion in Sensor Saturated Urban Environments Charalampos Doulaverakis Centre for Research and Technology Hellas.
Architecture Update. Guest Host HOST COMPONENTS VERNIER Community Level: Connected Clusters User Node KB Super Node COMMUNITY MONITOR SERVLET WEB SERVER.
NOX an OpenFlow controller. Role of Controller in OpenFlow Environments Push forwarding logic to switches Give developers a high-level API to develop.
High-Availability Linux.  Reliability  Availability  Serviceability.
7/2/2003Supervision & Monitoring section1 Supervision & Monitoring Organization and work plan Olof Bärring.
Cluster Reliability Project ISIS Vanderbilt University.
Kuali Enterprise Workflow Presented at ITANA October 2009 Eric Westfall – Kuali Rice Project Manager.
WP4 Security and AA(A) issues For WP4: David Groep
Livespace Architecture. Overview Livespace requirements Discussion of issues Livespace Architecture.
Distributed systems A collection of autonomous computers linked by a network, with software designed to produce an integrated computing facility –A well.
© 2006 Process-one – All right reserved Page 1 Jérôme Sautret Horde Leader, a Framework to Build Cluster Aware Erlang Web Administration Console November.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
Chapter 14 Part II: Architectural Adaptation BY: AARON MCKAY.
Olof Bärring – WP4 summary- 4/9/ n° 1 Partner Logo WP4 report Plans for testbed 2
COPYRIGHT © 2012 ALCATEL-LUCENT. ALL RIGHTS RESERVED. Application Monitoring in TOSCA Presenter: Ifat Afek, Alcatel-Lucent Jan 2015.
A Framework for the Reconfiguration of Ubicomp Systems Pau Giner, Carlos Cetina, Joan Fons, Vicente Pelechano.
TOSCA Monitoring Reference Architecture Straw-man Roger Dev CA Technologies March 18, 2015 PRELIMINARY.
ALICE, ATLAS, CMS & LHCb joint workshop on
SAN DIEGO SUPERCOMPUTER CENTER Inca TeraGrid Status Kate Ericson November 2, 2006.
Information Services Andrew Brown Jon Ludwig Elvis Montero grid:seminar1:lectures:seminar-grid-1-information-services.ppt.
Cloud Interoperability & Standards. Scalability and Fault Tolerance Fault tolerance is the property that enables a system to continue operating properly.
FIspace Review Meeting 1 T280 Francisco Pérez Atos.
Next Generation Security Solutions Fault Tolerant powered by Hydra January 2013.
1 Putchong Uthayopas, Thara Angsakul, Jullawadee Maneesilp Parallel Research Group, Computer and Network System Research Laboratory Department of Computer.
EU 2nd Year Review – Feb – WP4 demo – n° 1 WP4 demonstration Fabric Monitoring and Fault Tolerance Sylvain Chapeland Lord Hess.
Integration of the ATLAS Tag Database with Data Management and Analysis Components Caitriana Nicholson University of Glasgow 3 rd September 2007 CHEP,
Ceilometer + Gnocchi + Aodh Architecture
Shane O’Neill Synergy Geekovation XenDesktop VM’s need to be registered in order for the user to be able to access them. Registration with the XenDesktop.
Modern Programming Language. Web Container & Web Applications Web applications are server side applications The most essential requirement.
788.11J Presentation Volcano Monitoring Deploying a Wireless Sensor Network on an Active Volcano Phani Arava.
The Earth System Grid (ESG) A Fault Monitoring System for ESG Components DOE SciDAC ESG Project Review Argonne National Laboratory, Illinois May 8-9, 2003.
Collaboration diagrams. Deployment diagrams. Lesson 4.
Current GEMINI use of instrumentize script to initialize & configure services Hussam Nasir University of Kentucky.
MSF and MAGE: e-Science Middleware for BT Applications Sep 21, 2006 Jaeyoung Choi Soongsil University, Seoul Korea
APRIL 10, Meeting Agenda  Prototype 2 Goals  Robust Connections Demo  System Diagnostics Tool Demo  Final Prototype Risk Mitigation  Final.
DataTAG is a project funded by the European Union CERN, 8 May 2003 – n o 1 / 10 Grid Monitoring A conceptual introduction to GridICE Sergio Andreozzi
DIVYA K 1RN09IS016 RNSIT1. Cloud computing provides a framework for supporting end users easily through internet. One of the security issues is how to.
Automated Testing for Dynamics CRM Integration Testing Custom Workflow Activities Wael Hamze Ramón Tébar.
Automated Testing for Dynamics CRM Integration Testing Plug-ins Wael Hamze Ramón Tébar.
. . . ? ? ? ? ? ? ETL Engine Server Gateway Server Database Server
Doctor + OPenStack Congress
Curator: Self-Managing Storage for Enterprise Clusters
مبررات إدخال الحاسوب في رياض الأطفال
Fault-Tolerant CORBA By, Srinivas Seshu.
مديريت موثر جلسات Running a Meeting that Works
21twelveinteractive.com/ twitter.com/21twelveI/ facebook.com/21twelveinteractive/ linkedin.com/company/21twelve-interactive/ pinterest.com/21twelveinteractive/
Fault Tolerance Distributed
Similarities Differences
Fault-Tolerant CORBA By, Srinivas Seshu.
Presentation transcript:

Fault Detection Manager FDA Fault Recovery Manager Host VM OCCO InfoBroker CloudHandler ServiceComposer Application Create VM Deploy application Deploy FDA FT API Query status Query composite node status New in OCCO Trigger enactor run

FDA Fault Recovery Manager Host VM OCCO InfoBroker CloudHandler ServiceComposer Application Create VM Deploy application Deploy FDA FT API Query status Query composite node status New in OCCO Trigger enactor run Fault Detection Manager

OCCO-FT Framework Major components Fault Detection Manager (global component) Single global component which liaises with localized components Interaction with OCCO components to communicate fault occurrence/causes Fault Detection Agent(s) (localized components) Individual components for each node(VM) Are they similar to ad-hoc fault detectors ( may become a plugins for each type of fault)? If there is a plugin for each different type of fault then the local FDA is a collection of these plugins or detectors Fault Recovery Manager (global component) To perform actions to remedy/lessen the impact of a fault Interaction with OCCO components to achieve desired remedial actions