Reliable Distributed Systems

Slides:



Advertisements
Similar presentations
15 May 2007 IVOA Beijing: Grid & Web Services 21 Matthew J. Graham (Caltech, NVO) T HE US N ATIONAL V IRTUAL O BSERVATORY vs.
Advertisements

MgmtSecurity QOS Transport Packaging Extensions Wire Inspection Discovery Publication Registry Description Business Level Agreements Service Level Agreements.
MgmtSecurity QOS Transport Packaging Extensions Wire Inspection Discovery Publication Registry Description Business Level Agreements Service Level Agreements.
Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
GT 4 Security Goals & Plans Sam Meder
cetis Really Complex Web Service Specifications Scott Wilson.
The Open Grid Services Architecture, Version 1.0 I. Foster, H. Kishimoto, A. Savva, D. Berry, A. Djaoui, A. Grimshaw, B. Horn, F. Maciel, F. Siebenlist,
WS-* Specifications Process Step 2 Broader Community Participation Step 1 Initial Development Process reconciles conflicting goals Quality of engineering.
1 Transactions and Web Services. 2 Web Environment Web Service activities form a unit of work, but ACID properties are not always appropriate since Web.
High Performance Computing Course Notes Grid Computing.
Service Oriented Architecture SOA. SOA has been the New New Thing for the last few years in enterprise software As with everything that gains visibility.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Workflow utilization in composition of complex applications based.
UDDI Overview Web Services Registry SOA Enabler. What Is UDDI? Universal Description, Discovery, and Integration Protocols for web services registry Public.
Distributed components
Distributed Systems Fall 2010 Replication Fall 20105DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Latest techniques and Applications in Interprocess Communication and Coordination Xiaoou Zhang.
Notes to the presenter. I would like to thank Jim Waldo, Jon Bostrom, and Dennis Govoni. They helped me put this presentation together for the field.
CS514: Intermediate Course in Operating Systems Professor Ken Birman Ben Atkin: TA Lecture 9: Sept. 21.
Extending Web Applications with Web Services Mike Taulty Developer & Platform Group Microsoft Ltd
Distributed Systems Fall 2009 Replication Fall 20095DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Distributed Systems 2006 Group Membership * *With material adapted from Ken Birman.
Our Research Background and Possible Research Directions in the Context of the Adapt Project Davide Rossi Università di Bologna IST
Reliable Distributed Systems Membership. Agreement on Membership Recall our approach: Detecting failure is a lost cause. Too many things can mimic failure.
1 A Framework for Highly Available Services Based on Group Communication Alan Fekete Idit Keidar University of Sidney MIT.
Web Services and OGSA Tal Lavian, Phil Wang. What Are Web Services? ● Conventions for program-to-program Communication ● Built on existing Web infrastructure.
Chapter 5: Principles of Service- Oriented Computing Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns,
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Grid Computing and Web Services Marty Humphrey Assistant Professor Computer Science Department University of Virginia.
The Anatomy of the Grid Introduction The Nature of Grid Architecture Grid Architecture Description Grid Architecture in Practice Relationships with Other.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
Web Service Future CS409 Application Services Even Semester 2007.
Fault Tolerant Services
Web Service Resource Framework WSMO Presentation Jos de Bruijn Digital Enterprise Research Institute http ://
18-1 Summary (Day 2) Learning Summary – What is JXTA ? – Understand the fundamental concepts of JXTA – Learn about the various implementations of.
© 2004 IBM Corporation ICSOC2004 Panel Discussion: Grid Systems: What is needed from web service standards? Jeffrey Frey IBM.
Replication and Group Communication. Management of Replicated Data FE Requests and replies C Replica C Service Clients Front ends managers RM FE RM Instructor’s.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Relying on Safe Distance to Achieve Strong Partitionable Group Membership in Ad Hoc Networks Authors: Q. Huang, C. Julien, G. Roman Presented By: Jeff.
SOA Standard & Application Zhu Lvwei TongTech Co,.Ltd Chief Architect 2008 年 4 月.
Chapter 8 Fault Tolerance. Outline Introductions –Concepts –Failure models –Redundancy Process resilience –Groups and failure masking –Distributed agreement.
XtreemOS IP project is funded by the European Commission under contract IST-FP Scientific coordinator Christine Morin, INRIA Presented by Ana.
Service Oriented Architecture
Java Web Services Orca Knowledge Center – Web Service key concepts.
rain technology (redundant array of independent nodes)
Replication & Fault Tolerance CONARD JAMES B. FARAON
SuperComputing 2003 “The Great Academia / Industry Grid Debate” ?
Algorithm for Virtually Synchronous Group Communication
WEB SERVICES.
Unit – 5 JAVA Web Services
Bringing Grid & Web Services Together
Grid Computing B.Ramamurthy 9/22/2018 B.Ramamurthy.
Introduction to Web Services and SOA
Providing Secure Storage on the Internet
Service-centric Software Engineering 1
The Globus Toolkit™: Information Services
CSSSPEC6 SOFTWARE DEVELOPMENT WITH QUALITY ASSURANCE
Reliable Distributed Systems
Service-Oriented Computing: Semantics, Processes, Agents
Experiences in Deploying Services within the Axis Container
CS514: Intermediate Course in Operating Systems
Service Oriented Architecture (SOA)
B. Ramamurthy Based on Paper by Werner Vogels and Chris Re
The Anatomy and The Physiology of the Grid
Physics data processing with SOA
The Anatomy and The Physiology of the Grid
Introduction to Web Services and SOA
Current and Future Perspectives of Grid Technology Panel
Distributed Systems and Algorithms
Presentation transcript:

Reliable Distributed Systems Membership 11/29/2018

http/https 1.1 WS-Addressing SOAP1.2, MTOM XML1.1: XMLScheme, XPath, XSL WS-Policy, WS-MetaDataExchange WSDL1.2, UDDI3.0 WS-Reliability, WS-Membership WS-Security, WS-Trust, WS-SecureConversation WS-Transaction, WS-Coordination WS-BPEL, WS-Choreography WS-Resource, WS-ResourceProperties WS-Notification, WS-Eventing 11/29/2018

Group Membership Foundational concept for high speed data replication protocols. Essential for large scale grid-based virtual organizations and resource discovery and scheduling Solution: Group membership service (GMS) Manage GMS services’ membership and then manage other services’ general membership: 2-tier architecture GMP Group Membership Protocol is used among GMS to manage membership GMS then woks on its group. Another problem is static vs dynamic membership 11/29/2018

Agreement on Membership Detecting failure is a lost cause. Too many things can mimic failure To be accurate would end up waiting for a process to recover Substitute agreement on membership Now we can drop a process because it isn’t fast enough This can seem “arbitrary”, e.g. A kills B… GMS implements this service for everyone else 11/29/2018

Architecture Applications use replicated data for high availability 2PC-like protocols use membership changes instead of failure notification Membership Agreement, “join/leave” and “P seems to be unresponsive” 11/29/2018

Architecture Application processes membership views A {A} {A,B,D} {A,D} {A,D,C} {D,C} GMS processes join B leave GMS join C X Y Z D A seems to have failed 11/29/2018

GMS API Guess? 11/29/2018

GMS API Three operations: Join(process-id, callback) Leave(process-id) Monitor(process-id,callback) GMS needs to be highly available; 11/29/2018

Example Distributed system using the GMS is a air-traffic control system it would require itself to be reconfigured with existing processes after failure of a process. In some cases such as in grid VO it may be fact of life; membership may be changing dynamically. 11/29/2018

WS-Membership: Failure Management in webservices world WS-Membership .. by W.Vogels and C.Re 11/29/2018