6/27/2015Page 1 This presentation is based on WS-Membership: Failure Management in Web Services World B. Ramamurthy Based on Paper by Werner Vogels and.

Presentation transcript:

6/27/2015
This presentation is based on
WS-Membership: Failure Management in Web Services World
B. Ramamurthy
Based on the paper by Werner Vogels and Chris Re

Introduction
An important factor in the successful deployment of federated web-services-based business activities will be the ability to guarantee reliable distributed operation and execution.
Failure management is essential for systems constructed out of web services on the network.
WS-Membership
– a coordination service
– a generic web-service interface for tracking registered web services and
– for providing membership monitoring information.
A prototype membership service
– based on epidemic protocol techniques, has been implemented
Context: the Obduro project, which focuses on global, scalable distributed systems based on web-service technologies.

Obduro Project
Development of advanced distributed services in the context of the WS-Coordination framework
Development of high-performance server technology for web services routing
Integration of reliability and other distributed services into coordination and choreography engines
Development of a framework for global event management

WS-Membership
A membership service is about service availability
Coordination protocol
Tracks registered members
Presents membership updates to monitors
WS-Membership: failure detection, membership dissemination

Component services
Epidemic communication
State management
Development of advanced distributed services in the context of the web-services Coordination framework.
– These services will include a failure management service, a consensus service and a lightweight distributed state-sharing engine.

Five Roles Modeled
Coordination Service
– Receives activation and membership requests and routes them to the Membership Service
Membership Service
– Provides failure detection of registered web services and disseminates membership information

Five roles (contd.)
Member Service
– A software component that has registered itself for failure detection, either directly with a Membership Service or through a Membership Proxy
Membership Proxy
– A software component that is interposed between a Member Service and the Membership Service for reasons of efficiency or accuracy
Membership Monitor
– This service registers itself with the Membership Service to receive changes to the membership state

Activation & Registration
Activation:
– createCoordinationContext returns coordinationContext
Registration:
– requestMembership (serviceURI, coordContext, port for probe)
Other methods:
– memberProbe, memberAlive, memberLeaves
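The activation and registration steps can be sketched as a small in-memory model. This is only an illustration under stated assumptions: the slide gives the operation names and the arguments of requestMembership, but not the message formats, so the classes, the context id format, and the example URIs below are hypothetical, and no SOAP messages are exchanged.

```python
import itertools
from dataclasses import dataclass, field


@dataclass(frozen=True)
class CoordinationContext:
    """Hypothetical stand-in for the context returned at activation."""
    context_id: str


@dataclass
class MembershipService:
    """In-memory toy; a real service would exchange web-service messages."""
    members: dict = field(default_factory=dict)  # service URI -> probe port
    left: set = field(default_factory=set)       # URIs that exited gracefully

    def request_membership(self, service_uri, context, probe_port):
        # Mirrors requestMembership(serviceURI, coordContext, port for probe).
        self.members[service_uri] = probe_port

    def member_leaves(self, service_uri):
        # Mirrors memberLeaves: a graceful exit, not a detected failure.
        if self.members.pop(service_uri, None) is not None:
            self.left.add(service_uri)


class CoordinationService:
    """Receives activation and membership requests and routes them on."""

    def __init__(self, membership_service):
        self._membership = membership_service
        self._ids = itertools.count(1)
        self._contexts = set()

    def create_coordination_context(self):
        # Mirrors createCoordinationContext; the id format is an assumption.
        context = CoordinationContext(f"ctx-{next(self._ids)}")
        self._contexts.add(context)
        return context

    def request_membership(self, service_uri, context, probe_port):
        if context not in self._contexts:
            raise ValueError("unknown coordination context")
        self._membership.request_membership(service_uri, context, probe_port)


membership = MembershipService()
coordinator = CoordinationService(membership)
ctx = coordinator.create_coordination_context()
coordinator.request_membership("http://example.org/app1", ctx, 8081)
coordinator.request_membership("http://example.org/app2", ctx, 8082)
membership.member_leaves("http://example.org/app2")
```

In this sketch the Coordination Service only validates the context and forwards the request, which matches its role of routing requests to the Membership Service.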

See Fig. 1 for the activation & registration sequence
Change App3 → App2

Epidemic Membership Service (EMS)
Each participant holds a list of known peers
Eventual consistency
Best for loosely coupled, asynchronous systems
Operational details (Figs. 2, 3): gossip received + local membership state → new membership state
Gossip: if Membership fails, all members are marked failed.
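The rule "gossip received + local membership state → new membership state" can be illustrated with a merge function. The slides do not give the actual merge rule, so this sketch assumes a common choice from gossip-style failure detectors: each state maps a member URI to the highest heartbeat seen, and the merge keeps the larger value per member. The function name and URIs are hypothetical.

```python
def merge_gossip(local, received):
    """Return a new state keeping the highest heartbeat seen per member.

    Assumption: states are dicts of member URI -> heartbeat counter.
    """
    merged = dict(local)  # copy, so the caller's state is not mutated
    for uri, heartbeat in received.items():
        if heartbeat > merged.get(uri, -1):
            merged[uri] = heartbeat
    return merged


local_state = {"urn:a": 5, "urn:b": 2}
gossip = {"urn:b": 4, "urn:c": 1}
new_state = merge_gossip(local_state, gossip)
```

Because the merge takes a per-member maximum, applying the same gossip twice or in a different order gives the same result, which is what lets participants converge without coordination (eventual consistency).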

Features of EMS
Strong mathematical underpinning allows us to compute the probability of mistakes
Communication techniques used to exchange messages are highly robust
Membership exchanges between members are asynchronous
Participants are able to make decisions autonomously about failures of other participants

Types of information through gossip
Members. This is the list of the Member Service URIs that are registered and are active. This information set includes a logical timestamp of when it was last updated.
Joined. A list of Member Services that have recently registered, each with the logical timestamp of the moment of registration.
Left. When a Member Service gracefully exits, it should send a MemberLeaves indication to the Membership Service it has registered with. This will remove the member from the Members list and place it in the Left set, annotated with the logical timestamp.
Failed. After a member has been detected as failed, it is removed from the Members set and placed in this set, annotated with the logical timestamp.
Suspected. An option at Activation time is to specify a threshold that would mark a member as suspected, before it is marked failed.
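The five information sets and the transitions between them can be sketched as a small state object. Several details here are assumptions rather than part of the slides: a single integer counter serves as the logical clock, timestamps are kept per member, and a suspected member stays in Members until it is marked failed. All names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class MembershipState:
    """Toy model of the gossiped sets; each dict maps URI -> logical timestamp."""
    clock: int = 0
    members: dict = field(default_factory=dict)
    joined: dict = field(default_factory=dict)
    left: dict = field(default_factory=dict)
    failed: dict = field(default_factory=dict)
    suspected: dict = field(default_factory=dict)

    def _tick(self):
        # Assumed logical clock: a counter advanced on every change.
        self.clock += 1
        return self.clock

    def join(self, uri):
        stamp = self._tick()
        self.members[uri] = stamp
        self.joined[uri] = stamp

    def leave(self, uri):
        # Graceful exit (MemberLeaves): Members -> Left.
        if uri in self.members:
            del self.members[uri]
            self.suspected.pop(uri, None)
            self.left[uri] = self._tick()

    def suspect(self, uri):
        # Threshold crossed: flagged, but still counted as a member.
        if uri in self.members:
            self.suspected[uri] = self._tick()

    def fail(self, uri):
        # Detected failure: Members -> Failed.
        if uri in self.members:
            del self.members[uri]
            self.suspected.pop(uri, None)
            self.failed[uri] = self._tick()


state = MembershipState()
for uri in ("urn:a", "urn:b", "urn:c"):
    state.join(uri)
state.leave("urn:b")
state.suspect("urn:c")
state.fail("urn:c")
```

Keeping Left and Failed as separate sets preserves the distinction between a graceful exit and a detected failure, which a Membership Monitor may want to treat differently.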

Operational Details
EMS builds on epidemic techniques developed in the context of the Xerox Clearinghouse project
Each participant maintains a list of known peers
Periodically, each participant updates a heartbeat counter and sends a message to its peers
Push-pull model instead of just push
Gossip model
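A push-pull exchange differs from push-only gossip in that both sides end the round with each other's information. The sketch below illustrates that idea with in-process objects instead of network messages; the Peer class, its method names, and the heartbeat table layout are assumptions made for illustration.

```python
class Peer:
    """Toy participant holding the highest heartbeat seen for each known peer."""

    def __init__(self, name):
        self.name = name
        self.heartbeats = {name: 0}

    def beat(self):
        # Periodic local update of this peer's own heartbeat counter.
        self.heartbeats[self.name] += 1

    def _absorb(self, table):
        # Keep the larger heartbeat per peer (assumed merge rule).
        for name, beat in table.items():
            if beat > self.heartbeats.get(name, -1):
                self.heartbeats[name] = beat

    def push_pull(self, other):
        # Snapshot both tables first so the exchange is symmetric.
        outgoing = dict(self.heartbeats)
        reply = dict(other.heartbeats)
        other._absorb(outgoing)  # push: the partner learns our view
        self._absorb(reply)      # pull: we learn the partner's view


a, b, c = Peer("a"), Peer("b"), Peer("c")
a.beat()
a.beat()
b.beat()
a.push_pull(b)  # a and b now know about each other
b.push_pull(c)  # c learns about a indirectly, through b
```

After the two rounds, c holds a's heartbeat without ever having contacted a, which is the epidemic spreading effect; a has not yet heard of c and would learn of it in a later round.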

Fault model?
How would you use EMS to realize a fault model for your system?