By Werner Vogels and Chris Re

Presentation transcript:

This presentation is based on WS-Membership: Failure Management in Web Services World, by Werner Vogels and Chris Re. 4/15/2019

Introduction
An important factor in the successful deployment of federated web-services-based business activities will be the ability to guarantee reliable distributed operation and execution.
Failure management is essential for systems constructed out of web services on the network.
WS-Membership is a coordination service: a generic web-service interface for tracking registered web services and for providing membership monitoring information.
A prototype membership service based on epidemic protocol techniques has been implemented.
Context: the Obduro project, which focuses on globally scalable distributed systems based on web-service technologies.

WS-Membership
Coordination protocol: tracks registered members and presents membership updates to monitors.
Services: failure detection and membership dissemination.

Component services
Epidemic communication
State management
Development of advanced distributed services in the context of the web-services Coordination framework. These services will include a failure management service, a consensus service and a lightweight distributed state-sharing engine.

Five Roles Modeled
Coordination Service: receives activation and membership requests and routes them to the Membership Service.
Membership Service: provides failure detection of registered web services and disseminates membership information.

Five roles (contd.)
Member Service: a software component that has registered itself for failure detection, either directly with a Membership Service or through a Membership Proxy.
Membership Proxy: a software component that is interposed between a Member Service and the Membership Service for reasons of efficiency or accuracy.
Membership Monitor: a service that registers itself with the Membership Service to receive changes to the membership state.

Activation & Registration
Activation: createCoordinationContext returns a coordinationContext.
Registration: requestMembership(serviceURI, coordContext, port for probe).
Other methods: memberProbe, memberAlive, memberLeaves.
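
The activation and registration calls named on this slide can be pictured with a small in-memory sketch. It is only an illustration under stated assumptions: the Python method names mirror createCoordinationContext, requestMembership, memberAlive and memberLeaves, but the signatures, the dictionary used as a context, and the `heartbeats` field are hypothetical and are not taken from the WS-Membership specification.

```python
import itertools


class MembershipService:
    """Toy, in-memory stand-in for a Membership Service (hypothetical)."""

    def __init__(self):
        self._next_id = itertools.count(1)
        self.members = {}   # serviceURI -> registration record
        self.left = set()   # serviceURIs that exited gracefully

    def create_coordination_context(self):
        # Activation: hand back a context that later calls refer to.
        # The single "context_id" field is an assumption for illustration.
        return {"context_id": next(self._next_id)}

    def request_membership(self, service_uri, coord_context, probe_port):
        # Registration: record the member and the port it can be probed on.
        self.left.discard(service_uri)
        self.members[service_uri] = {
            "context": coord_context,
            "probe_port": probe_port,
            "heartbeats": 0,
        }

    def member_alive(self, service_uri):
        # A member reports that it is still running.
        self.members[service_uri]["heartbeats"] += 1

    def member_leaves(self, service_uri):
        # Graceful exit: move the member out of the active set.
        if self.members.pop(service_uri, None) is not None:
            self.left.add(service_uri)
```

A member would first obtain a context, then register with its URI and probe port, and later either keep reporting that it is alive or announce that it leaves.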

See Fig.1 for activation & registration sequence Change App3  App2

Epidemic Membership Service (EMS)
Each participant holds a list of known peers.
Eventual consistency.
Best for loosely coupled, asynchronous systems.
Operational details (Figs. 2 and 3): the gossip received is merged with the local membership state to give the new membership state.
Gossip: <app id, heartbeat counter, timestamp>
If a Membership Service fails, all members registered with it are marked failed.
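
The merge step (gossip received + local membership state) can be sketched as follows. Figs. 2 and 3 are not reproduced in this transcript, so the rule used here is an assumption: it is the common gossip-style rule of keeping, per app id, the entry with the higher heartbeat counter and stamping it with the local clock. The function name `merge_gossip` and the dictionary layout are hypothetical.

```python
def merge_gossip(local, received, now):
    """Merge a received gossip message into the local membership state.

    Both arguments map app id -> (heartbeat counter, timestamp).
    Assumed rule: a higher heartbeat counter wins, and an updated entry
    is stamped with the local time `now`, not the sender's timestamp.
    """
    merged = dict(local)
    for app_id, (heartbeat, _sender_time) in received.items():
        known = merged.get(app_id)
        if known is None or heartbeat > known[0]:
            merged[app_id] = (heartbeat, now)
    return merged
```

Because every participant applies the same rule to whatever gossip it happens to receive, the lists of known peers converge over time, which is the eventual consistency the slide refers to.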

Types of information through gossip
Members. This is the list of the Member Service URIs that are registered and are active. This information set includes a logical timestamp of when it was last updated.
Joined. A list of Member Services that have recently registered, each with the logical timestamp of the moment of registration.
Left. When a Member Service gracefully exits, it should send a MemberLeaves indication to the Membership Service it has registered with. This will remove the member from the Members list and place it in the Left set, annotated with the logical timestamp.
Failed. After a member has been detected as failed it is removed from the Members set and placed in this set, annotated with the logical timestamp.
Suspected. An option at Activation time is to specify a threshold that would mark a member as suspected, before it is marked failed.
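
How the Members, Suspected and Failed sets relate can be illustrated with a short sweep over the last-heard timestamps. This is a sketch under assumptions, not the protocol's definition: the thresholds `suspect_after` and `fail_after`, the function name `sweep`, and the choice to keep a suspected member in Members until it is marked failed are all hypothetical.

```python
def sweep(last_heard, now, suspect_after, fail_after):
    """Split members into Members, Suspected and Failed sets.

    last_heard maps serviceURI -> logical timestamp of the last update.
    Returns three dicts; Suspected and Failed entries are annotated
    with the logical timestamp `now` at which they were classified.
    """
    members, suspected, failed = {}, {}, {}
    for uri, heard_at in last_heard.items():
        silence = now - heard_at
        if silence >= fail_after:
            # Detected as failed: removed from Members, placed in Failed.
            failed[uri] = now
        else:
            members[uri] = heard_at
            if silence >= suspect_after:
                # Silent for a while, but not yet long enough to fail.
                suspected[uri] = now
    return members, suspected, failed
```

With `suspect_after` smaller than `fail_after`, a silent member passes through Suspected before it reaches Failed, matching the order described on the slide.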

Fault model? How would you use EMS to realize a fault model for your system?