November 19 2004 - NC state university Group Communication Specifications Gregory V Chockler, Idit Keidar, Roman Vitenberg Presented by – Jyothish S Varma.

Slides:



Advertisements
Similar presentations
1 Process groups and message ordering If processes belong to groups, certain algorithms can be used that depend on group properties membership create (
Advertisements

Prof R. Guerraoui Distributed Programming Laboratory
Reliable Communication in the Presence of Failures Kenneth Birman, Thomas Joseph Cornell University, 1987 Julia Campbell 19 November 2003.
DISTRIBUTED SYSTEMS II FAULT-TOLERANT BROADCAST Prof Philippas Tsigas Distributed Computing and Systems Research Group.
Lab 2 Group Communication Andreas Larsson
Distributed Systems Fall 2010 Replication Fall 20105DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Virtual Synchrony Jared Cantwell. Review Multicast Causal and total ordering Consistent Cuts Synchronized clocks Impossibility of consensus Distributed.
Virtual Synchrony Ki Suh Lee Some slides are borrowed from Ken, Jared (cs ) and Justin (cs )
Algorithm for Virtually Synchronous Group Communication Idit Keidar, Roger Khazan MIT Lab for Computer Science Theory of Distributed Systems Group.
Group Communications Group communication: one source process sending a message to a group of processes: Destination is a group rather than a single process.
Transis Efficient Message Ordering in Dynamic Networks PODC 1996 talk slides Idit Keidar and Danny Dolev The Hebrew University Transis Project.
EEC 688/788 Secure and Dependable Computing Lecture 12 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Multicast Protocols Jed Liu 28 February Introduction  Recall Atomic Broadcast:  All correct processors receive same set of messages.  All messages.
Group Communication Phuong Hoai Ha & Yi Zhang Introduction to Lab. assignments March 24 th, 2004.
1 Principles of Reliable Distributed Systems Lecture 5: Failure Models, Fault-Tolerant Broadcasts and State-Machine Replication Spring 2005 Dr. Idit Keidar.
Scalable Group Communication for the Internet Idit Keidar MIT Lab for Computer Science Theory of Distributed Systems Group The main part of this talk is.
© Chinese University, CSE Dept. Distributed Systems / Distributed Systems Topic 9: Time, Coordination and Replication Dr. Michael R. Lyu Computer.
EEC 693/793 Special Topics in Electrical Engineering Secure and Dependable Computing Lecture 13 Wenbing Zhao Department of Electrical and Computer Engineering.
Distributed Systems Fall 2009 Replication Fall 20095DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
EEC 688/788 Secure and Dependable Computing Lecture 13 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Group Communication Robbert van Renesse CS614 – Tuesday Feb 20, 2001.
1 Idit Keidar MIT Lab for Computer Science Theory of Distributed Systems Group Paradigms for Building Distributed Systems: Performance Measurements and.
1 A Framework for Highly Available Services Based on Group Communication Alan Fekete Idit Keidar University of Sidney MIT.
Transis 1 Fault Tolerant Video-On-Demand Services Tal Anker, Danny Dolev, Idit Keidar, The Transis Project.
Lab 1 Bulletin Board System Farnaz Moradi Based on slides by Andreas Larsson 2012.
1 A Modular Approach to Fault-Tolerant Broadcasts and Related Problems Author: Vassos Hadzilacos and Sam Toueg Distributed Systems: 526 U1580 Professor:
CH2 System models.
TOTEM: A FAULT-TOLERANT MULTICAST GROUP COMMUNICATION SYSTEM L. E. Moser, P. M. Melliar Smith, D. A. Agarwal, B. K. Budhia C. A. Lingley-Papadopoulos University.
SPREAD TOOLKIT High performance messaging middleware Presented by Sayantam Dey Vipin Mehta.
Chapter 14 Asynchronous Network Model by Mikhail Nesterenko “Distributed Algorithms” by Nancy A. Lynch.
Total Order Broadcast and Multicast Algorithms: Taxonomy and Survey (Paper by X. Défago, A. Schiper, and P. Urbán) ACM computing Surveys, Vol. 36,No 4,
Lab 2 Group Communication Farnaz Moradi Based on slides by Andreas Larsson 2012.
CSE 486/586, Spring 2013 CSE 486/586 Distributed Systems Replication with View Synchronous Group Communication Steve Ko Computer Sciences and Engineering.
Farnaz Moradi Based on slides by Andreas Larsson 2013.
Group Communication Group oriented activities are steadily increasing. There are many types of groups:  Open and Closed groups  Peer-to-peer and hierarchical.
Farnaz Moradi Based on slides by Andreas Larsson 2013.
Dealing with open groups The view of a process is its current knowledge of the membership. It is important that all processes have identical views. Inconsistent.
Dealing with open groups The view of a process is its current knowledge of the membership. It is important that all processes have identical views. Inconsistent.
CSE 486/586, Spring 2012 CSE 486/586 Distributed Systems Replication Steve Ko Computer Sciences and Engineering University at Buffalo.
Replication (1). Topics r Why Replication? r System Model r Consistency Models – How do we reason about the consistency of the “global state”? m Data-centric.
Copyright © George Coulouris, Jean Dollimore, Tim Kindberg This material is made available for private study and for direct.
Totally Ordered Broadcast in the face of Network Partitions [Keidar and Dolev,2000] INF5360 Student Presentation 4/3-08 Miran Damjanovic
EEC 688/788 Secure and Dependable Computing Lecture 10 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
CSE 60641: Operating Systems Implementing Fault-Tolerant Services Using the State Machine Approach: a tutorial Fred B. Schneider, ACM Computing Surveys.
Scalable Group Communication for the Internet Idit Keidar MIT Lab for Computer Science Theory of Distributed Systems Group.
The Totem Single-Ring Ordering and Membership Protocol Y. Amir, L. E. Moser, P. M Melliar-Smith, D. A. Agarwal, P. Ciarfella.
Building Dependable Distributed Systems, Copyright Wenbing Zhao
Replication and Group Communication. Management of Replicated Data FE Requests and replies C Replica C Service Clients Front ends managers RM FE RM Instructor’s.
Group Communication Theresa Nguyen ICS243f Spring 2001.
Relying on Safe Distance to Achieve Strong Partitionable Group Membership in Ad Hoc Networks Authors: Q. Huang, C. Julien, G. Roman Presented By: Jeff.
Ordering in online games Objectives – Understand the ordering requirements of gaming – Realise how ordering may be achieved – Be able to relate ordering.
Indirect Communication Indirect Communication is defined as communication between entities in DS through intermediary with no direct coupling b/w sender.
NTT - MIT Research Collaboration — Bi-Annual Report, July 1—December 31, 1999 MIT : Cooperative Computing in Dynamic Environments Nancy Lynch, Idit.
Group Communication A group is a collection of users sharing some common interest.Group-based activities are steadily increasing. There are many types.
Distributed Systems Lecture 7 Multicast 1. Previous lecture Global states – Cuts – Collecting state – Algorithms 2.
Chapter 8 Fault Tolerance. Outline Introductions –Concepts –Failure models –Redundancy Process resilience –Groups and failure masking –Distributed agreement.
Distributed Computing Systems Replication Dr. Sunny Jeong. Mr. Colin Zhang With Thanks to Prof. G. Coulouris,
Replication Chapter Katherine Dawicki. Motivations Performance enhancement Increased availability Fault Tolerance.
EEC 688/788 Secure and Dependable Computing Lecture 10 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Replication & Fault Tolerance CONARD JAMES B. FARAON
Algorithm for Virtually Synchronous Group Communication
ACM Transactions on Information and System Security, November 2001
Active replication for fault tolerance
EEC 688/788 Secure and Dependable Computing
Indirect Communication Paradigms (or Messaging Methods)
Indirect Communication Paradigms (or Messaging Methods)
EEC 688/788 Secure and Dependable Computing
EEC 688/788 Secure and Dependable Computing
Group Service in CORBA Xing Gang Supervisor: Prof. Michael R. Lyu
COT 5611 Operating Systems Design Principles Spring 2014
Presentation transcript:

November NC state university Group Communication Specifications Gregory V Chockler, Idit Keidar, Roman Vitenberg Presented by – Jyothish S Varma

November NC state university Outline Group communication – background  Safety properties Membership service Multicast service  Safe messages  Ordering and reliability properties  Liveness

November NC state university What is Group Communication? What is a group?  A group is a collection of processes that cooperate to provide a service Group Communication is a means for providing multi-point to multi-point communication, by organizing processes in groups.

November NC state university Distributed applications Highly available servers  Web  Video-on-Demand Collaborative computing  Shared white-board, shared editor, etc.  Military command and control  On-line strategy games Stock market

November NC state university Important Issues in Building Distributed Applications Consistency of view  Same picture of game, same shared file Fault tolerance, high availability Performance  Conflicts with consistency? Scalability  Topology - WAN, long unpredictable delays  Number of participants

November NC state university Objectives of this paper Formulate a comprehensive set of specifications Combine representations of most existing GCS Correlate terminology

November NC state university Group Communication Group abstraction - a group of processes is one logical entity Dynamic Groups (join, leave, crash) Systems: Ensemble, Horus, ISIS, Newtop, Psync, Sphynx, Relacs, RMP, Totem, Transis G Send(G)

November NC state university Model Asynchronous message passing network Allows partitions and merges View  Set of process that belongs to a group

November NC state university General Framework Signature of GCS service  crash(p) : process p crashes  recover(p) : process p recovers  send(p,m) : p sends message m  recv(p,m) : p receives message m  view_change(p,(id,members),T) : p receives message that new view is id, consisting of processes in members. T is the transitional set.

November NC state university General Framework Join Group address expansion Multicast communication Group send Fail Group membership management Leave Process group

November NC state university Membership service – Safety properties Assumptions  All live process in a single group  No process leave or join the system  Membership changes when process crash/recover

November NC state university External actions of GCS

November NC state university Membership service Ideal membership service  Practically impossible due to unreliable asynchronous network Membership service  Monitors status of all processes and links  Keeps group members up to date through view_chg events  Each view labeled by value from a totally ordered set.  A process installs a view when it receives view_chg with that view

November NC state university Membership service – Basic properties Self Inclusion  A process is a member of any view it installs Local Monotonicity  The views that a process installs increase in time Initial View Event  Every event at a process occurs in some view

November NC state university Partitionable Vs Primary Component Membership service Partitionable  A membership service that allows multiple active groups Primary component  A membership service that allows only one active group  The set of views installed by all the processes can be totally ordered  For consecutive views in the ordering, there must be a process in smaller view which installed the larger view.

November NC state university Partitionable GCS

November NC state university Multicast service – Safety properties Basic properties  Delivery integrity For every receive event, there is a preceding send event of the same message  No duplication No message is received twice

November NC state university Multicast service – Safety properties Sending view delivery  If a process receives a message in some view, then the message was sent in that view. Same view delivery  All processes which receive some message receive it in the same view

November NC state university Multicast service – Safety properties Sending view delivery  Minimizes the context information with each message.  Useful for applications for which message received in the same view is important  Limitation GCS sometimes blocks view changes when messages from the old view are delivered – to avoid this we use same view delivery which don’t care what view a message was sent in

November NC state university Multicast service – Safety properties Virtual Synchrony  If two processes in a view V install the same new view, then the processes receive the same messages in V This property avoids state transfers for a view change in some applications  Since these two processes received the same messages in the previous view, state transfer is not required between them in the new view

November NC state university Virtual Synchrony Group members all see events in same order  Events: messages, process crash/join Powerful abstraction for replication Framework for fault tolerance, high availability

November NC state university Ordering and reliability properties FIFO delivery  If a process sends two messages, then every process that receives both messages receives them in the order they were send. Reliable FIFO  If a process sends two messages, then any process that receives the latter messages receives the earlier messages first.

November NC state university Ordering and reliability properties Causal delivery  If a message m causally precedes m’, then every process that receives both messages receives m before m’ Reliable causal  If a message m causally precedes m’, and both are send in the same view, then any process that receives m’ receives m earlier

November NC state university Totally ordered multicast 3 types  Strong total order Ensures that messages are delivered in the same order at all the process  Weak total order For every pair of view V and V’, there is a total order f on the messages so that every process that installs V in V’ receives messages in V’ in an order consistent with f.  Reliable total order There is a total order f on the messages such that if m and m’ are two messages sent in the same view and m precedes m’ in f, then any process that receives m’ receives m earlier

November NC state university Liveness Stable component  Set of processes that are alive and connected to each other and link from any process in this set to any process outside is down Perfect failure detector  If it reports to any stable component S that their reachable set is S

November NC state university Liveness For every process p in stable component S, there exists a view V such that we have –  Membership precision P installs V as its last view  Multicast Liveness Every message p sends in V is received by every process in S

November NC state university Thank You Q ?