Advanced Operating System

Slides:



Advertisements
Similar presentations
1 Process groups and message ordering If processes belong to groups, certain algorithms can be used that depend on group properties membership create (
Advertisements

Fault Tolerance CSCI 4780/6780. Reliable Group Communication Reliable multicasting is important for several applications Transport layer protocols rarely.
Brewer’s Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services Authored by: Seth Gilbert and Nancy Lynch Presented by:
Reliable Group Communication Quanzeng You & Haoliang Wang.
Ranveer Chandra , Kenneth P. Birman Department of Computer Science
Reliability on Web Services Presented by Pat Chan 17/10/2005.
Computer Science Lecture 18, page 1 CS677: Distributed OS Last Class: Fault Tolerance Basic concepts and failure models Failure masking using redundancy.
Virtual Synchrony Jared Cantwell. Review Multicast Causal and total ordering Consistent Cuts Synchronized clocks Impossibility of consensus Distributed.
Group Communications Group communication: one source process sending a message to a group of processes: Destination is a group rather than a single process.
Computer Science Lecture 17, page 1 CS677: Distributed OS Last Class: Fault Tolerance Basic concepts and failure models Failure masking using redundancy.
Chapter 7 Fault Tolerance Basic Concepts Failure Models Process Design Issues Flat vs hierarchical group Group Membership Reliable Client.
Distributed Systems CS Fault Tolerance- Part III Lecture 15, Oct 26, 2011 Majd F. Sakr, Mohammad Hammoud andVinay Kolar 1.
1 Principles of Reliable Distributed Systems Lecture 5: Failure Models, Fault-Tolerant Broadcasts and State-Machine Replication Spring 2005 Dr. Idit Keidar.
Multicast Transport Protocols: A Survey and Taxonomy Author: Katia Obraczka University of Southern California Presenter: Venkatesh Prabhakar.
Logical Clocks (2). Topics r Logical clocks r Totally-Ordered Multicasting r Vector timestamps.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Dec 4, 2007 Reliable Multicast Group Neelofer T. CMSC 621.
Group Communication A group is a collection of users sharing some common interest.Group-based activities are steadily increasing. There are many types.
1 8.3 Reliable Client-Server Communication So far: Concentrated on process resilience (by means of process groups). What about reliable communication channels?
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
7/26/ Design and Implementation of a Simple Totally-Ordered Reliable Multicast Protocol in Java.
Group Communication Group oriented activities are steadily increasing. There are many types of groups:  Open and Closed groups  Peer-to-peer and hierarchical.
Dealing with open groups The view of a process is its current knowledge of the membership. It is important that all processes have identical views. Inconsistent.
Hwajung Lee. A group is a collection of users sharing some common interest.Group-based activities are steadily increasing. There are many types of groups:
EEC 688/788 Secure and Dependable Computing Lecture 10 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Chap 7: Consistency and Replication
Fault Tolerance. Basic Concepts Availability The system is ready to work immediately Reliability The system can run continuously Safety When the system.
The Totem Single-Ring Ordering and Membership Protocol Y. Amir, L. E. Moser, P. M Melliar-Smith, D. A. Agarwal, P. Ciarfella.
Reliable Communication Smita Hiremath CSC Reliable Client-Server Communication Point-to-Point communication Established by TCP Masks omission failure,
V1.7Fault Tolerance1. V1.7Fault Tolerance2 A characteristic of Distributed Systems is that they are tolerant of partial failures within the distributed.
Chapter 11 Fault Tolerance. Topics Introduction Process Resilience Reliable Group Communication Recovery.
Building Dependable Distributed Systems, Copyright Wenbing Zhao
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Fault Tolerance (2). Topics r Reliable Group Communication.
Mutual Exclusion Algorithms. Topics r Defining mutual exclusion r A centralized approach r A distributed approach r An approach assuming an organization.
Group Communication A group is a collection of users sharing some common interest.Group-based activities are steadily increasing. There are many types.
EEC 688/788 Secure and Dependable Computing Lecture 10 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Reliable multicast Tolerates process crashes. The additional requirements are: Only correct processes will receive multicasts from all correct processes.
More on Fault Tolerance
Exercises for Chapter 11: COORDINATION AND AGREEMENT
Fault Tolerance Prof. Orhan Gemikonakli
Fault Tolerance Chap 7.
Fast Retransmit For sliding windows flow control we waited for a timer to expire before beginning retransmission of a packet TCP uses an additional mechanism.
Data Link Layer Flow Control.
EEC 688/788 Secure and Dependable Computing
DC7: More Coordination Chapter 11 and 14.2
Reliable group communication
湖南大学-信息科学与工程学院-计算机与科学系
Transport Layer Unit 5.
Reliable Multicast Group
CS412 Introduction to Computer Networking & Telecommunication
7.1. CONSISTENCY AND REPLICATION INTRODUCTION
Distributed Systems CS
Distributed Systems CS
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S
Distributed Systems CS
EEC 688/788 Secure and Dependable Computing
Active replication for fault tolerance
EEC 688/788 Secure and Dependable Computing
EEC 688/788 Secure and Dependable Computing
Seminar Mobilkommunikation Reliable Multicast in Wireless Networks
EEC 688/788 Secure and Dependable Computing
EEC 688/788 Secure and Dependable Computing
COT 5611 Operating Systems Design Principles Spring 2014
Error Checking continued
Last Class: Fault Tolerance
Sisi Duan Assistant Professor Information Systems
Presentation transcript:

Advanced Operating System COMMUNICATION Shrada Pradhan Advanced Operating System Csc CSC 8320

Content Flow Basic Reliable-Multicasting Schemes Scalability in Reliable Multicasting Nonhierarchical Feedback Control Hierarchical Feedback Control Atomic Multicast Virtual synchronous multicast

Introduction Services guarantee that messages are delivered to all members in a process group Reliable multicast services are important But reliable multicasting is not that easy.

Basic Reliable-Multicasting Schemes A reliable communication in the presence of faulty process If all the non-faulty group receive message. A reliable communication when processes are assumed to operate correctly Neither processes fail, nor process join or leave the group during transmission. All the group memebers should receive the messages.

Contd.. Weaker multicast Unreliable multicast Condition: The numbers of receivers are limited. Unreliable multicast Messages sent may be lost or not delivered to all the receivers.

Contd.. Solution for Unreliable Multicast Assigning a sequence member to each message Detect missing message Send negative acknowledgement, requesting for retransmission Sender retransmit the message.

Scalability in Reliable Multicasting Problem Cannot handle large number of receivers Senders may be swapped with feedback messages Solution Receiver sending acknowledgement only for missing messages. But overflows history buffer. Removing messages after certain time.

Scalable multicast Non hierarchial feedback control (NHFC) To reduce the number of feedback messages sent. Feedback supression model is used. Receiver sends negative acknowledgement as feedback. Receiver multicasts its feedback to the rest of the group. Schedules feedback messages with random time delay.

Scalable Multicast Hierarchial Feedback Control Large group of receivers Each group is partitioned into smaller subgroups in a tree Each subgroup has a local coordinator responsible for handling retransmission requests. Local coordinaters have own buffer history

Hierarchaial Feedback control

Contd.. Asks its parent subgroup if coordinator misses message. If message and acknowledgement received, clears its history buffer.

Limitation NHFC: HFC: Reasonable accurate scheduling is not easy. Other receivers are forced to receive useless messages. HFC: Construction of tree is not easy.

Automatic Multicast Message is either delivered to all or not essayed at all. Reliable multicast with process failures. Believes in group membership for correctly operating replicas.

Contd. If one process breaks then it leaves the group After repair can join the group And then update with the state of the rest of the members. This maintains non-faulty process maintains consistent view Forces reconciliation when replica recovers and rejoins the group

Atomatic Multicast

Virtual synchronous Reliable multicast property If sender of the message crashes, it is either delivered to remaining process or ignored.

Different Ordering Unordered Multicast FIFO Multicast No guarantee concerning order of delivery of messages FIFO Multicast Should deliver incoming messages from the same process in the same order For eg. M1 is always delivered before M2

Contd. Casually Ordered Multicast Totally Ordered Multicast Preserves potential casuality between messages Even though m1 preceeds m2, communication layer delivers m2 after m1. Totally Ordered Multicast Messages should be delivered in the same ordered to all group members.

Contd. Casually Ordered Multicast Totally Ordered Multicast Preserves potential casuality between messages Even though m1 preceeds m2, communication layer delivers m2 after m1. Totally Ordered Multicast Messages should be delivered in the same ordered to all group members.

Latest Relevant knowledge Optimistic Atomic Multicast[2] Is expensive in terms of message delays a protocol that combines reduced latency increased throughput Messages are delivered optimistically in a single communication step. Does not rely on spontaneous message ordering for fast delivery

Future Work Atomic multicasts if not occurred between database nodes Instead transaction ordering is exposed as a service that is provided by a dedicated set of nodes. Improves transaction throughput and latency [3] So studies can be done in such fields to enhace performance.

References 1. Andrew S. Tanenbaum , and Maarten Van Steen. Distributed Systems Principles and paradigms. Second Edition, 2007 2. Optimistic Atomic Multicast, Distributed Computing Systems (ICDCS), 2013 IEEE 33rd International Conference on 8-11 July 2013. 3. An Atomic-Multicast Service for Scalable In-Memory Transaction Systems, Cloud Computing Technology and Science (CloudCom), 2014 IEEE 6th International Conference on Date 15-18 Dec. 2014