Ken Birman Cornell University. CS5410 Fall 2008..

Slides:



Advertisements
Similar presentations
Ranveer Chandra Ramasubramanian Venugopalan Ken Birman
Advertisements

Dr. Multicast for Data Center Communication Scalability Ymir Vigfusson Hussam Abu-Libdeh Mahesh Balakrishnan Ken Birman Cornell University Yoav Tock IBM.
Reliable Multicast for Time-Critical Systems Mahesh Balakrishnan Ken Birman Cornell University.
COS 461 Fall 1997 Replication u previous lectures: replication for performance u today: replication for availability and fault tolerance –availability:
Distributed Processing, Client/Server and Clusters
Reliable Distributed Systems Quorums. Quorum replication We developed a whole architecture based on our four-step recipe But there is a second major approach.
6.852: Distributed Algorithms Spring, 2008 Class 7.
Reliable Group Communication Quanzeng You & Haoliang Wang.
CS514: Intermediate Course in Operating Systems Professor Ken Birman Vivek Vishnumurthy: TA.
Computer Science Lecture 18, page 1 CS677: Distributed OS Last Class: Fault Tolerance Basic concepts and failure models Failure masking using redundancy.
Virtual Synchrony Jared Cantwell. Review Multicast Causal and total ordering Consistent Cuts Synchronized clocks Impossibility of consensus Distributed.
Virtual Synchrony Ki Suh Lee Some slides are borrowed from Ken, Jared (cs ) and Justin (cs )
Virtual Synchrony Scott Phung Nov 15, 2011 Some slides borrowed from Jared (‘09)
1 Sources of Instability in Data Center Multicast Dmitry Basin Ken Birman Idit Keidar Ymir Vigfusson LADIS 2010.
Ken Birman, Cornell University August 18/19, 20101Berkeley Workshop on Scalability.
Ken Birman Cornell University. CS5410 Fall
Computer Science Lecture 17, page 1 CS677: Distributed OS Last Class: Fault Tolerance Basic concepts and failure models Failure masking using redundancy.
CS514: Intermediate Course in Operating Systems Professor Ken Birman Vivek Vishnumurthy: TA.
CS514: Intermediate Course in Operating Systems Professor Ken Birman Vivek Vishnumurthy: TA.
Ken Birman Professor, Dept. of Computer Science.  Today’s cloud computing platforms are best for building “apps” like YouTube, web search  Highly elastic,
CS514: Intermediate Course in Operating Systems Professor Ken Birman Ben Atkin: TA Lecture 20 Nov. 2.
1 Principles of Reliable Distributed Systems Lecture 5: Failure Models, Fault-Tolerant Broadcasts and State-Machine Replication Spring 2005 Dr. Idit Keidar.
Scalable Low Overhead Delay Estimation Yossi Cohen Advance IP seminar.
Distributed Systems 2006 Group Membership * *With material adapted from Ken Birman.
CS514: Intermediate Course in Operating Systems Professor Ken Birman Ben Atkin: TA Lecture 21 Nov. 7.
Learning from the Past for Resolving Dilemmas of Asynchrony Paul Ezhilchelvan and Santosh Shrivastava Newcastle University England, UK.
CS 425 / ECE 428 Distributed Systems Fall 2014 Indranil Gupta (Indy) Lecture 18: Replication Control All slides © IG.
What Can Databases Do for Peer-to-Peer Steven Gribble, Alon Halevy, Zachary Ives, Maya Rodrig, Dan Suciu Presented by: Ryan Huebsch CS294-4 P2P Systems.
Advanced UDP Sockets© Dr. Ayman Abdel-Hamid, CS4254 Spring CS4254 Computer Network Architecture and Programming Dr. Ayman A. Abdel-Hamid Computer.
Computer Science Lecture 16, page 1 CS677: Distributed OS Last Class:Consistency Semantics Consistency models –Data-centric consistency models –Client-centric.
Dr. Multicast for Data Center Communication Scalability Ymir Vigfusson Hussam Abu-Libdeh Mahesh Balakrishnan Ken Birman Cornell University Yoav Tock IBM.
CS514: Intermediate Course in Operating Systems Professor Ken Birman Ben Atkin: TA Lecture 24: Nov. 16.
Databases with Scalable capabilities Presented by Mike Trischetta.
Communication (II) Chapter 4
A Randomized Error Recovery Algorithm for Reliable Multicast Zhen Xiao Ken Birman AT&T Labs – Research Cornell University.
SPREAD TOOLKIT High performance messaging middleware Presented by Sayantam Dey Vipin Mehta.
Consistent and Efficient Database Replication based on Group Communication Bettina Kemme School of Computer Science McGill University, Montreal.
Ensemble and Beyond Presentation to David Tennenhouse, DARPA ITO Ken Birman Dept. of Computer Science Cornell University.
The InetAddress Class A class for storing and managing internet addresses (both as IP numbers and as names). The are no constructors but “class factory”
CS5412: SHEDDING LIGHT ON THE CLOUDY FUTURE Ken Birman 1 Lecture XXV.
CS5412: HOW DURABLE SHOULD IT BE? Ken Birman 1 CS5412 Spring 2012 (Cloud Computing: Birman) Lecture XV.
2007/1/15http:// Lightweight Probabilistic Broadcast M2 Tatsuya Shirai M1 Dai Saito.
CS603 Fault Tolerance - Communication April 17, 2002.
CSE 60641: Operating Systems Implementing Fault-Tolerant Services Using the State Machine Approach: a tutorial Fred B. Schneider, ACM Computing Surveys.
The Totem Single-Ring Ordering and Membership Protocol Y. Amir, L. E. Moser, P. M Melliar-Smith, D. A. Agarwal, P. Ciarfella.
CS514: Intermediate Course in Operating Systems Professor Ken Birman Vivek Vishnumurthy: TA.
Slingshot: Time-Critical Multicast for Clustered Applications Mahesh Balakrishnan Stefan Pleisch Ken Birman Cornell University.
Spring 2003CS 4611 Replication Outline Failure Models Mirroring Quorums.
Chapter 7: Consistency & Replication IV - REPLICATION MANAGEMENT By Jyothsna Natarajan Instructor: Prof. Yanqing Zhang Course: Advanced Operating Systems.
Tempest: An Architecture for Scalable Time-Critical Services Mahesh Balakrishnan Amar Phanishayee Tudor Marian Professor Ken Birman.
Fault Tolerance (2). Topics r Reliable Group Communication.
Reliable Adaptive Lightweight Multicast Protocol Ken Tang, Scalable Network Technologies Katia Obraczka, UC Santa Cruz Sung-Ju Lee, Hewlett-Packard Laboratories.
CS514: Intermediate Course in Operating Systems Professor Ken Birman Vivek Vishnumurthy: TA.
Consensus, impossibility results and Paxos Ken Birman.
EEC 688/788 Secure and Dependable Computing Lecture 10 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
The consensus problem in distributed systems
Unity Networking.
Distributed Systems – Paxos
A Selective Retransmission Protocol for Multimedia on the Internet
Reliable Multicast Group
EECS 498 Introduction to Distributed Systems Fall 2017
PERSPECTIVES ON THE CAP THEOREM
EEC 688/788 Secure and Dependable Computing
Lecture 21: Replication Control
EEC 688/788 Secure and Dependable Computing
EEC 688/788 Secure and Dependable Computing
Computer Networks Protocols
Lecture 21: Replication Control
Lecture 6, Computer Networks (198:552)
Impact of transmission errors on TCP performance
Presentation transcript:

Ken Birman Cornell University. CS5410 Fall 2008.

Snapshot: Replication “models” By now we’re starting to see that “replication” comes in many flavors No model: UDP multicast (IPMC), Scalable Reliable Multicast, TCP. Often called “best effort” but not always clear what this really means. In practice, loss occurs on sockets, not network. SRM uses timesouts, NAKs, retransmission to recover from loss, but with timeout at the core, model is like TCP –weak semantics, State machine model (GMS views, Paxos). Needs strong determinism. No partitioning (split brain). Group membership confers strong semantics. Can’t guarantee termination (FLP) Even stronger: Byzantine (State Machines + malicious nodes), Transactional (for databases with ACID properties) Probabilistic: Ricochet, Gossip: Converge towards guarantees

Replication protocols TypeCapsule SummaryProsCons UDP multicastFast, pretty reliable unless overloaded. But not always supported (“fear of multicast”, WAN issues) Raw speed: send 1, get n-1 deliveries for free Router load, “n:1” effect (instability), no flow control SRM (Scalable Reliable Multicast) A reliable protocol that runs over UDP multicast, well known and fairly popular. eBay uses it internally. Uses UDP multicast for NAK, retransmissions Great when all goes well, but prone to sudden destabilization GMS view updtUsually 2-phase, hence “pretty fast”. Can’t partition (no split brain) State machine model applies Slower than UDP multicast, scales poorly VsyncHosted within GMS, like a reliable UDP multicast + view synchrony Like state machine but more flexibility User needs to take cs5410 first! And can it scale? PaxosLike GMS view update, several versions. One has a very elegant proof of safety State machine modelSlower than UDP multicast, scales poorly ByzantineThese assume that at most t of N members of the service are malicious. Trusts clients. State machine modelHardens service but not its clients RicochetSeeks rapid, probabilistically reliable deliveryVery stable, scalableNot as strong as vsync or state machine model TransactionsACID database guarantees (1-copy serializability)Famous modelVery poor scalability GossipConvergent probabilistic guarantees, constant overhead costs Very robust at constant (low) cost, scales well Too slow for some uses

Today: Ricochet Remainder of today’s lecture will look at Ricochet Time-critical multicast protocol May become a standard in Red Hat Linux and other data center / enterprise settings Great stability and scalability, quasi-realtime guarantees Paper in NSDI 2007 has details