Evaluating the Running Time of a Communication Round over the Internet

Slides:



Advertisements
Similar presentations
Gossip and its application Presented by Anna Kaplun.
Advertisements

1 End to End Bandwidth Estimation in TCP to improve Wireless Link Utilization S. Mascolo, A.Grieco, G.Pau, M.Gerla, C.Casetti Presented by Abhijit Pandey.
Byzantine Generals Problem: Solution using signed messages.
1 Sources of Instability in Data Center Multicast Dmitry Basin Ken Birman Idit Keidar Ymir Vigfusson LADIS 2010.
Importance Sampling. What is Importance Sampling ? A simulation technique Used when we are interested in rare events Examples: Bit Error Rate on a channel,
Distributed systems Module 2 -Distributed algorithms Teaching unit 1 – Basic techniques Ernesto Damiani University of Bozen Lesson 3 – Distributed Systems.
 Idit Keidar, Principles of Reliable Distributed Systems, Technion EE, Spring Principles of Reliable Distributed Systems Lecture 7: Failure Detectors.
Efficient and Robust Computation of Resource Clusters in the Internet Efficient and Robust Computation of Resource Clusters in the Internet Chuang Liu,
Idit Keidar, Principles of Reliable Distributed Systems, Technion EE, Spring Principles of Reliable Distributed Systems Lecture 5: Synchronous Uniform.
Abstractions for Fault-Tolerant Distributed Computing Idit Keidar MIT LCS.
1 Idit Keidar MIT Lab for Computer Science Theory of Distributed Systems Group Paradigms for Building Distributed Systems: Performance Measurements and.
1 Failure Detectors: A Perspective Sam Toueg LIX, Ecole Polytechnique Cornell University.
Network Measurement Bandwidth Analysis. Why measure bandwidth? Network congestion has increased tremendously. Network congestion has increased tremendously.
Lecture 2-1 CS 425/ECE 428 Distributed Systems Lecture 2 Time & Synchronization Reading: Klara Nahrstedt.
1 Physical Clocks need for time in distributed systems physical clocks and their problems synchronizing physical clocks u coordinated universal time (UTC)
Timing-sync Protocol for Sensor Networks (TPSN) Presenter: Ke Gao Instructor: Yingshu Li.
A measurement study of vehicular internet access using in situ Wi-Fi networks Vladimir Bychkovsky, Bret Hull, Allen Miu, Hari Balakrishnan, and Samuel.
Evaluating the Running Time of a Communication Round over the Internet Omar Bakr Idit Keidar MIT MIT/Technion PODC 2002.
Parallel and Distributed Simulation Synchronizing Wallclock Time.
Distributed Systems Principles and Paradigms Chapter 05 Synchronization.
1 Evaluating NGI performance Matt Mathis
Scalable Group Communication for the Internet Idit Keidar MIT Lab for Computer Science Theory of Distributed Systems Group.
Network Coordinates : Internet Distance Estimation Jieming ZHU
Physical clock synchronization Question 1. Why is physical clock synchronization important? Question 2. With the price of atomic clocks or GPS coming down,
Network Computing Laboratory 1 Vivaldi: A Decentralized Network Coordinate System Authors: Frank Dabek, Russ Cox, Frans Kaashoek, Robert Morris MIT Published.
1 Transport Control Protocol for Wireless Connections ElAarag and Bassiouni Vehicle Technology Conference 1999.
Distributed Algorithms Dr. Samir Tartir Extracted from Principles of Concurrent and Distributed Programming, Second Edition By M. Ben-Ari.
On the Performance of Consensus Algorithms: Theory and Practice Idit Keidar Technion & MIT.
Lecture 13 – Network Mapping
Satellite TCP Lecture 19 04/10/02.
Lec # 22 Data Communication Muhammad Waseem Iqbal.
Vivaldi: A Decentralized Network Coordinate System
Distributed Computing
Time and Clock Primary standard = rotation of earth
Spanner: Google’s Globally-Distributed Database
Multipath QUIC: Design and Evaluation
University of Maryland
Distributed Transactions and Spanner
LECTURE 34: WEB PROGRAMMING FOR SCALE
Transport Layer Unit 5.
Distributed Garbage Collection
Time and Clock.
Logical time (Lamport)
LECTURE 32: WEB PROGRAMMING FOR SCALE
Introduction There are many situations in which we might use replicated data Let’s look at another, different one And design a system to work well in that.
Alternating Bit Protocol
Distributed Consensus
LECTURE 33: WEB PROGRAMMING FOR SCALE
Distributed Systems, Consensus and Replicated State Machines
Time and Clock.
EE 368C Project Multi-stream Audio Transmission with Path Diversity
Distributed Algorithms
Performance Evaluation of Computer Networks
Abstraction.
Chapter 5 (through section 5.4)
Exercises for Chapter 11: TIME AND GLOBAL STATES
Lecture 16, Computer Networks (198:552)
Physical clock synchronization
CDK: Sections 11.1 – 11.4 TVS: Sections 6.1 – 6.2
Performance Evaluation of Computer Networks
Logical time (Lamport)
Logical time (Lamport)
LECTURE 33: WEB PROGRAMMING FOR SCALE
TCP: Transmission Control Protocol Part II : Protocol Mechanisms
CSE 486/586 Distributed Systems Time and Synchronization
Logical time (Lamport)
Performance Evaluation of a Communication Round over the Internet
Last Class: Naming Name distribution: use hierarchies DNS
Brahim Ayari, Abdelmajid Khelil and Neeraj Suri
Presentation transcript:

Evaluating the Running Time of a Communication Round over the Internet Omar Bakr Idit Keidar MIT MIT/Technion PODC 2002

Communication Round Exchange of information from all hosts to all hosts Part of many distributed algorithms, systems consensus, atomic commit, replication, ...

Common Metric for Evaluating Algorithms Number of rounds (or steps) they require

Questions What is the best way to implement a communication round over the Internet decentralized vs. centralized How long is a communication round over the Internet?

Prediction is Hard Internet is unpredictable, diverse, … Different answers for different topologies, different times Different performance metrics local running time one host is engaged in algorithm overall running time from when first host starts to when last host finishes

“Communication Round” Primitive Initiated by some host Propagates data from every host to every other host connected to it

Example Implementations All-to-all Leader Secondary Leader

Experiment I 10 hosts: Taiwan, Korea, US academia, ISPs TCP/IP (connections always up) Algorithms: All-to-all Leader (initiator) Secondary leader (not initiator) periodically initiated at each host - 650 times over 3.5 days

Computing Overall Running Time Elapsed time from initiation (at initiator) until all hosts terminate Requires estimating clock differences Clocks not synchronized, drift We compute difference over short intervals Compute 3 different ways Achieve accuracy within 20 ms. on 90% of runs

Teaser: Comparing Performance Based on Number of Steps All-to-all: 2 Leader: 3 Secondary Leader: 4

Predicting Overall Runnig Times From MIT Ping-measured latencies (IP): Longest link latency 240 milliseconds Longest link to MIT 150 milliseconds 150+240 = 390 150+150+150 = 450

Measured Running Times Runs Initiated at MIT All-to-All Leader Overall Local Prediction 390 300 450 Average (runs under 2 sec) 811 295 541 335 % runs over 2 seconds 55% 3% 13% 6% Running times in milliseconds

What’s Going On? Loss rates on two links are very high 42% and 37% Taiwan to two ISPs in the US Loss rates on other links up to 8% Upon loss, TCP’s timeout is big More than round-trip-time All-to-all sends messages on lossy links Often delayed by loss

Distribution of Running Times Up to 1.3 sec. at MIT

Running Times Runs Initiated at Taiwan % runs over 2 seconds Average (runs under 2 sec) Sec. Leader overall local Leader All-to-all 7% 13% 43% 64% 24% 54% 607 679 844 1120 645 866 Running times in milliseconds

Distribution of Running Times in Taiwan

What’s Going On? Taiwan Good link Lossy link MIT Hosts with bad links to Taiwan Other Hosts Leader Secondary Leader All-to-all

Experiment II: Removing Taiwan Overall running times much better For every initiator and algorithm, less than 10% over 2 seconds (as opposed to 55% previously) All-to-all overall still worse than others! either Leader or Secondary Leader best, depending on initiator loss rates of 2% - 8% are not negligible all-to-all sends O(n2) messages; suffers But, all-to-all has best local running times

Probability of Delay due to Loss If all links would have same latency assume 1% loss on all links; 10 hosts (n=10) Leader sends 3(n-1) = 27 messages probability of at least one loss: 1 -.9927 » 24% All-2-all sends n(n-1) = 90 messages probability of at least one loss: 1 -.9990 » 60% In reality, links don’t have same latency only loss on long links matters

Conclusions Message loss causes high variation in TCP link latencies latency distribution has high variance, heavy tail Latency distribution determines expected time for receiving O(n) concurrent messages Secondary leader helps No triangle inequality, especially for loss Different for overall vs. local running times Number of rounds/steps not sufficient metric One-to-all and all-to-all have different costs