Evaluation of a Novel Two-Step Server Selection Metric Presented by Karthik Lakshminarayanan 11-26-2003.

Presentation transcript:

Evaluation of a Novel Two-Step Server Selection Metric Presented by Karthik Lakshminarayanan

Problem statement
Goal: Client wants to download content from the best of k servers, i.e. minimize the total time to transfer a document
Issues to consider:
  – Cost of choosing the target server
      Lightweight mechanisms preferable
  – Stability of ordering (over a period of time)
      More effort can be spent on selection if stability is high
  – Nature of content and corresponding workloads
      Frequency of downloads, and size of documents

Outline
Problem statement
Proposed algorithm
Existing/possible approaches
Methodology
Results

“Novel” two-step server selection
Pick the k best servers out of the entire set by using pings (k ~ 5)
Retain the subset for a period of n days
Choose servers from the subset of k servers
  – Choose from this subset randomly
  – Can choose from the subset based on other metrics
Call this Ping-twostep for convenience
Main delay is due to network delays, not server load
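The two steps on this slide can be sketched as follows. This is a minimal illustration, not the evaluated implementation: the class name `PingTwoStep`, the `measure_rtts` callback (assumed to return a mapping from server name to ping RTT in milliseconds), and the default `retain_days=7.0` are all hypothetical, since the slide leaves n unspecified.

```python
import random
import time


class PingTwoStep:
    """Sketch of two-step selection: ping-rank once, then pick randomly.

    All names and defaults here are illustrative assumptions.
    """

    def __init__(self, measure_rtts, k=5, retain_days=7.0,
                 clock=time.time, rng=None):
        self.measure_rtts = measure_rtts      # callable -> {server: rtt_ms}
        self.k = k                            # size of the retained subset
        self.retain_seconds = retain_days * 86400.0
        self.clock = clock                    # injectable for testing
        self.rng = rng or random.Random()
        self.subset = None                    # the k best servers by ping
        self.refreshed_at = None

    def _refresh_if_stale(self):
        now = self.clock()
        stale = (self.subset is None
                 or now - self.refreshed_at >= self.retain_seconds)
        if stale:
            # Step 1: ping every server and keep the k lowest RTTs.
            rtts = self.measure_rtts()
            self.subset = sorted(rtts, key=rtts.get)[:self.k]
            self.refreshed_at = now

    def select(self):
        # Step 2: choose uniformly at random from the retained subset.
        self._refresh_if_stale()
        return self.rng.choice(self.subset)
```

Random choice in step 2 is the variant the slide names first; a different per-download metric could be swapped in at that point without touching step 1.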

Selection metrics
Dynamic metrics (adapt to network conditions)
  – Ping
  – Transfer of small files
  – Ping-twostep
Static metrics (oblivious to network conditions)
  – Number of hops
  – Number of AS hops
  – Random
Summary: Ping-twostep performs best!

Methodology
Six client machines (USC, UNC, UCSC, UMass, UDel, Purdue)
193 servers in the tucows.com mirror network
Collected info continuously for 41 days
Each “run” comprised
  – 5 ICMP pings
  – Traceroute
  – Transfer times of files from 10 KB to 1 MB
More extensive set of servers than previous work

Comparison – Ping metric
RTT is not always an indication of transfer time
  – Not surprising!
Some oddities experienced with
  – UNC
  – Purdue
Relative positions of ping and the 10 KB transfer vary across nodes
Do not care about the low end of the bandwidth spectrum!

Comparison – Small file transfers
Improves with the size of the transfer
Low correlation between the time for a small transfer and the time for large transfers
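One way to quantify the "low correlation" claim is a correlation coefficient over paired measurements. The slide does not say which statistic was used, so the Pearson coefficient below is an assumption, and the function name `pearson` is hypothetical. Inputs would be the small-file and large-file transfer times recorded for the same server in the same run.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length samples (assumed statistic)."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two samples of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of co-deviations and squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return cov / math.sqrt(var_x * var_y)
```

A value near 1 would mean small transfers predict large ones well; a value near 0 matches the slide's observation.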

Comparison – Static selection
Hop count
  – Mostly equivalent to random selection when used to estimate transfer time
  – Little correlation (restricted to USA and Canada)
AS hop count
  – Does not work well for them
  – Global IP-Anycast (GIA) uses this
      Queried using BGP
      Small hop counts miss many servers, large hop counts would result in too much traffic

Stability of server ranking
70-98% of changes in rank are between zero and ten for top servers
Average servers experience much higher change in rank
Rankings of top servers are stable
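The "change in rank" statistic can be illustrated with a short sketch. It assumes each run yields a list of server names ordered best first, and that change is the absolute difference in position between two runs; the slide does not give the exact definition, and the function names `rank_changes` and `fraction_within` are hypothetical.

```python
def rank_changes(run_a, run_b):
    """Absolute change in position for each server present in both runs."""
    position_b = {server: i for i, server in enumerate(run_b)}
    return {
        server: abs(i - position_b[server])
        for i, server in enumerate(run_a)
        if server in position_b
    }


def fraction_within(changes, limit=10):
    """Fraction of rank changes that are at most `limit` positions."""
    values = list(changes)
    if not values:
        return 0.0
    return sum(1 for v in values if v <= limit) / len(values)
```

For example, `fraction_within(rank_changes(day1, day2).values())` gives the share of servers whose rank moved by ten positions or fewer between two runs.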

Stability of server transfer times
Consider different sizes of subsets of the 193 hosts
  – Number of top servers in an n-subset is a small fraction of the size of the subset (<10%)
Little overlap of top servers across clients
Consider a subset of servers
  – How many of them were ever at the top in the 41-day period?
Caveat: they consider only the “top” server

Ping-random
Motivation revisited:
  – Ping technique
      Low overhead
      Good performance
  – Top servers stable over time
Choosing from the small subset:
  – Random – provides load balance
  – Ping – use ping again among that set
  – Ping-best (for comparison)

Performance of Ping-random
Ping-ping >~ Ping-random > (10k, Ping)
Ping-ping might not balance load well

Effect of size of ping sets
Influenced greatly by the size of ping sets chosen
40% of servers ever ranked first were in 20% of the pings

Effects of selection algorithms
Load-balancing
  – Different clients have different top servers
Oscillations
  – Respond to changing network conditions
  – “Fortunately, it is unlikely that many clients would be running tests at the same time”
No quantitative results!

Discussion
How do we use this in practice?
  – Useful for large file transfers
  – What about small web transfers?
      GNP, GeoPing approaches might work
  – Set of servers is static?
How can DHTs help in anycast?
  – DOLR network for proximity
  – Embed location information in IDs
  – Use longest-prefix matching tricks (like i3)