Performance Analysis of a Parallel Downloading Scheme from Mirror Sites Throughout the Internet Allen Miu, Eugene Shih 6.892 Class Project December 3,

Slides:



Advertisements
Similar presentations
A Digital Fountain Approach to Reliable Distribution of Bulk Data
Advertisements

Network Resource Broker for IPTV in Cloud Computing Lei Liang, Dan He University of Surrey, UK OGF 27, G2C Workshop 15 Oct 2009 Banff,
Efficient Event-based Resource Discovery Wei Yan*, Songlin Hu*, Vinod Muthusamy +, Hans-Arno Jacobsen +, Li Zha* * Chinese Academy of Sciences, Beijing.
CHAPTER 15 WEBPAGE OPTIMIZATION. LEARNING OBJECTIVES How to test your web-page performance How browser and server interactions impact performance What.
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
Autonomic Systems Justin Moles, Winter 2006 Enabling autonomic behavior in systems software with hot swapping Paper by: J. Appavoo, et al. Presentation.
Jump to first page A. Patwardhan, CSE Digital Fountains Main Ideas : n Distribution of bulk data n Reliable multicast, broadcast n Ideal digital.
Web Server Benchmarking Using the Internet Protocol Traffic and Network Emulator Carey Williamson, Rob Simmonds, Martin Arlitt et al. University of Calgary.
Distributed Logging in Java with Constrained Resource Usage Sunil Brown Varghese, Daniel Andresen Dept. of Computing and Information Sciences Kansas State.
2002/08/15 Using Dynamic Delay Pools for Bandwidth Management Gihan Dias and Chamara Gunaratne University of Moratuwa.
Dynamic Adaptive Streaming over HTTP2.0. What’s in store ▪ All about – MPEG DASH, pipelining, persistent connections and caching ▪ Google SPDY - Past,
What’s the Problem Web Server 1 Web Server N Web system played an essential role in Proving and Retrieve information. Cause Overloaded Status and Longer.
Efficient and Flexible Parallel Retrieval using Priority Encoded Transmission(2004) CMPT 886 Represented By: Lilong Shi.
CStream: Neighborhood Bandwidth Aggregation For Better Video Streaming Thangam Vedagiri Seenivasan Advisor: Mark Claypool Reader: Robert Kinicki 1 M.S.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 OSI Transport Layer Network Fundamentals – Chapter 4.
Technical Architectures
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Introducing: Cooperative Library Presented August 19, 2002.
Web Server Administration
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
A Server-less Architecture for Building Scalable, Reliable, and Cost-Effective Video-on-demand Systems Presented by: Raymond Leung Wai Tak Supervisor:
Dynamic parallel access to replicated content in the Internet Pablo Rodriguez and Ernst W. Biersack IEEE/ACM Transactions on Networking, August 2002.
Wide Web Load Balancing Algorithm Design Yingfang Zhang.
A. Frank - P. Weisberg Operating Systems Introduction to Cooperating Processes.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Accessing Multiple Mirror Sites in Parallel: Using Tornado Codes to Speed Up Downloads John Byers, Boston University Michael Luby, Digital Fountain, Inc.
Web Server Load Balancing/Scheduling Asima Silva Tim Sutherland.
Path selection Packet scheduling and multipath Sebastian Siikavirta and Antti aalto.
Design and Implement an Efficient Web Application Server Presented by Tai-Lin Han Date: 11/28/2000.
Lecture 9 of Advanced Databases Storage and File Structure (Part II) Instructor: Mr.Ahmed Al Astal.
Experiences in Design and Implementation of a High Performance Transport Protocol Yunhong Gu, Xinwei Hong, and Robert L. Grossman National Center for Data.
Parallel Programming Models Jihad El-Sana These slides are based on the book: Introduction to Parallel Computing, Blaise Barney, Lawrence Livermore National.
Jaringan Komputer Dasar OSI Transport Layer Aurelio Rahmadian.
10/1/2015 9:14 PM1 TCP in Mobile Ad-hoc Networks ─ Split TCP CSE 6590.
Do incentives build robustness in BitTorrent? Michael Piatek, Tomas Isdal, Thomas Anderson, Arvind Krishnamurthy, Arun Venkataramani.
Parallel Access For Mirror Sites in the Internet Yu Cai.
Copyright, 1996 © Dale Carnegie & Associates, Inc. Presented by Hsiuling Hsieh Christine Liu.
An Efficient Approach for Content Delivery in Overlay Networks Mohammad Malli Chadi Barakat, Walid Dabbous Planete Project To appear in proceedings of.
MULTI-TORRENT: A PERFORMANCE STUDY Yan Yang, Alix L.H. Chow, Leana Golubchik Internet Multimedia Lab University of Southern California.
Computing Infrastructure for Large Ecommerce Systems -- based on material written by Jacob Lindeman.
Transport Layer3-1 Announcements r Collect homework r New homework: m Ch3#3,4,7,10,12,16,18-20,25,26,31,33,37 m Due Wed Sep 24 r Reminder: m Project #1.
HighSpeed TCP for High Bandwidth-Delay Product Networks Raj Kettimuthu.
Collaborate Lesson 4C / Slide 1 of 22 Collaborate Knowledge Byte In this section, you will learn about: The EJB timer service Message linking in EJB 2.1.
CPSC 404, Laks V.S. Lakshmanan1 External Sorting Chapter 13: Ramakrishnan & Gherke and Chapter 2.3: Garcia-Molina et al.
Data Sharing. Data Sharing in a Sysplex Connecting a large number of systems together brings with it special considerations, such as how the large number.
Empirical Quantification of Opportunities for Content Adaptation in Web Servers Michael Gopshtein and Dror Feitelson School of Engineering and Computer.
A P2P-Based Architecture for Secure Software Delivery Using Volunteer Assistance Purvi Shah, Jehan-François Pâris, Jeffrey Morgan and John Schettino IEEE.
Transport Layer3-1 TCP throughput r What’s the average throughout of TCP as a function of window size and RTT? m Ignore slow start r Let W be the window.
Processes CSCI 4534 Chapter 4. Introduction Early computer systems allowed one program to be executed at a time –The program had complete control of the.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
Distributed Systems CS Consistency and Replication – Part IV Lecture 21, Nov 10, 2014 Mohammad Hammoud.
WS-DREAM: A Distributed Reliability Assessment Mechanism for Web Services Zibin Zheng, Michael R. Lyu Department of Computer Science & Engineering The.
1 Adaptive Parallelism for Web Search Myeongjae Jeon Rice University In collaboration with Yuxiong He (MSR), Sameh Elnikety (MSR), Alan L. Cox (Rice),
Chapter 8 System Management Semester 2. Objectives  Evaluating an operating system  Cooperation among components  The role of memory, processor,
IPS Infrastructure Technological Overview of Work Done.
File Transfer And Access (FTP, TFTP, NFS). Remote File Access, Transfer and Storage Networks For different goals variety of approaches to remote file.
Distributed Server Scheduler Eyal Serero Alex Fishgate Supervisor : Vitaly Suchin.
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
Presented by Deepak Varghese Reg No: Introduction Application S/W for server load balancing Many client requests make server congestion Distribute.
Accelerating Peer-to-Peer Networks for Video Streaming
Quick UDP Internet Connections
Hybrid Cloud Architecture for Software-as-a-Service Provider to Achieve Higher Privacy and Decrease Securiity Concerns about Cloud Computing P. Reinhold.
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
Congestion Control, Internet transport protocols: udp
ECF: an MPTCP Scheduler to Manage Heterogeneous Paths
Multiple Path Connection through a Set of Connection Relay Servers
Distributed Systems CS
Algorithms for Selecting Mirror Sites for Parallel Download
Performance-Robust Parallel I/O
Presentation transcript:

Performance Analysis of a Parallel Downloading Scheme from Mirror Sites Throughout the Internet Allen Miu, Eugene Shih Class Project December 3, 1999

Overview Problem Statement Advantages/Disadvantages Operation of Paraloading Goals of Experiment Setup of Experiment Current Results Summary Questions

Problem Statement: Is “Paraloading” Good? Paraloading is the downloading from multiple mirror sites in parallel. Paraloader Mirror B Mirror C Mirror A

Advantages of Paraloading Performance is proportional to the realized aggregate bandwidth of the parallel connections Less prone to complete download failures compared to the single connection download Facilitates dynamic load balancing among parallel connections Facilitates reliable, out-of-order delivery (similar to Netscape)

Disadvantages of Paraloading Can be overly aggressive Consumes more server resources Overhead costs for scheduling, maintaining buffers, and sending block request messages Only effective when mirror servers are available

Step 1: Obtain Mirror List Paraloader Mirror List Mirror A Mirror B Mirror C Hard-coded DNS?

Step 2: Obtain File Length Paraloader Mirror A Mirror B Mirror C

Step 3: Send Block Requests Paraloader Mirror A Mirror B Mirror C

Step 4: Re-order Paraloader Mirror A Mirror B Mirror C

Step 5: Send Next Request Paraloader Mirror A Mirror B Mirror C

Goals of Experiment Main goal: To compare the performance of serial and parallel downloading To verify the results of Rodriguez et al. To examine whether varying the degree of parallelism, the number of mirror servers used, affects performance To gain experience with paraloading and to find out what issues are involved in designing efficient paraloading systems

Experiment Setup Implemented a paraloader application in Java, using HTTP1.1 (range-requests and persistent connections) Files are downloaded at MIT from 3 different sets (kernel, mars, tucows) of 7 mirror servers Degree of parallelism examined: M = 1, 3, 5, 7 Downloaded a 1MB and a 300KB file (S = 1MB, 300KB) in 1 hour intervals for 7 days Block Size = 32KB

Results Paraloading decreases download time over the average single connection case Speedup is far from optimal case (aggregate bandwidth) –Block request gaps result in wasted bandwidth Gaps are proportional to RTT –Congestion at client? Possible but unlikely.

S = 1MB

S - 763KS = 763KB, B = 30, M = 4

Acknowledgements Dave Anderson Dorothy Curtis Wendi Heinzelmann WIND Group

Questions

Summary of Contributions Implemented a paraloader Verified that paraloading indeed provides performance gain… sometimes –Increasing degree of parallelism improves overall performance Performance gains are not as good as those reported by Rodriguez et al.

Future Work Examine how block size affects performance gain Examine cost of paraloading Implement and test various optimization techniques Perform measurements at different client sites

Paraloading Will Not Be Effective In All Situations Clients should have enough “slack” bandwidth capacity to open more than one connection Parallel connections are bottleneck disjoint Target data on mirror servers is consistent and static Security and authentication services are installed where appropriate Data transport is reliable Mirror locations are quickly and easily obtained

Step-by-step Process of the Block Scheduling Paraloading Scheme 1. Obtain a list of mirror sites 2. Open a connection to a mirror server and obtain file length 3. Divide file length into blocks 4. Send a block request to each open connection 5. Wait for a response 6. Send a new block request to the first connection that finished downloading a block 7. Loop back to 5 until all blocks are retrieved

Paraloading is Not a Well-studied Concept Byers et al. proposed using Tornado codes to facilitate paraloading. Rodriguez et al. proposed the block scheduling paraloading scheme that is used in our project