1
Large Scale File Distribution: Sequential Branching Distribution
Final Presentation, Grad Operating Systems
Presented by Chris Miller & Pramita Mitra
Dec 13, 2006
2
Problem Statement
● Research requires distribution of large datasets on distributed networks
● Methods such as multicast are too complicated to implement reliably
● Tools available for file distribution: Chirp, Parrot
● Algorithm needed to efficiently schedule the distribution of files
3
Solution
● Using CCL storage pool as model of distributed network
● Using small, measured steps to find what aspects of distribution work best in implementation
Sequential distribution (diagram: Distributor, Stage 1, Stage 2, …, Stage n)
Parallel distribution (diagram: Distributor, Node 1, Node 2, …, Node n)
Drawbacks noted on the slide: inefficient use of network resources; total time for distribution is O(n) for both approaches.
4
Baseline Results
5
Sequential Branching Distribution
(diagram: Distributor, Stage 1, Stage 2, Stage 3; Thirdput; Node set)
Total time for distribution O(log₂ n)
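The logarithmic bound on this slide can be illustrated with a small scheduling sketch. It assumes a doubling scheme: in each stage, every node that already holds the file sends it to one node that does not. The function name `branching_schedule` and the node labels are hypothetical, and the slides do not confirm that this is the exact schedule used. Mapping each (source, target) pair onto a third-party transfer such as the Thirdput step named on the slide is likewise an assumption.

```python
def branching_schedule(nodes, distributor="distributor"):
    """Plan a doubling distribution (illustrative sketch, not the authors' code).

    Returns a list of stages; each stage is a list of (source, target)
    transfers that are assumed to run concurrently.
    """
    have = [distributor]      # nodes that already hold the file
    pending = list(nodes)     # nodes still waiting for the file
    stages = []
    while pending:
        stage = []
        # Snapshot the holders: only nodes that had the file when the
        # stage began may send during this stage.
        for source in list(have):
            if not pending:
                break
            target = pending.pop(0)
            stage.append((source, target))
            have.append(target)
        stages.append(stage)
    return stages


if __name__ == "__main__":
    for number, stage in enumerate(branching_schedule(range(1, 8)), start=1):
        print(f"Stage {number}: {stage}")
```

Under this assumption the number of holders doubles each stage, so n target nodes need ceil(log₂(n + 1)) stages, compared with n stages when the distributor sends to one node at a time.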
6
Best Neighbor Approximation
8
Probabilistic Weighted Average
9
Best Neighbor Approximation

Data File:                100MB                 250MB                 500MB                 1GB
File Size  Reduction in   Overhead   Net        Overhead   Net        Overhead   Net        Overhead   Net
           Transfer Time             Reduction             Reduction             Reduction             Reduction
1MB        16.64%         39.93%     -23.3%     15.97%     0.7%       7.99%      8.7%       3.99%      12.6%
2MB        28.44%         44.32%     -15.9%     17.73%     10.7%      8.86%      19.6%      4.43%      24.0%
3MB        29.12%         50.16%     -21.0%     20.07%     9.1%       10.03%     19.1%      5.02%      24.1%
4MB        23.20%         55.61%     -32.4%     22.24%     1.0%       11.12%     12.1%      5.56%      17.6%
5MB        27.39%         67.07%     -39.7%     26.83%     0.6%       13.41%     14.0%      6.71%      20.7%
Latency    16.59%         15.55%     1.0%       6.22%      10.4%      3.11%      13.5%      1.56%      15.0%
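The arithmetic behind the table can be checked with a short sketch. It relies on two relationships inferred from the published numbers rather than stated on the slide: the net reduction equals the reduction in transfer time minus the overhead, and the overhead percentage shrinks in inverse proportion to the data file size (with 1GB treated as 1000MB). The function names are hypothetical.

```python
def scaled_overhead(overhead_at_ref_pct, ref_size_mb, data_size_mb):
    """Scale a measured overhead percentage to another data file size.

    Assumes the absolute cost of the measurement is fixed, so its share
    of the total transfer shrinks as the data file grows.
    """
    return overhead_at_ref_pct * ref_size_mb / data_size_mb


def net_reduction(transfer_reduction_pct, overhead_pct):
    """Net benefit: time saved by the better choice minus the measurement cost."""
    return transfer_reduction_pct - overhead_pct


def net_reduction_row(transfer_reduction_pct, overhead_at_100mb_pct,
                      sizes_mb=(100, 250, 500, 1000)):
    """Recompute one table row from its first two measured values."""
    row = {}
    for size in sizes_mb:
        overhead = scaled_overhead(overhead_at_100mb_pct, 100, size)
        row[size] = round(net_reduction(transfer_reduction_pct, overhead), 1)
    return row


if __name__ == "__main__":
    # 1MB row: 16.64% reduction in transfer time, 39.93% overhead at 100MB
    print(net_reduction_row(16.64, 39.93))
```

With the 1MB inputs this yields -23.3, 0.7, 8.7 and 12.6 for the four data file sizes, matching the table row, which suggests the measurement only pays for itself once the data file is large relative to its cost.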
10
Results
11
Conclusions
● A fast and reliable distribution method is possible with simple file transfer methods
● Distribution system is fault tolerant for all nodes except the distributor node
● Latency measurement is a moderate indicator of transfer rate, with low overhead
● Small file transfer approximation is a strong indicator of transfer rate, with high overhead
● Performance is near O(log₂ n)