Datacast: A Scalable and Efficient Reliable Group Data Delivery Service for Data Centers Jiaxin Cao, Chuanxiong Guo, Guohan Lu, Yongqiang Xiong, Yixin Zheng, Yongguang Zhang, Yibo Zhu, Chen Chen University of Science and Technology of China Microsoft Research Asia Tsinghua University University of California, Santa Barbara University of Pennsylvania
Reliable Group Data Delivery The problem of RGDD is: given a data source, Src,and a set of receivers, R 1, R 2, …, R n, how to reliably transmit bulk data from Src to all the receivers? In a data center network, Data
Reliable Group Data Delivery RGDD is important in DCNs: Bootstrapping or OS upgrading. Distributed file systems, e.g., GFS. VM setup. And more...
Reliable Group Data Delivery A good RGDD design should have the following properties: Scalable (large group numbers and large group sizes) High bandwidth efficiency
Existing solutions to RGDD Existing solutions can be classified into two categories: Reliable IP multicast.Not scalable, e.g., ACK implosion. End-host based overlays.Low bandwidth efficiency. None of the existing systems can perfectly achieve RGDD.
New opportunities in DCN Recently, there are two clear trends in DCN: Multiple edge-disjoint Steiner trees for RGDD. Practical packet caching abilities in network devices. We can cache packet!
The architecture of Datacast Fabric Manager Master i Master j Src R1R1 R2R2 IMD Src R1R1 R2R2 R3R3 R4R4 RGDD Group i 1 RGDD Group i 2 RGDD Group i n Network Topology How to calculate multiple Steiner trees? How to efficiently transmit data in each Steiner tree?
Multiple edge-disjoint Steiner trees in DCN Our multiple Steiner trees algorithm takes three steps: 1.Use specific algorithms to construct spanning trees. 2.Prune the spanning trees. 3.Use Breath First Search(BFS) to repair the trees broken by network failures. This algorithm is fast (O(k|V|) + O(|E|) + O(k|E|)) and efficient.
Datacast transport protocol Datacast is built on top of Content Centric Network (CCN): Inst Data Inst Data Inst Data Inst Data
Datacast transport protocol
Simulation: multiple Steiner trees algorithm We tested our algorithm in Fattree(24,3), BCube(8, 3), Torus(16, 3) under the link failure rates (LFR) of 1%, 3% and 5%. Running times.Steiner tree numbers.
Simulation: Datacast congestion control Steiner Tree 1.Steiner Tree 2.
Simulation: Datacast congestion control Based on Theorem 1, Datacast needs 125KB caches to work at full rate. Based on Theorem 2, the duplicate data ratios is 1.19%. Cache Size (KB)Throughput(Mbps)Duplicate Data Ratio (%)
Simulation: Datacast congestion control Compare with BitTorrent. Fattree.BCube.Torus.
Experiment: Datacast congestion control
We compare Datacast with BitTorrent. We use both of them to transmit 4GB data. Finish time (s)Link stress Datacast BitTorrent521.39
Related work Reliable IP multicast Pgm congestion control (pgmcc) Active Reliable Multicast (ARM) End-host based overlays SplitStream End System Multicast Cornet
Conclusion In this paper, we propose Datacast which Calculates multiple edge-disjoint Steiner trees in DCNs Uses CCN to turn hard group states to soft packet caching Uses a simple rate-based AIMD congestion control algorithm to achieve high efficiency Datacast is scalable and achieves high bandwidth efficiency
Thank you!