Department of Computer Science, Jinan University, Guangzhou, P.R. China Lijun Lyu, Junjie Xie, Yuhui Deng, Yongtao Zhou ICA3PP 2014: The 14th International.

Slides:



Advertisements
Similar presentations
IP Fast Reroute Using Tunnel-AT draft-xu-ipfrr-tunnelat-00 Mingwei Xu, Lingtao Pan, Qing Li Tsinghua University, China 75 th IETF Meeting, Stockholm July.
Advertisements

Scalable Content-Addressable Network Lintao Liu
BCube: A High Performance, Server-centric Network Architecture for Modular Data Centers Chuanxiong Guo1, Guohan Lu1, Dan Li1, Haitao Wu1, Xuan Zhang2,
George Michelogiannakis, Nan Jiang, Daniel Becker, William J. Dally This work was completed in Stanford University.
1 Exploring Efficient and Scalable Multicast Routing in Future Data Center Networks Dan Li, Jiangwei Yu, Junbiao Yu, Jianping Wu Tsinghua University Presented.
Ranveer Chandra , Kenneth P. Birman Department of Computer Science
Mobile and Wireless Computing Institute for Computer Science, University of Freiburg Western Australian Interactive Virtual Environments Centre (IVEC)
Datacenter Network Topologies
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
Scalable Application Layer Multicast Suman Banerjee Bobby Bhattacharjee Christopher Kommareddy ACM SIGCOMM Computer Communication Review, Proceedings of.
7. Fault Tolerance Through Dynamic or Standby Redundancy 7.6 Reconfiguration in Multiprocessors Focused on permanent and transient faults detection. Three.
Routing and Routing Protocols
Chuanxiong Guo, Haitao Wu, Kun Tan,
Routing.
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
Multipath Routing CS 522 F2003 Beaux Sharifi. Agenda Description of Multipath Routing Necessity of Multipath Routing 3 Major Components Necessary for.
Ji-Yong Shin * Bernard Wong +, and Emin Gün Sirer * * Cornell University + University of Waterloo 2 nd ACM Symposium on Cloud ComputingOct 27, 2011 Small-World.
A Scalable, Commodity Data Center Network Architecture.
FAR: A Fault-avoidance Routing Method for Data Center Networks with Regular Topology Bin Liu, ZTE.
Quasi Fat Trees for HPC Clouds and their Fault-Resilient Closed-Form Routing Technion - EE Department; *and Mellanox Technologies Eitan Zahavi* Isaac Keslassy.
Chapter 13: WAN Technologies and Routing 1. LAN vs. WAN 2. Packet switch 3. Forming a WAN 4. Addressing in WAN 5. Routing in WAN 6. Modeling WAN using.
Network Support for Cloud Services Lixin Gao, UMass Amherst.
Link State Routing Protocol W.lilakiatsakun. Introduction (1) Link-state routing protocols are also known as shortest path first protocols and built around.
Routing and Routing Protocols Dynamic Routing Overview.
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2007 (TPDS 2007)
Presenter: Po-Chun Wu. Outline Introduction BCube Structure BCube Source Routing (BSR) Other Design Issues Graceful degradation Implementation.
1 Introducing Routing 1. Dynamic routing - information is learned from other routers, and routing protocols adjust routes automatically. 2. Static routing.
M.Menelaou CCNA2 ROUTING. M.Menelaou ROUTING Routing is the process that a router uses to forward packets toward the destination network. A router makes.
Network Aware Resource Allocation in Distributed Clouds.
“Intra-Network Routing Scheme using Mobile Agents” by Ajay L. Thakur.
Routing & Architecture
1 Department of Computer Science, Jinan University 2 School of Computer Science & Technology, Huazhong University of Science & Technology Junjie Xie 1,
CS3502: Data and Computer Networks Local Area Networks - 4 Bridges / LAN internetworks.
 Network Segments  NICs  Repeaters  Hubs  Bridges  Switches  Routers and Brouters  Gateways 2.
Aadil Zia Khan and Shahab Baqai LUMS School of Science and Engineering QoS Aware Path Selection in Content Centric Networks Fahad R. Dogar Carnegie Mellon.
Multicast Routing Algorithms n Multicast routing n Flooding and Spanning Tree n Forward Shortest Path algorithm n Reversed Path Forwarding (RPF) algorithms.
Floodless in SEATTLE : A Scalable Ethernet ArchiTecTure for Large Enterprises. Changhoon Kim, Matthew Caesar and Jenifer Rexford. Princeton University.
Department of Computer Science A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares Alexander Loukissas Amin Vahdat SIGCOMM’08 Reporter:
N. Hu (CMU)L. Li (Bell labs) Z. M. Mao. (U. Michigan) P. Steenkiste (CMU) J. Wang (AT&T) Infocom 2005 Presented By Mohammad Malli PhD student seminar Planete.
McGraw-Hill©The McGraw-Hill Companies, Inc., 2004 Connecting Devices CORPORATE INSTITUTE OF SCIENCE & TECHNOLOGY, BHOPAL Department of Electronics and.
Dual Centric Data Center Network Architectures DAWEI LI, JIE WU (TEMPLE UNIVERSITY) ZHIYONG LIU, AND FA ZHANG (CHINESE ACADEMY OF SCIENCES) ICPP 2015.
Chuanxiong Guo, Haitao Wu, Kun Tan, Lei Shi, Yongguang Zhang, Songwu Lu SIGCOMM 2008 Presented by Ye Tian for Course CS05112.
IP Routing Principles. Network-Layer Protocol Operations Each router provides network layer (routing) services X Y A B C Application Presentation Session.
On Reducing Mesh Delay for Peer- to-Peer Live Streaming Dongni Ren, Y.-T. Hillman Li, S.-H. Gary Chan Department of Computer Science and Engineering The.
Subways: A Case for Redundant, Inexpensive Data Center Edge Links Vincent Liu, Danyang Zhuo, Simon Peter, Arvind Krishnamurthy, Thomas Anderson University.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
November 4, 2003Applied Research Laboratory, Washington University in St. Louis APOC 2003 Wuhan, China Cost Efficient Routing in Ad Hoc Mobile Wireless.
Efficient Resource Allocation for Wireless Multicast De-Nian Yang, Member, IEEE Ming-Syan Chen, Fellow, IEEE IEEE Transactions on Mobile Computing, April.
Improving Fault Tolerance in AODV Matthew J. Miller Jungmin So.
Jiahao Chen, Yuhui Deng, Zhan Huang 1 ICA3PP2015: The 15th International Conference on Algorithms and Architectures for Parallel Processing. zhangjiajie,
Routing Semester 2, Chapter 11. Routing Routing Basics Distance Vector Routing Link-State Routing Comparisons of Routing Protocols.
Performance Comparison of Ad Hoc Network Routing Protocols Presented by Venkata Suresh Tamminiedi Computer Science Department Georgia State University.
BUFFALO: Bloom Filter Forwarding Architecture for Large Organizations Minlan Yu Princeton University Joint work with Alex Fabrikant,
VL2: A Scalable and Flexible Data Center Network
Network Layer COMPUTER NETWORKS Networking Standards (Network LAYER)
Yiting Xia, T. S. Eugene Ng Rice University
Instructor Materials Chapter 5: Dynamic Routing
Data Center Network Architectures
Chuanxiong Guo, et al, Microsoft Research Asia, SIGCOMM 2008
Chapter 4 Data Link Layer Switching
Routing.
Chapter 5: Dynamic Routing
BCube: A High Performance, Server-centric Network Architecture for Modular Data Centers Chuanxiong Guo1, Guohan Lu1, Dan Li1, Haitao Wu1, Xuan Zhang2,
Chuanxiong Guo, Haitao Wu, Kun Tan,
Data Center Architectures
Other Routing Protocols
Routing.
Virtual LAN (VLAN).
Multicasting Unicast.
Presentation transcript:

Department of Computer Science, Jinan University, Guangzhou, P.R. China Lijun Lyu, Junjie Xie, Yuhui Deng, Yongtao Zhou ICA3PP 2014: The 14th International Conference on Algorithms & Architectures for Parallel Processing. August 24-27, Dalian, China.

Motivation Challenges Related work Our idea System architecture Evaluation Conclusion 2

The Explosive Growth of Data  IDC: 1,800EB data in 2011, 40-60% annual increase  Larger Data Center  Google: 19 data centers > 1 million servers  Higher traffic  Cisco forecasts that annual traffic in global data centers will nearly triple over the next 5 years and reach 7.7ZB by the end of Google Data Center

Data Center Network Node increment  Scalability? Failures are common  Fault tolerance?  Google MapReduce in a 4,000-node cluster:  5 nodes fail during a job  1 disk fails every 6 hours Bandwidth-hungry services  Network capacity? Infrastructure services: MapReduce, GFS, … Network applications: Cloud disk, Video, …

Tree-based Structure Traditional tree  Bandwidth bottleneck, Single points of failure, Expensive Modified tree: Fat-tree  High capacity  Limited scalability 5 Traditional Tree-based Structure Fat-tree

Other novel, hybrid network structures Physical topology  Level-based, but not tree-based  Recursively defined Routing mechanism  No routers, without traditional internet routing mechanism  Put routing intelligence on servers  Take advantage of structural properties Typical structures  DCell, FiConn, BCube, Totoro… 6

DCell 7 Totoro FiConn BCube Physical structures

Routing mechanisms 8 DCellTotoroFiConnBCube Core ideaDivide-and-Conquer Correct different address digits CalculationHop by hopFull path Link stateBroadcast domainPath probing Path selectionDijkstra + ReroutingGreedyAvailable one Traffic-awareNo mentionYesNo mention Shortest distance NoYes

What we achieve: Athena Routing Mechanism Routing algorithm  Based on Dynamic Programming  Find the shortest path with lower complexity than classic algorithms  Support Multi-path Path probing mechanism  Bypass the failed nodes & links  Traffic-aware Properties  More resilient, shorter latency, higher capacity, Lower complexity 9

Athena Routing Mechanism Implement on the structure of Totoro Compare with the original Totoro Fault-tolerant Routing Algorithm (TFR) and Shortest Path Algorithm (SPA, based on Floyd-Warshall). Applicable to DCell, FiConn, BCube… Similar topology: level-based, recursively defined.. Put routing intelligence on servers 10

Totoro  Two-port servers  Low-end switches  Level-based  Recursively defined two-port NIC 11 Totoro Structure of One Level

Building Totoro Connect N servers to an N-port switch Here, N=4 Basic partition: Totoro 0 Intra-switch A Totoro 0 Structure 12

Building Totoro Available ports in Totoro 0 : c. Here, c=4 Connect n Totoro 0 s to n-port switches by using c/2 ports Inter-switch A Totoro 1 structure consists of n Totoro 0 s. 13

Building Totoro Connect n Totoro i-1 s to n-port switches to build a Totoro i Recursively defined Half of available ports ⇒ Open & Scalable The number of paths among Totoro i s is n/2 times of the number of paths among Totoro i-1 s ⇒ Multi- redundant links ⇒ High network capacity 14

15 Totoro 2 structure with N = 4, n = 4, K = 2.

16 Athena Routing Algorithm (ARA) Based on Dynamic Programming (DP) Applicable to problems which exhibit the properties of  Overlapping subproblems  Optimal substructure Recursively calculate

17 Steps of ARA: 1.Suppose src and dst belong to two partitions. 2.Get all paths connecting these two partitions. 3.For each path, recursively calculate it. 4.Store all paths. 5.Sort all path by length. 6.Remove the extra paths. This function is based on the corresponding structural properties. Cartesian product

18 Case study of ARA work out the path from src to dst

19 Case study of ARA Step. 1: src and dst belong to two different sub- partitions respectively

20 Case study of ARA Step. 2: there exist two paths between these two sub- partitions

21 Case study of ARA Step. 3: for Path 1, recursively work out the sub-paths in these sub-partitions, and join them for a full path

22 Case study of ARA Step. 4: similarly, work out the full path for Path 2

23 Case study of ARA Step. 5: add all paths into the result set

24 Case study of ARA Step. 5: sort the paths by lengths

25 Case study of ARA Step. 5: remove the extra paths (here, we suppose the size of set to return is 1, i.e., it is the shortest path)

26 Path Probing Mechanism Source host sends the probing request packets Destination host sends probing reply packets Intermediate servers record the link capacities in the probing packets and forward them

27 Path Probing Mechanism Detect the failed paths  No extra rerouting technique is required Detect the link capacity  Support load balance…

28

29

30 Protocol Implementation ARM Packet format  Path-probing packet  Data packet

31 Protocol Implementation Protocol  2.5-layer protocol  How an intermediate server determines the next hop?  A fact: two adjacent servers in a path only differ at one “bit”  Hence, we only store the different “bit”s in the vector Ethernet ARMIPTCP

Evaluating Path Failure & Average Path Lengths ARM vs. TFR vs. SPA TFR : the original Totoro Fault-tolerant Routing algorithm SPA: Shortest Path Algorithm, Floyd-Warshall, performance bound Evaluating Resource Usage 32

33 Evaluating Path Failure & Average Path Lengths Experimental parameters Types of failuresLink, Node, Switch & Rack failures PlatformTotoro 2 (4096 servers) Failures ratios2% - 20% Communication modeAll-to-all Simulation times20 times

34 Evaluating Path Failure P ath failure ratio vs. server/rack failure ratio The performance of ARM/TFR are almost identical to that of SPA!

35 Evaluating Path Failure P ath failure ratio vs. switch failure ratio The performance of ARM is almost identical to that of SPA! But TFR isn’t.

36 Evaluating Path Failure Path failure ratio vs. link failure ratio When a high link failure occurs: ARM achieves slightly better capacity than TFR. Performance gap between ARM and SPA still exists! SPA traverse all feasible links in the whole structure until finding a valid path! This is a tradeoff that ARM makes to facilitate algorithmic complexity and save computation resources.

37 Evaluating Average Path Lengths ARM: 1.Better than TFR. 2.Almost identical to SPA. 3.Shorter than SPA, this is because the path failure ratio of ARM is a bit higher than that of SPA, thus our total path length is shorter.

38 Evaluating Resource Usage Experimental parameters TestbedLenovo T350, Quad-core, 8GB memory PlatformTotoro 2 (4096 servers) Size of each result10 paths Communication modeOne-to-all in 4 Totoro 1

39 Evaluating Resource Usage +10nodes/s 28% 18s 0% CPU: 1.Increase by 10 per second 2.Peak value of 28% at 18s 3.Benefited from the cache Memory: For each host, it only costs 164KB at most.

More resilient Shorter latency Higher capacity Lower complexity In the future work, we will focus on the implementation of ARM in DCell, FiConn and other structures! 40

41 ICA3PP 2014: The 14th International Conference on Algorithms & Architectures for Parallel Processing. August 24-27, Dalian, China.