A Scalable, Commodity Data Center Network Architecture Jingyang Zhu.

Slides:



Advertisements
Similar presentations
IP Router Architectures. Outline Basic IP Router Functionalities IP Router Architectures.
Advertisements

PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric
PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric. Presented by: Vinuthna Nalluri Shiva Srivastava.
Data Center Fabrics. Forwarding Today Layer 3 approach: – Assign IP addresses to hosts hierarchically based on their directly connected switch. – Use.
1 Exploring Efficient and Scalable Multicast Routing in Future Data Center Networks Dan Li, Jiangwei Yu, Junbiao Yu, Jianping Wu Tsinghua University Presented.
Towards Virtual Routers as a Service 6th GI/ITG KuVS Workshop on “Future Internet” November 22, 2010 Hannover Zdravko Bozakov.
Datacenter Network Topologies
Path Protection in MPLS Networks Using Segment Based Approach.
Chuanxiong Guo, Haitao Wu, Kun Tan,
Data Center Network Topologies: FatTree
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
Ji-Yong Shin * Bernard Wong +, and Emin Gün Sirer * * Cornell University + University of Waterloo 2 nd ACM Symposium on Cloud ComputingOct 27, 2011 Small-World.
A Scalable, Commodity Data Center Network Architecture Mohammad AI-Fares, Alexander Loukissas, Amin Vahdat Presented by Ye Tao Feb 6 th 2013.
A Scalable, Commodity Data Center Network Architecture
A Scalable, Commodity Data Center Network Architecture.
Performance and Power Efficient On-Chip Communication Using Adaptive Virtual Point-to-Point Connections M. Modarressi, H. Sarbazi-Azad, and A. Tavakkol.
Virtual LANs. VLAN introduction VLANs logically segment switched networks based on the functions, project teams, or applications of the organization regardless.
Quasi Fat Trees for HPC Clouds and their Fault-Resilient Closed-Form Routing Technion - EE Department; *and Mellanox Technologies Eitan Zahavi* Isaac Keslassy.
Network Support for Cloud Services Lixin Gao, UMass Amherst.
Before We Start How to read a research paper?
Introduction to Network Layer. Network Layer: Motivation Can we built a global network such as Internet by extending LAN segments using bridges? –No!
High-Performance Networks for Dataflow Architectures Pravin Bhat Andrew Putnam.
End-to-end resource management in DiffServ Networks –DiffServ focuses on singal domain –Users want end-to-end services –No consensus at this time –Two.
DARD: Distributed Adaptive Routing for Datacenter Networks Xin Wu, Xiaowei Yang.
1 Dynamic Interconnection Networks Miodrag Bolic.
Floodless in SEATTLE : A Scalable Ethernet ArchiTecTure for Large Enterprises. Changhoon Kim, Matthew Caesar and Jenifer Rexford. Princeton University.
Anshul Kumar, CSE IITD CSL718 : Multiprocessors Interconnection Mechanisms Performance Models 20 th April, 2006.
Shanghai Jiao Tong University 2012 Indirect Networks or Dynamic Networks Guihai Chen …with major presentation contribution from José Flich, UPV (and Cell.
VL2: A Scalable and Flexible Data Center Network Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David.
Department of Computer Science A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares Alexander Loukissas Amin Vahdat SIGCOMM’08 Reporter:
Anshul Kumar, CSE IITD ECE729 : Advanced Computer Architecture Lecture 27, 28: Interconnection Mechanisms In Multiprocessors 29 th, 31 st March, 2010.
CS 740: Advanced Computer Networks IP Lookup and classification Supplemental material 02/05/2007.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
Ch 8. Switching. Switch  Devices that interconnected with each other  Connecting all nodes (like mesh network) is not cost-effective  Some topology.
CS440 Computer Networks 1 Packet Switching Neil Tang 10/6/2008.
PeerNet: Pushing Peer-to-Peer Down the Stack Jakob Eriksson, Michalis Faloutsos, Srikanth Krishnamurthy University of California, Riverside.
1 A quick tutorial on IP Router design Optics and Routing Seminar October 10 th, 2000 Nick McKeown
Packet Switch Architectures The following are (sometimes modified and rearranged slides) from an ACM Sigcomm 99 Tutorial by Nick McKeown and Balaji Prabhakar,
[Fat- tree]: A Scalable, Commodity Data Center Network Architecture
Spring 2000CS 4611 Router Construction Outline Switched Fabrics IP Routers Extensible (Active) Routers.
Univ. of TehranIntroduction to Computer Network1 An Introduction to Computer Networks University of Tehran Dept. of EE and Computer Engineering By: Dr.
Schedulers for Hybrid Data Center Network Neelakandan Manihatty Bojan 2 nd Year PhD Student Advisor: Dr. Andrew W. Moore Eurosys Doctoral Workshop, 18.
VL2: A Scalable and Flexible Data Center Network
Data Center Architectures
Xin Li, Chen Qian University of Kentucky
Lecture 2: Cloud Computing
Data Center Network Topologies II
Lecture 2: Leaf-Spine and PortLand Networks
Data Center Network Architectures
Hydra: Leveraging Functional Slicing for Efficient Distributed SDN Controllers Yiyang Chang, Ashkan Rezaei, Balajee Vamanan, Jahangir Hasan, Sanjay Rao.
Chuanxiong Guo, et al, Microsoft Research Asia, SIGCOMM 2008
Ch 13 WAN Technologies and Routing
Advanced Computer Networks
Revisiting Ethernet: Plug-and-play made scalable and efficient
A Study of Group-Tree Matching in Large Scale Group Communications
Virtual LANs.
NTHU CS5421 Cloud Computing
A Scalable, Commodity Data Center Network Architecture
Intra-Domain Routing Jacob Strauss September 14, 2006.
BCube: A High Performance, Server-centric Network Architecture for Modular Data Centers Chuanxiong Guo1, Guohan Lu1, Dan Li1, Haitao Wu1, Xuan Zhang2,
Chuanxiong Guo, Haitao Wu, Kun Tan,
EECS 582 Final Review Mosharaf Chowdhury EECS 582 – F16.
Bridges and Extended LANs
Indirect Networks or Dynamic Networks
NTHU CS5421 Cloud Computing
EE 122: Lecture 7 Ion Stoica September 18, 2001.
Data Center Architectures
CS 6290 Many-core & Interconnect
CMPE 252A : Computer Networks
2019/10/9 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Jin-Li Ye, Yu-Huang Chu, Chien Chen.
Presentation transcript:

A Scalable, Commodity Data Center Network Architecture Jingyang Zhu

Outline Motivation Background Fat Tree Architecture –Topo –Routing –Fault Tolerent Results

Motivation Large Data Shuffle Map Reduce

Intuitive Approach High End Hardware (e.g., InfiniBand) FDRFourth Data Rate EDREnhanced Data Rate HD R High Data Rate

Alternative Approach A dedicated interconnection network –Scalablilty –Cost –Compability (i.e., app, os, hardware)

Typical Topology

Clos Network (m, n, r) = (5, 3, 4) 1. strictly non- blocking (m >= 2n - 1) 2. rearrangeably non-blocking (m >= n)

Benes Network A Clos Network with 2x2 switches

Fat Tree Multi-path Routing: UpLink (right) + DownLink (left) Oversubscription: ideal BW / actual BW of host end. e.g. 1 : 1 is good; 5 : 1 is bad Node 1 (0001) -> Node 6 (0110): 2 possible paths

Topo of Data Center - Hierachy Multi-Path = 2 Conventional Topo in Data Center GigE Link 10 GigE Link

Topo of Data Center - Fat Tree Fat Tree Topo (k = 4) (k/2)^2 k-port # of hosts: k^3 / 4, e.g., k = 48 => # of hosts: (Scalability!!!) k pods k/2 k-port

Addressing - Compability!!! Pod switches: 10.pod #.switch #.1 Core Address: 10.k.j.i (k - radix, - coordinate) j,i = 1,2,...,k/

Addressing (con't) Host: 10.pod #.switch #.ID switch 0 switch 1 switch 0 switch 1 Addressing Format is for further routing purpose

2-level table routing - pod switch Downlink to Host Uplink to Core MSB 8 - LSB Traffic diffusion occurs only in the first half of a packet’s journey

Generation of routing table addPrefix (pod switch, pre, port) addSuffix (pod switch, suf, port)

1-level table routing - core switch PrefixOutput Port / / / /163

Routing Table Implementation Content Addressable Memory (CAM) Input: data; output: match / mismatch

Routing Table Implementation (con't) Host Address Match RAM Address

Routing Example: Hierarchical Tree > >

Routing Example: Fat Tree > > No Contention!!!

Dynamic Routing Up to now, the routing alg is based on static table...any improvement??? –Yes, using dynamic routing Dynamic Routing –Flow Classification –Flow Scheduling

Dynamic Routing 1 - Flow Classification Flow: A set of packets that must have its order preserved Dynamic Routing –Avoid reordering for same flow –Reassign a minimum number of flows to minimize the disparity between ports Flow Classifier: identify flows

Flow Classification Check src & dst address Balance the port load dynamically Avoid reordering Balance the port DYNAMICALLY Every t seconds to rearrange flows Max 3 flows to be rearranged Have some risks to reorder the flow!!! - For performance consideration, not for correctness

Dynamic Routing 2 - Flow Scheduling Large flows are critical - schedule the large flows independently // edge switches if (length (flow_in) > threshold) notify central schedular else route as normal // central schedular if (receive notification) foreach possible path if (path not reserved) reserve the path & notify switches along the path Data StructureFunction bool Link [LINKSIZE]link status hash Flow record reservation & clear reservation (retire the flow)

Discussion Which one is better? –Flow classification –Flow scheduling Locally, inter pod switch Globally, among all the paths and switches Locally, inter pod switch Globally, among all the paths and switches

Fault Tolerance How to know links or switches fail? Bidirectional Forwarding Detection (BFD)

Fault Tolerance (con't) Basic ideas –Mark the link unavailable when routing, e.g., marking the load inf in flow classification –Broadcast the fault to other switches and avoid routing it

Cost 1:1

Power & Heat Power and Heat for different switches 10 GigE

Performance Different Benchmarks Percentage of ideal bisection bandwidth

Conclusion Fat tree for data center interconnection –Scalable –Cost efficient –Compatible Routing details, locally & globally Fault tolerant