Lecture 2: Leaf-Spine and PortLand Networks

Slides:



Advertisements
Similar presentations
Interconnection: Switching and Bridging CS 4251: Computer Networking II Nick Feamster Fall 2008.
Advertisements

PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric
COS 461 Fall 1997 Routing COS 461 Fall 1997 Typical Structure.
CSE331: Introduction to Networks and Security Lecture 8 Fall 2002.
J. K. Kim Portland. Trend of Data Center By J. Nicholas Hoover, InformationWeek June 17, :00 AMJ. Nicholas Hoover 200 million Euro Data centers.
PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric. Presented by: Vinuthna Nalluri Shiva Srivastava.
Radhika Niranjan Mysore, Andreas Pamboris, Nathan Farrington, Nelson Huang, Pardis Miri, Sivasankar Radhakrishnan, Vikram Subramanya, and Amin Vahdat Department.
PortLand: A Scalable Fault- Tolerant Layer 2 Data Center Network Fabric B 財金三 婁瀚升 1.
Data Center Fabrics. Forwarding Today Layer 3 approach: – Assign IP addresses to hosts hierarchically based on their directly connected switch. – Use.
CSE 534 Fundamentals of Computer Networks Lecture 4: Bridging (From Hub to Switch by Way of Tree) Based on slides from D. Choffnes Northeastern U. Revised.
CS 4700 / CS 5700 Network Fundamentals Lecture 7: Bridging (From Hub to Switch by Way of Tree) Revised 1/14/13.
Data Communications and Computer Networks Chapter 4 CS 3830 Lecture 22 Omar Meqdadi Department of Computer Science and Software Engineering University.
An Engineering Approach to Computer Networking
Oct 26, 2004CS573: Network Protocols and Standards1 IP: Routing and Subnetting Network Protocols and Standards Autumn
Oct 21, 2004CS573: Network Protocols and Standards1 IP: Addressing, ARP, Routing Network Protocols and Standards Autumn
1 Network Layer: Host-to-Host Communication. 2 Network Layer: Motivation Can we built a global network such as Internet by extending LAN segments using.
Portland: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric Offense Kai Chen Shih-Chi Chen.
PortLand Presented by Muhammad Sadeeq and Ling Su.
Chapter 19 Binding Protocol Addresses (ARP) Chapter 20 IP Datagrams and Datagram Forwarding.
Introduction to Computer Networks 09/23 Presenter: Fatemah Panahi.
A Scalable, Commodity Data Center Network Architecture.
Layer 2 Switch  Layer 2 Switching is hardware based.  Uses the host's Media Access Control (MAC) address.  Uses Application Specific Integrated Circuits.
Network Layer (3). Node lookup in p2p networks Section in the textbook. In a p2p network, each node may provide some kind of service for other.
DNS (Domain Name System) Protocol On the Internet, the DNS associates various sorts of information with domain names. A domain name is a meaningful and.
TRansparent Interconnection of Lots of Links (TRILL) March 11 th 2010 David Bond University of New Hampshire: InterOperability.
Introduction to Network Layer. Network Layer: Motivation Can we built a global network such as Internet by extending LAN segments using bridges? –No!
Cisco – Chapter 11 Routers All You Ever Wanted To Know But Were Afraid to Ask.
NUS.SOC.CS2105 Ooi Wei Tsang Application Transport Network Link Physical you are here.
CS 640: Introduction to Computer Networks
Network and Communications Ju Wang Chapter 5 Routing Algorithm Adopted from Choi’s notes Virginia Commonwealth University.
Chi-Cheng Lin, Winona State University CS 313 Introduction to Computer Networking & Telecommunication Chapter 5 Network Layer.
Floodless in SEATTLE : A Scalable Ethernet ArchiTecTure for Large Enterprises. Changhoon Kim, Matthew Caesar and Jenifer Rexford. Princeton University.
1 TCP/IP Internetting ä Subnet layer ä Links stations on same subnet ä Often IEEE LAN standards ä PPP for telephone connections ä TCP/IP specifies.
The Network Layer Introduction  functionality and service models Theory  link state and distance vector algorithms  broadcast algorithms  hierarchical.
CS 447 Networks and Data Communication ARP (Address Resolution Protocol) for the Internet Department of Computer Science Southern Illinois University Edwardsville.
Computer Networks 15-1 Chapter 15. Connecting LANs, Backbone Networks, and Virtual LANs 15.1 Connecting devices 15.2 Backbone networks 15.3 Virtual LANs.
1 Network Administration Module 3 ARP/RARP. 2 Address Resolution The problem Physical networks use physical addresses, not IP addresses Need the physical.
IP1 The Underlying Technologies. What is inside the Internet? Or What are the key underlying technologies that make it work so successfully? –Packet Switching.
Routing Algorithms and IP Addressing Routing Algorithms must be ▪ Correctness ▪ Simplicity ▪ Robustness ▪ Stability ▪ Fairness ▪ Optimality.
1 K. Salah Module 5.1: Internet Protocol TCP/IP Suite IP Addressing ARP RARP DHCP.
PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric Radhika Niranjan Mysore, Andreas Pamboris, Nathan Farrington, Nelson Huang, Pardis.
IP: Addressing, ARP, Routing
CIS 700-5: The Design and Implementation of Cloud Networks
COMPUTER NETWORKS CS610 Lecture-27 Hammad Khalid Khan.
Communication Networks: Technology & Protocols
CS 3700 Networks and Distributed Systems
Routing Jennifer Rexford.
ETHANE: TAKING CONTROL OF THE ENTERPRISE
Scaling the Network: The Internet Protocol
Networking Recap Storage Intro
Revisiting Ethernet: Plug-and-play made scalable and efficient
ICMP ICMP – Internet Control Message Protocol
Chapter 4 Data Link Layer Switching
Introduction to Networking
CS 3700 Networks and Distributed Systems
FAR: A Fault-avoidance Routing Method for Data Center Networks with Regular Topology Please send.
NTHU CS5421 Cloud Computing
Intra-Domain Routing Jacob Strauss September 14, 2006.
One Upon A Time Computer Networks
CS 457 – Lecture 10 Internetworking and IP
CS 4700 / CS 5700 Network Fundamentals
What’s “Inside” a Router?
Chapter 16 Connecting LANs, Backbone Networks, and Virtual LANs
CS 4700 / CS 5700 Network Fundamentals
Scaling the Network: The Internet Protocol
Chapter 15. Connecting Devices
Ch 17 - Binding Protocol Addresses
Other Routing Protocols
An Engineering Approach to Computer Networking
Presentation transcript:

Lecture 2: Leaf-Spine and PortLand Networks Cloud Computing Lecture 2: Leaf-Spine and PortLand Networks

Leaf and Spine Spanning Trees: Ewwww! Layer 2: SPB or Trill Layer 3: OSPF

Leaf and Spine Type of Clos network Essentially folded, but still N-to-N connections Derived from old phone company architecture, invented in 1950s. All paths are same length from edge to edge Great for switch vendors Need to pick path, as can choose any middle router Very redundant Can implement at layer-2 or layer-3

Networking over time Flat – Gee whiz! We have a LAN! Ut-Oh! Too many addresses to remember what to do with them all, never mind to manage each one individually Hierarchical – Look ma’! Global scale! Ut-Oh! Hierarchy requires management. Creates hot spots and fragments address space. IP addresses limit migration. Forwarding is painful. Fat Tree – Many roots means many paths, relieves hot spots Still hierarchical. Still fragment. Still needs managing. Etc.

Goals Connectivity – inside, outside, no loops, etc. Efficiency – short paths, no hot spots, etc. Self configuring – Humans effort can’t be scaled in this way Robust – Consequences of failure aren’t disproportional; Failure is recoverable VMs and Migration – Network structure doesn’t compartmentalize VMs Commodity hardware – Certainly facilitates adoption

Implications Connectivity – Scale and hierarchy requires big routing/forwarding tables or impossibly large flat space. Routing protocols adapt to change slowly, leaving temporary inconsistency, e.g. loops, from failure and other changes. Efficiency – Look toward fat trees, clos networks Self-configuring – Look toward a discovery protocol, p2p or coordinated Robust – Rapidly learn changed, avoid single points of failure w/high responsibility Migration – Want a flat address space to avoid compartmentalization. Trying to fix above IP layer is a mess as it all comes back down and has to find its way. Commodity hardware – Services provided in off-switch software

Key Idea Minimize use of distributed state, because coordination for recovery or change takes time Maximize use of central state for lower volume services, because it can change/recover faster, and won’t bottleneck at low volume

Portland Solution Use commodity switches and off-load services into software on commodity server Start With Fat Tree for a topology without hot spots Use layer-2 to avoid routing, forwarding, and related complexity Separate host identifier from host location IP addresses identify host, but not location, just and ID Use “Pseudo MAC Address” to identify location at Level-2

PortLand Addresses Normally MAC addresses are arbitrary – no clue about location IP normally is hierarchical, but here we are using it only as a host identifier If MAC addresses are not tied to location, switch tables grow linearly with growth of network, i.e. O(n) PortLand uses hierarchical MAC addresses, called “Pseudo MAC” or PMAC addresses to provide for switch location <pod:port:position:vmid>

PortLand PMAC Addresses 1 2 3 Position 1 1 1 PMAC: <pod.position.port.vmid> 48 bits: <16-bits.8-bits.8-bits.16-bits>

Portland PMAC Addresses

VM Migration Flat address space. IP address unchanged after migration, higher level doesn’t see state change After migration IP<->PMAC changes, as PMAC is location dependent VM sends gratuitous ARP with new mapping. Fabric Manager receives ARP and sends invalidation to old switch Old switch sets flow table to software, causing ARP to be sent to any stray packets Forwarding the packet is optional, as retransmit (if reliable) will fix delivery

Location Discovery: Configuring Switch IDs Humans = Not right Answer Discovery = Right Answer Send messages to neighbors – Get Tree Level Hosts don’t reply, so edge only hears back from above Aggregate hears back from both levels Core hears back only from aggregate Contact Fabric Manager with tree level to get ID Fabric Manager is service running on commodity host Assigns ID

Name Resolution: MACPMACIP End hosts continue to use Actual MAC (AMAC) addresses Switches convert PMAC<->AMAC for the host Edge switch responsible for creating PMAC:AMAC mapping and telling Fabric Manager Software on commodity server, can be replicated, etc. Simplicity is a virtue. Mappings timed out of Fabric Manager’s cache, if not used. ARPs are for PMACs First ask fabric manager which keeps cache. Then, if needed, broadcast.

No loops, No Spanning Trees Forwarding can only go up the tree. Cycles not possible.

Failure Keep-alives like the link discovery messages Miss a keep alive? Tattle to the Fabric Manager Fabric manager tells effected switches, which adjust own tables. O(N) vs O(N2) for traditional routing algorithms (Fabric Manager tells every switch vs every switch tells every switch)

Looking Back Connectivity – Hosts can talk! No possibility of loops Efficiency – Much less memory needed in switches, O(N) fault handlingh Self configuring – Discovery protocol + ARP Robust – Failure handling coordinated by FM VMs and Migration – Each has own IP address, each has own MAC address Commodity hardware – Nothing magic.