NetLord: A Scalable Multi-tenant Network Architecture for Virtualized Cloud Datacenters Jayaram Mudigonda Praveen Yalagandula, Jeff Mogul, Bryan Stiekes,

Slides:



Advertisements
Similar presentations
MPLS VPN.
Advertisements

All Rights Reserved © Alcatel-Lucent 2009 Enhancing Dynamic Cloud-based Services using Network Virtualization F. Hao, T.V. Lakshman, Sarit Mukherjee, H.
VCRIB: Virtual Cloud Rule Information Base Masoud Moshref, Minlan Yu, Abhishek Sharma, Ramesh Govindan HotCloud 2012.
PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric
NetFPGA Project: 4-Port Layer 2/3 Switch Ankur Singla Gene Juknevicius
DOT – Distributed OpenFlow Testbed
Network Virtualization Overlay Control Protocol Requirements draft-kreeger-nvo3-overlay-cp-00 Lawrence Kreeger, Dinesh Dutt, Thomas Narten, David Black,
Introduction into VXLAN Russian IPv6 day June 6 th, 2012 Frank Laforsch Systems Engineer, EMEA
PARIS: ProActive Routing In Scalable Data Centers Dushyant Arora, Theophilus Benson, Jennifer Rexford Princeton University.
Radhika Niranjan Mysore, Andreas Pamboris, Nathan Farrington, Nelson Huang, Pardis Miri, Sivasankar Radhakrishnan, Vikram Subramanya, and Amin Vahdat Department.
Data Center Fabrics. Forwarding Today Layer 3 approach: – Assign IP addresses to hosts hierarchically based on their directly connected switch. – Use.
Cisco 3 - Switch Perrine. J Page 15/8/2015 Chapter 8 What happens to the member ports of a VLAN when the VLAN is deleted? 1.They become inactive. 2.They.
NetLord: A Scalable Multi-Tenant Network Architecture for Virtualized Datacenters SIGCOMM ’11 Authors: Jayaram Mudigonda, Praveen Yalagandula, Jeff Mogul.
Datacenter Network Topologies
William Stallings Data and Computer Communications 7 th Edition (Selected slides used for lectures at Bina Nusantara University) Internetworking.
ProActive Routing In Scalable Data Centers with PARIS Joint work with Dushyant Arora + and Jennifer Rexford* + Arista Networks *Princeton University Theophilus.
COS 461: Computer Networks
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
A Scalable, Commodity Data Center Network Architecture.
Secure Cloud Computing with Virtualized Network Infrastructure HotCloud 10 By Xuanran Zong.
Jennifer Rexford Princeton University MW 11:00am-12:20pm SDN Software Stack COS 597E: Software Defined Networking.
Microsoft Virtual Academy Module 4 Creating and Configuring Virtual Machine Networks.
An Overview of Software-Defined Network Presenter: Xitao Wen.
Virtual LANs. VLAN introduction VLANs logically segment switched networks based on the functions, project teams, or applications of the organization regardless.
Networking the Cloud Presenter: b 電機三 姜慧如.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public ITE PC v4.0 Chapter 1 1 Connecting to the Network Networking for Home and Small Businesses.
ECE 526 – Network Processing Systems Design Networking: protocols and packet format Chapter 3: D. E. Comer Fall 2008.
© Copyright 2010 Hewlett-Packard Development Company, L.P. 1 Jayaram Mudigonda, HP Labs Praveen Yalagandula, HP Labs Mohammad Al-Fares, UCSD Jeff Mogul,
Mahindra-British Telecom Ltd. Exploiting Layer 2 By Balwant Rathore.
Microsoft Windows Server 2003 TCP/IP Protocols and Services Technical Reference Slide: 1 Lesson 7 Internet Protocol (IP) Routing.
VL2: A Scalable and Flexible Data Center Network Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David.
Chapter 3 - VLANs. VLANs Logical grouping of devices or users Configuration done at switch via software Not standardized – proprietary software from vendor.
W&L Page 1 CCNA CCNA Training 3.4 Describe the technological requirements for running IPv6 in conjunction with IPv4 Jose Luis Flores /
1 Version 3.0 Module 7 Spanning Tree Protocol. 2 Version 3.0 Redundancy Redundancy in a network is needed in case there is loss of connectivity in one.
Network Virtualization in Multi-tenant Datacenters Author: VMware, UC Berkeley and ICSI Publisher: 11th USENIX Symposium on Networked Systems Design and.
The Goals Proposal Realizing broadcast/multicast in virtual networks
Cisco Confidential © 2013 Cisco and/or its affiliates. All rights reserved. 1 Cisco Networking Training (CCENT/CCT/CCNA R&S) Rick Rowe Ron Giannetti.
Atrium Router Project Proposal Subhas Mondal, Manoj Nair, Subhash Singh.
Network Virtualization Ben Pfaff Nicira Networks, Inc.
VL2: A Scalable and Flexible Data Center Network
Multi Node Label Routing – A layer 2.5 routing protocol
CIS 700-5: The Design and Implementation of Cloud Networks
Exploiting Layer 2 By Balwant Rathore.
Link Layer 5.1 Introduction and services
Heitor Moraes, Marcos Vieira, Italo Cunha, Dorgival Guedes
Praveen Tammana† Rachit Agarwal‡ Myungjin Lee†
Advanced Computer Networks
Scaling the Network: The Internet Protocol
Revisiting Ethernet: Plug-and-play made scalable and efficient
Improving Datacenter Performance and Robustness with Multipath TCP
NOX: Towards an Operating System for Networks
Chapter 4 Data Link Layer Switching
1.
Virtual LANs.
NTHU CS5421 Cloud Computing
Multipath TCP Yifan Peng Oct 11, 2012
Aled Edwards, Anna Fischer, Antonio Lain HP Labs
Software Defined Networking
NTHU CS5421 Cloud Computing
2018/12/10 Energy Efficient SDN Commodity Switch based Practical Flow Forwarding Method Author: Amer AlGhadhban and Basem Shihada Publisher: 2016 IEEE/IFIP.
VL2: A Scalable and Flexible Data Center Network
Internet and Web Simple client-server model
Data Center Architectures
Scaling the Network: The Internet Protocol
Ch 17 - Binding Protocol Addresses
IS-IS VPLS for Data Center Network draft-xu-l2vpn-vpls-isis-02
Elmo Muhammad Shahbaz Lalith Suresh, Jennifer Rexford, Nick Feamster,
Reconciling Zero-conf with Efficiency in Enterprises
Towards Predictable Datacenter Networks
Chapter 4: outline 4.1 Overview of Network layer data plane
Presentation transcript:

NetLord: A Scalable Multi-tenant Network Architecture for Virtualized Cloud Datacenters Jayaram Mudigonda Praveen Yalagandula, Jeff Mogul, Bryan Stiekes, Yanick Pouffary Hewlett-Packard Company

The Goal Build the right network for a cloud datacenter? 12 November 201812 November 2018

The Goal Build the right network for a cloud datacenter? 12 November 201812 November 2018

Cloud Datacenter Provides Infrastructure as a Service Virtualized Shared across multiple tenants Virtualized Tenants run Virtual Machines (VMs) 12 November 201812 November 2018

The Right Network Virtualization + Multi-tenancy Scale Inexpensive A Virtual Network to each tenant Flexible abstraction: No restrictions on addressing or protocols Scale Tenants, Servers: 10s of 1000s, VMs: 100s of 1000s Adequate bandwidth Inexpensive Capital: Cheap COTS components Operational: Ease of management, High bisection BW 12 November 201812 November 2018

The Challenge Basic COTS switching gear  Limited functionality Limited resources: Not enough Forwarding Information Base (FIB) space COTS Switches + Bisection BW  Multipathing Multipathig and/or Scale  More switch resources 12 November 201812 November 2018

The Challenge Assume no switch support and conserve switch resources (FIB) to Simultaneously achieve: Scale, Multi-tenancy, Bisection BW, and Ease of config 12 November 201812 November 2018

State of the Art – Scale Most prior work is limited by one or more of: New protocols Modified control and/or data planes Preferred topologies Resources (such as table space) on switches 12 November 201812 November 2018

State of the Art – Multi-tenancy Traditional VLANs Single tenant No L3 abstraction Cannot scale beyond 4K Commercial: Amazon EC2 and Microsoft Azure Most of recent work is on segregation not virtualization Diverter [WREN 2010] Carves a SINGLE IP address space among tenants Restricts: VM placements and IP address assignment Can be limited by the FIB space in switches 12 November 201812 November 2018

NetLord An encapsulation scheme and A complementary switch configuration  Scalable multi-tenancy Ease of configuration Order of magnitude reduction in FIB requirements High bisection BW 12 November 201812 November 2018

Why Encapsulate? Unmodified VM packets onto the network Excessive FIB pressure and no L3 abstraction Alternative: Rewrite headers Cannot assume L3 header  Rewrite only MAC hdr Rewrite with server MACs  Somewhat reduced FIB Cannot identify the right VM on destination Server 12 November 201812 November 2018

NetLord Encapsulation Two headers: a MAC and an IP Reduced FIB pressure Outer Src MAC = MAC of the Src edge switch Outer Dst MAC = MAC of the Dst edge switch Correct delivery Right edge switch: The outer MAC header Right server: Right port # in the outer dest IP addr Right VM: Tenant-ID frm outer dest IP + Inner dest MAC Clean abstraction No assumptions about VM protocols and/or addressing 12 November 201812 November 2018

NetLord Encapsulation VM1 Tenant 15 Hypervisor Server 1 ES-MAC1 P1 NetLord Agent Hypervisor Server 2 ES-MAC2 P2 NetLord Agent VM2 Tenant 15 Port# TenantID 7 bits 24 bits IP Address Format IP-Range-2 COTS Ethernet ES-MAC2 ES-MAC1 VM1  VM2 P1.0.0.15 P2.0.0.15 ES-MAC1  ES-MAC2 12 November 201812 November 2018

Switch Configuration Outer MAC hdr takes pkt to egress edge switch A switch on MAC Pkt addressed to itself Strips MAC hdr and forwards based on IP hdr inside Standard behavior Correct forwarding  Configure the L3 forwarding tables right Make sure to match the server configs 12 November 201812 November 2018

… … Switch Configuration 1.*.*.* 1.0.0.1 2.*.*.* 2.0.0.1 48.*.*.* Hypervisor Server 1 1.*.*.* 1.0.0.1 NLA Hypervisor Server 2 NLA Hypervisor Server 48 NLA … 2.*.*.* 2.0.0.1 48.*.*.* 48.0.0.1 Prefix Gateway, port 1.*.*.* 1.0.0.1, 1 2.*.*.* 2.0.0.1, 2 … 48.*.*.* 48.0.0.1, 48 … 2 1 48 12 November 201812 November 2018

NL-ARP Builds and maintains the mapping <VM-MAC,Subnet,Tenant-ID>  <SWITCH-MAC,PORT> Query – response. Similar to regular ARP NL-Agent broadcasts on VM boot-up and migration 12 November 201812 November 2018 HP Confidential 12/11/18

Goals Achieved: Abstraction No assumptions about VM-to-VM pkt contents  No restrictions on addressing schemes No restrictions on L3 protocols 12 November 201812 November 2018

Goals Achieved: Scale 24-bit tenant ID  Millions of tenants Order of magnitude reduction in FIB requirements Non-edge switches see only edge-switch MACs Edge switches see their local server MACs in addition Small number of L3 table entries in edge switches One per downlink 12 November 201812 November 2018

Goals Achieved: Ease of Config Automatic Local Same across all switches  Easy to debug Does not change  Easy to debug Easy per-tenant traffic engineering in the network Tenant ID readily available in standard IP header 12 November 201812 November 2018

Bisection Bandwidth Reduced FIB pressure  Utilize SPAIN effectively J. Mudigonda, P. Yalagandula, M. Al-Fares, and J. C. Mogul. SPAIN: COTS Data-Center Ethernet for Multipathing over Arbitrary Topologies. In Proc. USENIX NSDI, 2010. 12 November 201812 November 2018

SPAIN in One Slide Multipath forwarding in container/pod datacenters Few thousand servers (or unique MAC addresses) COTS Ethernet (with VLAN support) Arbitrary topologies A central, off-line manager: Discover topology Compute paths; exploit redundancy in the topology Merge paths into a minimal set of trees Install trees as VLANs into switches End-host (NIC driver): Download VLAN maps from the manager Spread flows over VLANs (i.e., paths) 12 November 201812 November 2018

SPAIN & Extreme Scalability Critical resource: Forwarding table entries (FIB) Each MAC address appears on multiple VLANs Worst case FIB = #VLANS × #MACs Overflowing FIB  Floods and poor goodput 64K Entries  only a few 1000 unique MACs NetLord’s ability to hide MACs helps greatly 12 November 201812 November 2018

Putting It All Together VM1 Hypervisor Server 1 P1.*.*.* NetLord Agent VM2 Hypervisor Server 2 P2.*.*.* NetLord Agent VM1VM2 VM1VM2 IP1IP2 VM1VM2 IP1IP2 macAmacB VM1VM2 IP1IP2 macAmacB macB macA COTS Ethernet 12 November 201812 November 2018

Evaluation Prototype + VM emulation Analytical arguments: Max VMs : ~650 K using 48-port 64K FIB switches NL-ARP overheads: < 1% of link BW, ~0.5% of Server memory Prototype + VM emulation NLA, encaping overheads are ignorable Substantial goodput improvements 12 November 201812 November 2018

Experimental Setup: Topology 74 Servers in a 2-level fat-tree topology 12 November 201812 November 2018

Prototype, VM Emulation NLA Kernel module VM Emulator A thin module above NLA Exports a virtual device Re-writes MAC addresses Unique pair for each TCP cxn Up to 3K VMs / Servers 200K VMs in all User-level Process … Protocol Stack VM Emulator NetLord Agent NIC Driver Physical NIC Network 12 November 201812 November 2018

Experimental Setup: Workload Parallel shuffles Emulating shuffle-phase of Map-Reduce jobs Each shuffle: 74 mappers & 74 reducers, one per server Each mapper transfers 10MB of data to all reducers 12 November 201812 November 2018

Goodput Traditional SPAIN NetLord Goodput (in Gbps) Number of VMs

Flooded Packets (in Millions) Floods Flooded Packets (in Millions) Number of VMs HP Confidential 12/11/18

Summary NetLord combines simple existing primitives in a novel fashion to achieve several out-sized benefits of practical importance: Scale Multi-tenancy Ease-of-use Bisection BW 12 November 201812 November 2018

BACKUP 12 November 201812 November 2018

Putting It All Together VM1 Hypervisor Server 1 P1.*.*.* NetLord Agent VM2 Hypervisor Server 2 P2.*.*.* NetLord Agent VM1VM2 VM1VM2 IP1IP2 VM1VM2 IP1IP2 macAmacB VM1VM2 IP1IP2 macAmacB MAC-A COTS Ethernet 12 November 201812 November 2018