Managing Open vSwitch Across a Large Heterogeneous Fleet

Slides:



Advertisements
Similar presentations
© MIRANTIS 2012PAGE 1© MIRANTIS 2012 Does Hypervisor Matter in OpenStack Greg Elkinbard Senior Technical Director.
Advertisements

Modeling and Maintaining Virtualized Services Microsoft System Center Virtual Machine Manager 2012 (c) 2011 Microsoft. All rights reserved.
Ed Duguid with subject: MACE Cloud
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice OpenVMS Guests Under HPVM Barry Kierstein.
© 2012 IBM Corporation Architecture of Quantum Folsom Release Yong Sheng Gong ( 龚永生 ) gongysh #openstack-dev Quantum Core developer.
DOT – Distributed OpenFlow Testbed
Profit from the cloud TM Parallels Dynamic Infrastructure AndOpenStack.
Open vSwitch. What is Open vSwitch ?  Open vSwitch is an open source OpenFlow capable virtual switch that is typically used with hypervisor to interconnect.
|ESDS SOFTWARE SOLUTION PVT. LTD.| Enterprise Datacenter Management Suite.
What is a Virtual Tap? Intelligent Access and Monitoring Architecture Solutions.
Network Implementation for Xen and KVM Class project for E : Network System Design and Implantation 12 Apr 2010 Kangkook Jee (kj2181)
Rackspace Lump Sum Policy
Data Center Virtualization: Open vSwitch Hakim Weatherspoon Assistant Professor, Dept of Computer Science CS 5413: High Performance Systems and Networking.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
VIRTUALIZATION AND YOUR BUSINESS November 18, 2010 | Worksighted.
Extreme Networks Confidential and Proprietary. © 2010 Extreme Networks Inc. All rights reserved.
Networking Virtualization Using FPGAs Russell Tessier, Deepak Unnikrishnan, Dong Yin, and Lixin Gao Reconfigurable Computing Group Department of Electrical.
A Cloud is a type of parallel and distributed system consisting of a collection of inter- connected and virtualized computers that are dynamically provisioned.
Cross Platform From the Metal Up ISV Partner Alliance Value Self-service storage provisioning.
Virtualization. Virtualization  In computing, virtualization is a broad term that refers to the abstraction of computer resources  It is "a technique.
EarthLink Server Management and Monitoring Updated August 6, 2015.
Xen I/O Overview. Xen is a popular open-source x86 virtual machine monitor – full-virtualization – para-virtualization para-virtualization as a more efficient.
Xen I/O Overview.
Created by: Thomas Weeks Modified Date: Classification: Public Demystifying Cloud City (Rackspace Cloud Systems)
Module 7: Fundamentals of Administering Windows Server 2008.
VirtualBox What you need to know to build a Virtual Machine.
1 The Fast(est) Path to Building a Private/Hybrid Cloud October 25th, 2011 Paul Mourani RightScale.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
Rick Lopez, Director of Quality Engineering Rainya Mosher, Dev Manager, Deploy Infrastructure Deploying from OpenStack Trunk into a Production Environment.
© 2006 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 Filtering Traffic Using Access Control Lists Introducing Routing and Switching.
26 September 2013 Federating OpenStack: a CERN and Rackspace Collaboration Tim Bell Toby Owen
©2015 EarthLink. All rights reserved. Private Cloud Hosting Create Your Own Private IT Environment.
Created by: Thomas Weeks & Robert McAden Modified Date: 6/23/2014 Classification: Public Racksburg (Rackspace Blacksburg) Fostering a Local Tech Ecosystem.
Private Cloud Hosting. IT Business Challenges I need to extend my on-premises virtualized environment to utilize the Cloud and manage the entire environment.
Queensland University of Technology CRICOS No J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.
Virtual Machines Created within the Virtualization layer, such as a hypervisor Shares the physical computer's CPU, hard disk, memory, and network interfaces.
Abstract A Structured Approach for Modular Design: A Plug and Play Middleware for Sensory Modules, Actuation Platforms, Task Descriptions and Implementations.
SOFTWARE DEFINED NETWORKING/OPENFLOW: A PATH TO PROGRAMMABLE NETWORKS April 23, 2012 © Brocade Communications Systems, Inc.
© 2006 Cisco Systems, Inc. All rights reserved.Cisco Public 1 Version 4.0 Filtering Traffic Using Access Control Lists Introducing Routing and Switching.
Introduction to Mininet, Open vSwitch, and POX
1 Veloxum Corporation © Veloxum ACO solution improves the efficiency and capacity of your environment for both physical and.
Amanda Escobedo Jeanetta Harris Leena Asmar Lucille Reddic Shannon Carroll Research Group Six: Cloud Computing RACKSPACE.
E Virtual Machines Lecture 1 What is Virtualization? Scott Devine VMware, Inc.
Plesk 8 for Linux/UNIX Server Automation SWSOFT GLOBAL HOSTING SUMMIT 2006 Todd L. Crumpler May 30-June 1, 2006.
Intro To Virtualization Mohammed Morsi
Network Virtualization Ben Pfaff Nicira Networks, Inc.
Network customization
Hosted Services Led by Jason Gross, Terrice McClain, & Jen Paulin
Chapter 6: Securing the Cloud
Let's talk about Linux and Virtualization in 'vLAMP'
Infrastructure Orchestration to Optimize Testing
6WIND MWC IPsec Demo Scalable Virtual IPsec Aggregation with DPDK for Road Warriors and Branch Offices Changed original subtitle. Original subtitle:
Building a Virtual Infrastructure
SteelHead 9.2 New Feature & Updates
Logo here Module 3 Microsoft Azure Web App. Logo here Module Overview Introduction to App Service Overview of Web Apps Hosting Web Applications in Azure.
Stress-free IT REMOTE INFRASTRUCTURE MANAGEMENT
Virtualization, Cloud Computing and Big Data
Performance Testing Methodology for Cloud Based Applications
What is Crestron Virtual Control?
Indigo Doyoung Lee Dept. of CSE, POSTECH
Smart Team Making a Beautiful software
Microsoft Virtual Academy
Open vSwitch HW offload over DPDK
Neutron at Scale Justin Hammond - Developer
Managing Services with VMM and App Controller
Partition Starter Find out what disk partitioning is, state key features, find a diagram and give an example.
All or Nothing The Challenge of Hardware Offload
What we have and what we want
Network customization
Openstack Summit November 2017
Presentation transcript:

Managing Open vSwitch Across a Large Heterogeneous Fleet Chad Norgan Systems Engineer BeardyMcBeards in #openvswitch

100 60% 9 Worldwide About Rackspace We Serve Global Footprint Customers in 120+ Countries Annualized Revenue Over $1B 60% 100 OF THE We Serve FORTUNE® OVER 300,000+ Customers ≅70 PB Stored 5,000+ Rackers 9 Worldwide Data Centers Portfolio of Hosted Solutions Dedicated - Cloud - Hybrid www.rackspace.com

Rackspace’s Public Cloud Large Fleet Heterogenous Several different hardware manufacturers Several XenServer major versions (sometimes on varying kernels) Five networking configurations Six production public clouds Six internal private clouds Various non-production environments Tens of thousands of hypervisors Hundreds of thousands of virtual machines 1 - 3+ Interfaces Worth mentioning the # of kernel versions?

Networks Available to Customers IPv4 & IPv6 Publicly Accessible Network Bandwidth Metered Public Net DC-Routable IPv4 IP Access Other Rackspace Products Unmetered Bandwidth Service Net NSX L2 Overlay Network Extendable to dedicated hardware via NSX Gateways Cloud Networks

Our History With OVS Rackspace has used Open vSwitch since the 0.9 version Behind most of First Generation Cloud Servers (Slicehost) Powers 100% of Next Generation Cloud Servers Upgraded OVS nine times since the launch of Next Gen Public Cloud in August 2012

Why We Use OVS Service provider features: Software = Flexible Overlay Networks QoS VLAN Tagging Port Security LACP Software = Flexible Upgrades are easier than hardware

Our Favorite Improvements Save & restore datapath flows during kmod reload OVS 1.7 Logging removed from main loop, faster flow setups OVS 1.9 Collapsed data path & flow-eviction-threshold raised to 2500 OVS 1.10 Megaflows & wildcarding OVS 1.11 Multi-treading! OVS 2.0 flow-limit replaces flow-eviction-threshold & TCP flags OVS 2.1

Example: Busy HV With Syslog Collector

Mission Accomplished! We moved the bottleneck! New bottlenecks: Guest OS kernel configuration Xen Netback/Netfront Driver

Challenges of Upgrading OVS Matching the OVS kernel module to both the running and staged kernel Hypervisor updates often come with a newer kernel We often don’t immediately reboot Running kernel != Kernel at next reboot Detect both kernels and install both sets of OVS kernel modules Heterogeneous Scale

OVS Upgrade Solution Playbook-style upgrades Asynchronous plays with parallel limits Extensible Easy to build validations and pre-checks to prevent unwanted impact We would not be able to achieve the velocity of improvements at our scale without it. It allows us to make very complex changes with confidence.

Architectural Basics VIF PIF Integration Bridge VIF Interface Bridge Tunnel Encapsulation PIF VIF Interface Bridge Patch Port PIF VIF

Ansible + OVS = Flexible Network Rewiring VIF Interface Bridge PIF Patch Port Integration Bridge VIF Tunnel Encap PIF VIF

Ansible + OVS = Flexible Network Rewiring VIF Public Net Bridge Patch Port Interface Bridge PIF VIF Patch Port Integration Bridge VIF Tunnel Encap PIF

Ansible + OVS = Flexible Network Rewiring VIF Public Net Bridge Patch Port Interface Bridge PIF VIF Patch Port Integration Bridge VIF Tunnel Encap PIF

Ansible + OVS = Flexible Network Rewiring VIF Public Net Bridge Interface Bridge Patch Port PIF Patch Port Service Net Bridge VIF Integration Bridge VIF Tunnel Encap

Ansible + OVS = Flexible Network Rewiring VIF Public Net Bridge Interface Bridge Patch Port PIF Patch Port Service Net Bridge VIF Cloud Net Bridge Integration Bridge VIF Patch Port Tunnel Encap

Ansible + OVS = Flexible Network Rewiring Public Net Bridge Patch Port Interface Bridge Public Net Bridge_old Public Net Bridge VIF PIF Patch Port

Measuring OVS – PavlOVS.py Publishes metrics to StatsD/Graphite Per bridge byte, packet, open flow count Datapath hit, missed, lost, flow counts Open vSwitch CPU utilization Instance count Tunnels configured and in fault state

Datapath Flow Count 2000 Eviction Threshold

Datapath Flow Count

Hit, Miss, Lost Hit, Miss, Lost

OVS CPU By Cell OVS CPU

The OVS Of Our Dreams Connection Tracking More (efficient) performance JSON Output from ovs-*ctl commands

QUESTIONS?