Software Defined Networking COMS 6998-10, Fall 2014 Instructor: Li Erran Li 6998-10SDNFall2014/

Slides:



Advertisements
Similar presentations
Key Metrics for Effective Storage Performance and Capacity Reporting.
Advertisements

Capacity Planning in a Virtual Environment
Virtualisation From the Bottom Up From storage to application.
SDN + Storage.
CloudWatcher: Network Security Monitoring Using OpenFlow in Dynamic Cloud Networks or: How to Provide Security Monitoring as a Service in Clouds? Seungwon.
Virtualization and Cloud Computing. Definition Virtualization is the ability to run multiple operating systems on a single physical system and share the.
The Case for Enterprise Ready Virtual Private Clouds Timothy Wood, Alexandre Gerber *, K.K. Ramakrishnan *, Jacobus van der Merwe *, and Prashant Shenoy.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Data Center Storage and Networking Hakim Weatherspoon Assistant Professor, Dept of Computer Science CS 5413: High Performance Systems and Networking December.
Software Defined Networking COMS , Fall 2013 Instructor: Li Erran Li SDNFall2013/
Virtualization and Cloud Computing Virtualization David Bednárek, Jakub Yaghob, Filip Zavoral.
Low-Cost Data Deduplication for Virtual Machine Backup in Cloud Storage Wei Zhang, Tao Yang, Gautham Narayanasamy University of California at Santa Barbara.
Course Name- CSc 8320 Advanced Operating Systems Instructor- Dr. Yanqing Zhang Presented By- Sunny Shakya Latest AOS techniques, applications and future.
Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.
What will my performance be? Resource Advisor for DB admins Dushyanth Narayanan, Paul Barham Microsoft Research, Cambridge Eno Thereska, Anastassia Ailamaki.
SLA-aware Virtual Resource Management for Cloud Infrastructures
An Adaptable Benchmark for MPFS Performance Testing A Master Thesis Presentation Yubing Wang Advisor: Prof. Mark Claypool.
End-to-End Analysis of Distributed Video-on-Demand Systems Padmavathi Mundur, Robert Simon, and Arun K. Sood IEEE Transactions on Multimedia, February.
Virtualizing Mission-Critical Apps 1PM EST, 3/29/2011 Ilya Mirman Philip Thomas.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
ENFORCING PERFORMANCE ISOLATION ACROSS VIRTUAL MACHINES IN XEN Diwaker Gupta, Ludmila Cherkasova, Rob Gardner, Amin Vahdat Middleware '06 Proceedings of.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Virtualization in Data Centers Prashant Shenoy
Software Defined Networking COMS , Fall 2014 Instructor: Li Erran Li SDNFall2014/
1© Copyright 2015 EMC Corporation. All rights reserved. SDN INTELLIGENT NETWORKING IMPLICATIONS FOR END-TO-END INTERNETWORKING Simone Mangiante Senior.
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Matei Ripeanu.
Datacenter Wide-areaEnterprise LOAD-BALANCER Client Servers.
IOFlow: a Software-Defined Storage Architecture
IOFlow: A Software-defined Storage Architecture Eno Thereska, Hitesh Ballani, Greg O’Shea, Thomas Karagiannis, Antony Rowstron, Tom Talpey, Richard Black,
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
Microsoft Virtual Academy Module 4 Creating and Configuring Virtual Machine Networks.
Multi-level Selective Deduplication for VM Snapshots in Cloud Storage Wei Zhang*, Hong Tang †, Hao Jiang †, Tao Yang*, Xiaogang Li †, Yue Zeng † * University.
Elad Hayun Agenda What's New in Hyper-V 2012 Storage Improvements Networking Improvements VM Mobility Improvements.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Black-box and Gray-box Strategies for Virtual Machine Migration Timothy Wood, Prashant.
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Advisor: Professor.
A Workflow-Aware Storage System Emalayan Vairavanathan 1 Samer Al-Kiswany, Lauro Beltrão Costa, Zhao Zhang, Daniel S. Katz, Michael Wilde, Matei Ripeanu.
Yury Kissin Infrastructure Consultant Storage improvements Dynamic Memory Hyper-V Replica VM Mobility New and Improved Networking Capabilities.
Virtual Machine Hosting for Networked Clusters: Building the Foundations for “Autonomic” Orchestration Based on paper by Laura Grit, David Irwin, Aydan.
How to Resolve Bottlenecks and Optimize your Virtual Environment Chris Chesley, Sr. Systems Engineer
The Center for Autonomic Computing is supported by the National Science Foundation under Grant No NSF CAC Seminannual Meeting, October 5 & 6,
Storage Management in Virtualized Cloud Environments Sankaran Sivathanu, Ling Liu, Mei Yiduo and Xing Pu Student Workshop on Frontiers of Cloud Computing,
COMS E Cloud Computing and Data Center Networking Sambit Sahu
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Profiling and Modeling Resource Usage.
MClock: Handling Throughput Variability for Hypervisor IO Scheduling in USENIX conference on Operating Systems Design and Implementation (OSDI ) 2010.
Swapping to Remote Memory over InfiniBand: An Approach using a High Performance Network Block Device Shuang LiangRanjit NoronhaDhabaleswar K. Panda IEEE.
DBI313. MetricOLTPDWLog Read/Write mixMostly reads, smaller # of rows at a time Scan intensive, large portions of data at a time, bulk loading Mostly.
Constructing Services with Interposable Virtual Hardware Author: Andrew Whitaker, Richard S. Cox, Marianne Shaw, and Steven D. Gribble Presenter: Huajing.
Vic Liu Liang Xia Zu Qiang Speaker: Vic Liu China Mobile Network as a Service Architecture draft-liu-nvo3-naas-arch-01.
Windows Server 2012 Hyper-V Networking
A Utility-based Approach to Scheduling Multimedia Streams in P2P Systems Fang Chen Computer Science Dept. University of California, Riverside
Aaron Gember, Theophilus Benson, Aditya Akella University of Wisconsin-Madison.
Consolidation and Optimization Best Practices: SQL Server 2008 and Hyper-V Dandy Weyn | Microsoft Corp. Antwerp, March
Hyper-V Performance, Scale & Architecture Changes Benjamin Armstrong Senior Program Manager Lead Microsoft Corporation VIR413.
VMware vSphere Configuration and Management v6
SOFTWARE DEFINED NETWORKING/OPENFLOW: A PATH TO PROGRAMMABLE NETWORKS April 23, 2012 © Brocade Communications Systems, Inc.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
An End-to-End Service Architecture r Provide assured service, premium service, and best effort service (RFC 2638) Assured service: provide reliable service.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
An End-to-End Service Architecture r Provide assured service, premium service, and best effort service (RFC 2638) Assured service: provide reliable service.
Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
Experiences with VI Communication for Database Storage Yuanyuan Zhou, Angelos Bilas, Suresh Jagannathan, Cezary Dubnicki, Jammes F. Philbin, Kai Li.
sRoute: Treating the Storage Stack Like a Network
Software Defined Networking BY RAVI NAMBOORI. Overview  Origins of SDN.  What is SDN ?  Original Definition of SDN.  What = Why We need SDN ?  Conclusion.
Microsoft Virtual Academy
Networking Devices.
CS 31006: Computer Networks – The Routers
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Building a Database on S3
Specialized Cloud Architectures
Towards Predictable Datacenter Networks
Presentation transcript:

Software Defined Networking COMS , Fall 2014 Instructor: Li Erran Li SDNFall2014/ 12/8/2014 SDN Storage

Outline Reminder: Course Evaluation Due today! SDN Storage 12/8/14 Software Defined Networking (COMS ) 2

Background: Enterprise data centers General purpose applications Application runs on several VMs Separate network for VM-to-VM traffic and VM-to-Storage traffic Storage is virtualized Resources are shared Switch S-NIC NIC S-NICNIC VM Virtual Machine vDisk VM Virtual Machine vDisk 2 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Motivation It is hard to provide such SLAs today Want: predictable application behaviour and performance Need system to provide end-to-end SLAs, e.g., Guaranteed storage bandwidth B Guaranteed high IOPS and priority Per-application control over decisions along IOs’ path 4 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Switch S-NIC NIC S-NICNIC VM Virtual Machine vDisk VM Virtual Machine vDisk Example: guarantee aggregate bandwidth B for Red tenant App OS App OS … 5 Deep IO path with 18+ different layers that are configured and operate independently and do not understand SLAs Software Defined Networking (COMS )

Challenges in enforcing end-to-end SLAs No storage control plane No enforcing mechanism along storage data plane Aggregate performance SLAs - Across VMs, files and storage operations Want non-performance SLAs: control over IOs’ path Want to support unmodified applications and VMs 6 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

… IOFlow architecture App OS App OS Controller High-level SLA 7 IOFlow API Decouples the data plane (enforcement) from the control plane (policy logic) Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Contributions Defined and built storage control plane Controllable queues in data plane Interface between control and data plane (IOFlow API) Built centralized control applications that demonstrate power of architecture 8 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Storage flows Storage “Flow” refers to all IO requests to which an SLA applies ---> SLA Aggregate, per-operation and per-file SLAs, e.g., ---> high priority ---> min 100,000 IOPS Non-performance SLAs, e.g., path routing ---> bypass malware scanner 9 source set destination sets Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

IOFlow API: programming data plane queues 1. Classification [IO Header -> Queue] 2. Queue servicing [Queue -> ] 3. Routing [Queue -> Next-hop] Malware scanner 10 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Lack of common IO Header for storage traffic SLA: --> Bandwidth B 11 Block device Z: (/device/scsi1) Server and VHD \\serverX\AB79.vhd Volume and file H:\AB79.vhd Block device /device/ssd5 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Flow name resolution through controller SLA: {VM 4, *, *, //share/dataset} --> Bandwidth B Controller SMBc exposes IO Header it understands: Queuing rule (per-file handle): --> Q1 Q1.token rate --> B 12 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Rate limiting for congestion control Queue servicing [Queue -> ] Important for performance SLAs Today: no storage congestion control Challenging for storage: e.g., how to rate limit two VMs, one reading, one writing to get equal storage bandwidth? 13 IOs tokens Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Rate limiting on payload bytes does not work 14 VM 8KB Writes 8KB Reads Source: Thereska, et al, MSR 12/8/14

Rate limiting on bytes does not work 15 VM 8KB Writes 8KB Reads Source: Thereska, et al, MSR 12/8/14

Rate limiting on IOPS does not work 16 VM 8KB Writes 64KB Reads Need to rate limit based on cost Source: Thereska, et al, MSR 12/8/14

Rate limiting based on cost  Controller constructs empirical cost models based on device type and workload characteristics  RAM, SSDs, disks: read/write ratio, request size  Cost models assigned to each queue  ConfigureTokenBucket [Queue -> cost model]  Large request sizes split for pre-emption 17 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Recap: Programmable queues on data plane  Classification [IO Header -> Queue]  Per-layer metadata exposed to controller  Controller out of critical path  Queue servicing [Queue -> ]  Congestion control based on operation cost  Routing [Queue -> Next-hop] How does controller enforce SLA? 18 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Distributed, dynamic enforcement SLA needs per-VM enforcement Need to control the aggregate rate of VMs 1-4 that reside on different physical machines Static partitioning of bandwidth is sub-optimal --> Bandwidth 40 Gbps 19 VM 40Gbps Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Work-conserving solution VMs with traffic demand should be able to send it as long as the aggregate rate does not exceed 40 Gbps Solution: Max-min fair sharing 20 VM Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Max-min fair sharing Well studied problem in networks  Existing solutions are distributed  Each VM varies its rate based on congestion  Converge to max-min sharing  Drawbacks: complex and requires congestion signal But we have a centralized controller  Converts to simple algorithm at controller 21 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Controller-based max-min fair sharing What does controller do? Infers VM demands Uses centralized max-min within a tenant and across tenants Sets VM token rates Chooses best place to enforce Controller 22 INPUT: per-VM demands OUTPUT: per-VM allocated token rate t s t = control interval s = stats sampling interval Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Controller decides where to enforce 23 SLA constraints  Queues where resources shared  Bandwidth enforced close to source  Priority enforced end-to-end Efficiency considerations  Overhead in data plane ~ # queues  Important at 40+ Gbps Minimize # times IO is queued and distribute rate limiting load VM Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Centralized vs. decentralized control Centralized controller in SDS allows for simple algorithms that focus on SLA enforcement and not on distributed system challenges Analogous to benefits of centralized control in software- defined networking (SDN) 24 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

IOFlow implementation Controller 25 2 key layers for VM-to-Storage performance SLAs 4 other layers. Scanner driver (routing). User-level (routing). Network driver. Guest OS file system Implemented as filter drivers on top of layers Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Evaluation map IOFlow’s ability to enforce end-to-end SLAs Aggregate bandwidth SLAs Priority SLAs and routing application in paper Performance of data and control planes 26 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Evaluation setup 27 VM Switch VM … Clients:10 hypervisor servers, 12 VMs each 4 tenants (Red, Green, Yellow, Blue) 30 VMs/tenant, 3 VMs/tenant/server Storage network: Mellanox 40Gbps RDMA RoCE full-duplex 1 storage server: 16 CPUs, 2.4GHz (Dell R720) SMB 3.0 file server protocol 3 types of backend: RAM, SSDs, Disks Controller: 1 separate server 1 sec control interval (configurable) Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Workloads 4 Hotmail tenants {Index, Data, Message, Log} Used for trace replay on SSDs (see paper) IoMeter is parametrized with Hotmail tenant characteristics (read/write ratio, request size) 28 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Enforcing bandwidth SLAs 4 tenants with different storage bandwidth SLAs Tenants have different workloads  Red tenant is aggressive: generates more requests/second Tena nt SLA Red{VM1 – 30} -> Min 800 MB/s Gree n {VM31 – 60} -> Min 800 MB/s Yello w {VM61 – 90} -> Min 2500 MB/s Blue{VM91 – 120} -> Min 1500 MB/s 2912/8/14

Things to look for Distributed enforcement across 4 competing tenants  Aggressive tenant(s) under control Dynamic inter-tenant work conservation  Bandwidth released by idle tenant given to active tenants Dynamic intra-tenant work conservation  Bandwidth of tenant’s idle VMs given to its active VMs 30 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Results Controller notices red tenant’s performance Tenants’ SLAs enforced. 120 queues cfg. 31 Inter-tenant work conservation Intra-tenant work conservation Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Data plane overheads at 40Gbps RDMA Negligible in previous experiment. To bring out worst case varied IO sizes from 512Bytes to 64KB 32 Reasonable overheads for enforcing SLAs

Control plane overheads: network and CPU 33 Overheads (MB) <0.3% CPU overhead at controller Controller configures queue rules, receives statistics and updates token rates every interval Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

Summary of contributions Defined and built storage control plane Controllable queues in data plane Interface between control and data plane (IOFlow API) Built centralized control applications that demonstrate power of architecture Ongoing work: applying to public cloud scenarios 34 Software Defined Networking (COMS ) Source: Thereska, et al, MSR 12/8/14

What is Policy-based Management? VMWare’s SDS Framework Simple policies to specify app SLA requirements Automation of storage provisioning and VM placement across clusters Works for any protocol : block, file and object SLA Compliance monitoring & automatic remediation Key Features Drastically simplify storage provisioning Management of different storage tiers as one Reduce storage cost by optimizing consumption Software Defined Storage Software Defined Storage Enterprise SAN/ NAS Virtual SAN BLOB Storage Availability = 99.99% DR RTO = 1 hour Max Laten SLA Definitions Availability = 99.99% DR RTO = 1 hour Back up = daily Storage capacity = 1 TB Performance = High I/O Security = High Availability = 99.99% DR RTO = 1 hour Max Laten SLA Definitions Availability = 99% DR RTO = 4 hour Back up = weekly Storage capacity = 10 TB Performance = High I/O Security = High Customer Benefits

Questions? 12/8/14 Software Defined Networking (COMS ) 36